An Efficient Classification Model for Cyber Text
Hossen, Md Sakhawat, Borshon, Md. Zashid Iqbal, Badrudduza, A. S. M.
–arXiv.org Artificial Intelligence
The uprising of deep learning methodology and practice in recent years has brought about a severe consequence of increasing carbon footprint due to the insatiable demand for computational resources and power. The field of text analytics also experienced a massive transformation in this trend of monopolizing methodology. In this paper, the original TF-IDF algorithm has been modified, and Clement Term Frequency-Inverse Document Frequency (CTF-IDF) has been proposed for data preprocessing. This paper primarily discusses the effectiveness of classical machine learning techniques in text analytics with CTF-IDF and a faster IRLBA algorithm for dimensionality reduction. The introduction of both of these techniques in the conventional text analytics pipeline ensures a more efficient, faster, and less computationally intensive application when compared with deep learning methodology regarding carbon footprint, with minor compromise in accuracy. The experimental results also exhibit a manifold of reduction in time complexity and improvement of model accuracy for the classical machine learning methods discussed further in this paper.
arXiv.org Artificial Intelligence
Nov-6-2025
- Country:
- Asia
- Bangladesh (0.04)
- Middle East > Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (0.49)
- Technology: