Text Classification Algorithms: A Survey
Kowsari, Kamran, Meimandi, Kiana Jafari, Heidarysafa, Mojtaba, Mendu, Sanjana, Barnes, Laura E., Brown, Donald E.
–arXiv.org Artificial Intelligence
In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is discussed. This overview covers different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods. Finally, the limitations of each technique and their application in the real-world problem are discussed.
arXiv.org Artificial Intelligence
Apr-25-2019
- Country:
- Africa > Middle East
- Tunisia > Tunis Governorate > Tunis (0.04)
- Asia
- China
- India
- Karnataka > Bengaluru (0.04)
- Tamil Nadu > Chennai (0.04)
- Indonesia (0.04)
- Malaysia (0.04)
- Middle East
- Iran > Arabian Gulf (0.14)
- Israel > Haifa District
- Haifa (0.04)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Saudi Arabia > Arabian Gulf (0.04)
- Singapore (0.04)
- South Korea (0.04)
- Europe
- Switzerland > Basel-City
- Basel (0.04)
- Greece
- Attica > Athens (0.04)
- Central Macedonia > Thessaloniki (0.04)
- United Kingdom > England
- Berkshire > Reading (0.04)
- Cambridgeshire > Cambridge (0.14)
- Greater London > London (0.04)
- Serbia > Vojvodina
- South Bačka District > Novi Sad (0.04)
- Spain (0.04)
- Germany
- Baden-Württemberg > Karlsruhe Region
- Heidelberg (0.05)
- Berlin (0.04)
- Hamburg (0.04)
- Baden-Württemberg > Karlsruhe Region
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Switzerland > Basel-City
- Indian Ocean > Arabian Gulf (0.04)
- North America
- Barbados (0.04)
- Canada > Quebec
- Montreal (0.04)
- Cuba (0.04)
- Mexico > Quintana Roo
- Cancún (0.04)
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- New Jersey > Hudson County
- Hoboken (0.04)
- California
- Alameda County > Berkeley (0.14)
- San Diego County
- San Francisco County > San Francisco (0.14)
- Santa Clara County > San Jose (0.04)
- Massachusetts
- Middlesex County
- Burlington (0.04)
- Cambridge (0.04)
- Lowell (0.14)
- Suffolk County > Boston (0.04)
- Middlesex County
- District of Columbia > Washington (0.04)
- Washington > King County
- Bellevue (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Virginia > Albemarle County
- Charlottesville (0.14)
- Maryland
- Baltimore (0.04)
- Baltimore County (0.04)
- New York > Tompkins County
- Ithaca (0.04)
- Rhode Island > Providence County
- Providence (0.04)
- Arizona
- Maricopa County > Scottsdale (0.04)
- Pima County > Tucson (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Wisconsin
- Dane County > Madison (0.04)
- Milwaukee County > Milwaukee (0.04)
- Texas
- Lavaca County (0.04)
- Travis County > Austin (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Florida
- Orange County > Orlando (0.04)
- Polk County > Lakeland (0.04)
- Pennsylvania > Allegheny County
- South America > Chile
- Africa > Middle East
- Genre:
- Instructional Material > Course Syllabus & Notes (0.45)
- Overview (1.00)
- Research Report
- Experimental Study (0.68)
- New Finding (0.46)
- Promising Solution (0.46)
- Industry:
- Education > Educational Setting
- Online (0.45)
- Government > Regional Government
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area (1.00)
- Information Technology (1.00)
- Law (1.00)
- Transportation (0.67)
- Education > Educational Setting
- Technology: