PyThaiNLP: Thai Natural Language Processing in Python
Phatthiyaphaibun, Wannaphong, Chaovavanich, Korakot, Polpanumas, Charin, Suriyawongkul, Arthit, Lowphansirikul, Lalita, Chormai, Pattarawat, Limkonchotiwat, Peerat, Suntorntip, Thanathip, Udomcharoenchaikit, Can
–arXiv.org Artificial Intelligence
We present PyThaiNLP, a free and open-source natural language processing (NLP) library for Thai language implemented in Python. It provides a wide range of software, models, and datasets for Thai language. We first provide a brief historical context of tools for Thai language prior to the development of PyThaiNLP. We then outline the functionalities it provided as well as datasets and pre-trained language models. We later summarize its development milestones and discuss our experience during its development. We conclude by demonstrating how industrial and research communities utilize PyThaiNLP in their work. The library is freely available at https://github.com/pythainlp/pythainlp.
arXiv.org Artificial Intelligence
Dec-7-2023
- Country:
- Asia
- China (0.04)
- Japan (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Nepal > Bagmati Province
- Kathmandu District > Kathmandu (0.04)
- Singapore (0.04)
- Thailand
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Belgium > Brussels-Capital Region
- North America > United States
- California > Santa Clara County
- San Jose (0.04)
- New York > New York County
- New York City (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- California > Santa Clara County
- Oceania
- Australia > Victoria
- Melbourne (0.04)
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- Australia > Victoria
- Asia
- Genre:
- Research Report (0.40)
- Industry:
- Banking & Finance (0.68)
- Education (0.46)
- Information Technology (0.68)
- Technology: