Taxonomic survey of Hindi Language NLP systems
Desai, Nikita P., Prof., null, Dabhi, Vipul K.
–arXiv.org Artificial Intelligence
The field of Natural language processing can be formally defined as - "A theoretically motivated range of computational techniques for analyzing and representing naturally occurring texts at one or more levels of linguistic analysis for the purpose of achieving human-like language processing for a range of tasks or applications"[69]. The naturally occurring text can be in written or spoken form.A wide array of domains contribute to NLP development like linguistics, computer science and psychology.The linguistics field helps to understand the formal structure of language while computer science domain helps to find efficient internal representations and data structures.The study of "Psychology" can be useful to understand the methodology used by humans for dealing with languages. NLP can be considered to be having two distinct focus namely (1)Natural Language Generation(NLG) and (2)Natural Language Understanding(NLU). The NLG deals with planning to use the representation of language to decide what should be generated at each point in interaction, while NLU needs to analyze language and decide which is best way to represent it meaningfully.We, in this survey paper, concentrate on area of NLU for written text.Hence the NLP henceforth might be considered as NLU and vice versa. Motivation for designing Indian NLP systems Hindi and English are the official languages in central government of India(GOI). Indian community faces a "Digital Divide" due to dominance of English as mode of communication in higher education, judiciary, corporate sector and Public administration at Central level whereas the government in states work in their respective regional languages [67].The expansion of Internet has inter-connected the socioeconomic environment of the world and redefined the concept of global culture.As per a report in 2017 by the companies kpmg and Google
arXiv.org Artificial Intelligence
Jan-30-2021
- Country:
- North America > United States
- Maine (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Asia
- North America > United States
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.45)
- Industry:
- Government > Regional Government > Asia Government > India Government (0.54)
- Technology:
- Information Technology > Artificial Intelligence
- Representation & Reasoning
- Expert Systems (0.93)
- Ontologies (0.93)
- Rule-Based Reasoning (0.69)
- Natural Language
- Text Processing (1.00)
- Machine Translation (1.00)
- Information Retrieval (1.00)
- Grammars & Parsing (1.00)
- Information Extraction (0.67)
- Discourse & Dialogue (0.67)
- Machine Learning
- Statistical Learning (1.00)
- Learning Graphical Models (0.93)
- Neural Networks (0.93)
- Performance Analysis > Accuracy (0.67)
- Representation & Reasoning
- Information Technology > Artificial Intelligence