AITopics | quanteda

Collaborating Authors

quanteda

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The R package sentometrics to compute, aggregate and predict with textual sentiment

Ardia, David, Bluteau, Keven, Borms, Samuel, Boudt, Kris

arXiv.org Machine LearningOct-20-2021

We provide a hands-on introduction to optimized textual sentiment indexation using the R package sentometrics. Textual sentiment analysis is increasingly used to unlock the potential information value of textual data. The sentometrics package implements an intuitive framework to efficiently compute sentiment scores of numerous texts, to aggregate the scores into multiple time series, and to use these time series to predict other variables. The workflow of the package is illustrated with a built-in corpus of news articles from two major U.S. journals to forecast the CBOE Volatility Index.

corpus, lexicon, sentiment, (16 more...)

arXiv.org Machine Learning

doi: 10.18637/jss.v099.i02

2110.10817

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > Quebec > Montreal (0.04)
Europe > Belgium (0.04)
(7 more...)

Genre: Workflow (0.89)

Industry:

Banking & Finance > Trading (1.00)
Banking & Finance > Economy (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.66)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.48)

Add feedback

Advancing Text Mining with R and quanteda

#artificialintelligenceNov-12-2019, 16:07:47 GMT

The data that we usually use for text analysis is available in text formats (e.g., .txt After reading in the data, we need to generate a corpus. A corpus is a type of dataset that is used in text analysis. It contains "a collection of text or speech material that has been brought together according to a certain set of predetermined criteria" (Shmelova et al. 2019, p. 33). These criteria are usually set by the researchers and are in concordance with the guiding question.

corpus, data frequency matrix, quanteda, (10 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Data Science > Data Mining > Text Mining (0.40)

Add feedback

Data Science Tallahassee

#artificialintelligenceMay-17-2017, 14:40:25 GMT

Dr. Mark Jack is an experienced Data Scientist and Associate Professor of Physics at Florida A&M University with several years of experience in computational modeling in particle physics, neuroscience, nanoscience and high-performance computing. He is a certified trainer in machine learning and statistical programming in R. He has spoken in several Data Science conferences which includes the Global Big Data Conferences in Tampa, Fl and Atlanta, GA. The creation of a corpus of documents from three text data files mostly relies on the use of the library'quanteda' in R. It allows to quickly tokenize the corpus of documents to remove text features such as punctuation, numbers, white space, lowercase words etc. The processing time for the complete text data is considerable.

artificial intelligence, corpus, natural language, (13 more...)

#artificialintelligence

Country:

North America > United States > Florida > Leon County > Tallahassee (0.40)
North America > United States > Georgia > Fulton County > Atlanta (0.26)
North America > United States > Florida > Hillsborough County > Tampa (0.26)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.33)

Add feedback