AITopics | octis

Collaborating Authors

octis

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Improving Contextualized Topic Models with Negative Sampling

Adhya, Suman, Lahiri, Avishek, Sanyal, Debarshi Kumar, Das, Partha Pratim

arXiv.org Artificial IntelligenceMar-27-2023

Topic modeling has emerged as a dominant method for exploring large document collections. Recent approaches to topic modeling use large contextualized language models and variational autoencoders. In this paper, we propose a negative sampling mechanism for a contextualized topic model to improve the quality of the generated topics. In particular, during model training, we perturb the generated document-topic vector and use a triplet loss to encourage the document reconstructed from the correct document-topic vector to be similar to the input document and dissimilar to the document reconstructed from the perturbed vector. Experiments for different topic counts on three publicly available benchmark datasets show that in most cases, our approach leads to an increase in topic coherence over that of the baselines. Our model also achieves very high topic diversity.

machine learning, natural language, topic model, (19 more...)

arXiv.org Artificial Intelligence

2303.14951

Country:

Asia > Middle East > Jordan (0.04)
Asia > India > West Bengal > Kolkata (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.77)

Add feedback

A beginner's guide to OCTIS: Optimizing and Comparing Topic Models Is Simple

#artificialintelligenceSep-4-2021, 16:28:14 GMT

Topic models are promising generative statistical methods that aim to extract the hidden topics underlying a collection of documents. Typically, topic models have two matrices as output. Then, the top-n words from this matrix with the highest probability are then used to represent a topic. The most popular topic modeling method is Latent Dirichlet Allocation, and many articles are written about its workings and implementations. However, focusing on LDA only is restrictive and might be suboptimal for a given corpus.

octis, probability, topic model, (10 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Add feedback