Embedded Topic Models Enhanced by Wikification

Shibuya, Takashi, Utsuro, Takehito

arXiv.org Artificial Intelligence

Topic modeling analyzes a collection of documents to learn meaningful patterns of words. However, previous topic models consider only the spelling of words and do not account for homographs. In this study, we incorporate Wikipedia knowledge into a neural topic model to make it aware of named entities. We evaluate our method on two datasets: 1) news articles from the New York Times and 2) the AIDA-CoNLL dataset. Our experiments show that our method improves the generalizability of neural topic models. Moreover, we analyze the frequent terms in each topic and the temporal dependencies between topics to demonstrate that our entity-aware topic models capture the time-series development of topics well.
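As a rough illustration of the wikification step this abstract describes (not the authors' code), entity mentions can be replaced with canonical Wikipedia titles before topic modeling, so that homographs such as "Apple" the company and "apple" the fruit become distinct tokens. The mention-to-entity mapping below is invented for illustration.

```python
# Sketch: replace surface mentions with linked Wikipedia entity IDs
# ("wikification") so a downstream topic model sees disambiguated tokens.

def wikify(tokens, mention_to_entity):
    """Map each token to its linked Wikipedia entity when one is known."""
    return [mention_to_entity.get(tok, tok) for tok in tokens]

# Hypothetical entity links for illustration only.
mention_to_entity = {
    "Apple": "Apple_Inc.",
    "apple": "Apple_(fruit)",
}

doc = ["Apple", "released", "a", "new", "phone"]
print(wikify(doc, mention_to_entity))
```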


An Algorithm for Generating Gap-Fill Multiple Choice Questions of an Expert System

Sirithumgul, Pornpat, Prasertsilp, Pimpaka, Olfman, Lorne

arXiv.org Artificial Intelligence

This research proposes an artificial intelligence algorithm combining an ontology-based design, text mining, and natural language processing to automatically generate gap-fill multiple choice questions (MCQs). A simulation demonstrated the algorithm by generating gap-fill MCQs about software testing. The results revealed that, from 103 online documents as input, the algorithm automatically produced more than 16,000 valid gap-fill MCQs covering a variety of topics in the software testing domain. Finally, in the discussion section of this paper we suggest how the proposed algorithm could be applied to populate a question pool for a knowledge expert system.
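The core of gap-fill MCQ generation can be sketched in a few lines: blank out a key term in a source sentence and mix it with distractor options. This is a generic sketch, not the paper's ontology-driven pipeline; the example sentence and distractors are assumptions.

```python
import random

def make_gap_fill(sentence, key_term, distractors):
    """Blank out key_term in the sentence and shuffle the answer options."""
    stem = sentence.replace(key_term, "_____")
    options = [key_term] + list(distractors)
    random.shuffle(options)
    return {"stem": stem, "options": options, "answer": key_term}

q = make_gap_fill(
    "Regression testing re-runs existing tests after a code change.",
    "Regression",
    ["Unit", "Integration", "Acceptance"],  # assumed distractors
)
print(q["stem"])
```

In the paper's setting, the key term and distractors would instead come from the ontology and text-mining components, which is what keeps the generated questions valid at scale.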


Predefined Sparseness in Recurrent Sequence Models

Demeester, Thomas, Deleu, Johannes, Godin, Fréderic, Develder, Chris

arXiv.org Artificial Intelligence

Inducing sparseness while training neural networks has been shown to yield models with a lower memory footprint but similar effectiveness to dense models. However, sparseness is typically induced starting from a dense model, so this advantage does not hold during training. We propose techniques to enforce sparseness upfront in recurrent sequence models for NLP applications, so that training also benefits. First, in language modeling, we show how to increase hidden state sizes in recurrent layers without increasing the number of parameters, leading to more expressive models. Second, for sequence labeling, we show that word embeddings with predefined sparseness achieve performance similar to dense embeddings, at a fraction of the number of trainable parameters.
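A minimal sketch of the predefined-sparseness idea for embeddings (not the authors' implementation): each word is assigned a fixed subset of k active dimensions up front, so it stores only k trainable values instead of d, and the dense vector is reconstructed on demand. The block-cyclic assignment pattern below is an assumption for illustration.

```python
# Predefined sparseness sketch: k trainable values per word instead of d.

def sparse_embedding_slots(vocab_size, dim, k):
    """Assign each word k fixed dimensions in a block-cyclic pattern."""
    slots = {}
    for w in range(vocab_size):
        start = (w * k) % dim
        slots[w] = [(start + i) % dim for i in range(k)]
    return slots

def embed(word_id, values, slots, dim):
    """Expand a word's k trainable values into a dense dim-sized vector."""
    vec = [0.0] * dim
    for pos, v in zip(slots[word_id], values):
        vec[pos] = v
    return vec

slots = sparse_embedding_slots(vocab_size=4, dim=8, k=2)
vec = embed(1, [0.5, -0.3], slots, dim=8)
print(sum(1 for x in vec if x != 0.0))  # 2 nonzero entries out of 8
```

Because the sparsity pattern is fixed before training, only the k values per word are ever updated, which is where the parameter savings during training come from.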


DreamNLP: Novel NLP System for Clinical Report Metadata Extraction using Count Sketch Data Streaming Algorithm: Preliminary Results

Choi, Sanghyun, Ivkin, Nikita, Braverman, Vladimir, Jacobs, Michael A.

arXiv.org Machine Learning

Extracting information from electronic health records (EHRs) is a challenging task, since it requires prior knowledge of the reports and a natural language processing (NLP) algorithm. With the growing number of EHR implementations, such knowledge is increasingly difficult to obtain efficiently. We address this challenge by proposing a novel methodology for analyzing large sets of EHRs using a modified Count Sketch data streaming algorithm, termed DreamNLP. Using DreamNLP, we generate a dictionary of frequently occurring terms, or heavy hitters, in the EHRs using far less computational memory than the conventional counting approaches other NLP programs use. We demonstrate the extraction of the most important breast diagnosis features from the EHRs of a set of patients who underwent breast imaging. Based on this analysis, extracting these terms would be useful for defining features for downstream tasks such as machine learning for precision medicine.
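For readers unfamiliar with the underlying data structure, here is a textbook Count Sketch (not the DreamNLP modification): term counts are folded into a small table via hashed rows with random signs, and a frequency is estimated as the median across rows. Memory stays fixed at depth × width regardless of vocabulary size, which is what makes heavy-hitter extraction over large EHR corpora cheap. The example terms are invented.

```python
import hashlib

class CountSketch:
    """Minimal Count Sketch: estimate term frequencies in fixed memory."""

    def __init__(self, depth=5, width=256):
        self.depth, self.width = depth, width
        self.table = [[0] * width for _ in range(depth)]

    def _hashes(self, item):
        # Derive a (column, sign) pair per row from a deterministic hash.
        for row in range(self.depth):
            h = hashlib.md5(f"{row}:{item}".encode()).digest()
            col = int.from_bytes(h[:4], "big") % self.width
            sign = 1 if h[4] % 2 == 0 else -1
            yield row, col, sign

    def add(self, item):
        for row, col, sign in self._hashes(item):
            self.table[row][col] += sign

    def estimate(self, item):
        vals = sorted(sign * self.table[row][col]
                      for row, col, sign in self._hashes(item))
        return vals[len(vals) // 2]  # median across rows

sketch = CountSketch()
for term in ["mass"] * 50 + ["benign"] * 3:
    sketch.add(term)
print(sketch.estimate("mass"))  # close to the true count of 50
```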


Clouds, clouds, and more clouds

#artificialintelligence

There are at least eleven kinds of clouds: cirrus, cirrocumulus, cirrostratus, altocumulus, altostratus, cumulonimbus, cumulus, nimbostratus, stratocumulus, small Cu, and stratus. But this article is not about those kinds of clouds. Of course there are other kinds of clouds, like iCloud, Google Cloud, Azure Cloud, Amazon Cloud, and the list goes on. But this article is not about those clouds either. This article is about text analytics.


Training a CNN with the same data but different labels • /r/MachineLearning

#artificialintelligence

I apologize for the ambiguous title; it was difficult to condense my question into one sentence. I have a large dataset of paintings and corresponding class labels generated from their medium. I'm not interested in the output class, but rather in the 9216-dimensional feature vector from the Pool5 layer of the network (I'm using AlexNet). When I generate the class labels from the metadata associated with each painting, I use the least frequent term, as it tends to be more telling. For example, a painting's medium metadata may be "Oil and Chalk on Paper"; currently, the least frequent term would be applied as the target label, in this case "Oil".
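The labeling heuristic described in the post can be sketched as follows: count medium terms across all paintings, then label each painting with its globally least frequent term. The stopword set and example media strings are assumptions for illustration, not the poster's actual data.

```python
from collections import Counter

STOP = {"and", "on", "of"}  # assumed stopwords to filter out

def label_by_rarest_term(media):
    """Label each medium string with its least globally frequent term."""
    term_lists = [[t for t in m.lower().split() if t not in STOP]
                  for m in media]
    counts = Counter(t for terms in term_lists for t in terms)
    return [min(terms, key=lambda t: counts[t]) for terms in term_lists]

media = ["Oil and Chalk on Paper", "Oil on Canvas", "Chalk on Paper"]
print(label_by_rarest_term(media))
```

Note that ties are broken by term order within each string, so a painting whose terms are all equally frequent simply keeps its first term as the label.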

