AITopics | oov

Collaborating Authors

oov

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Supplementary AViT 3B model

Neural Information Processing SystemsApr-25-2026, 06:33:50 GMT

The ViT model we use in this work is based on a standard Vision Transformer [7] model scaled to577 nearly 3 billion parameters, using a patch size of 14, 16 heads, 64 blocks, an MLP dimension of 8192578 and a hidden dimension of 2048. The model is defined and trained in Lingvo [32]; we additionally579 employ GSPMD [41] for training. The model is pre-trained on JFT-3B [35] using training settings580 that optimize for performance on JFT-3B rather than for fine-tuning on ImageNet; notably, we do not581 use the training recipe that helps few-shot transfer performance [44]. BReview tools586 We include screenshots of the reviewing tools we built to analyze model mistakes. Figure 3 shows587 the UI for reviewing model predictions and Figure 4 shows the UI that displays the labeling guide588 and slide bar to browse images for a particular class.

artificial intelligence, machine learning, pred, (12 more...)

Neural Information Processing Systems

Country: North America (0.14)

Industry: Transportation > Ground (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Automated Classification of Model Errors on ImageNet

Neural Information Processing SystemsFeb-14-2026, 12:42:05 GMT

While the ImageNet dataset has been driving computer vision research over the past decade, significant label noise and ambiguity have made top-1 accuracy an insufficient measure of further progress.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > United Kingdom > England > Staffordshire (0.04)
Asia > China (0.04)
(6 more...)

Genre: Research Report (0.68)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Leisure & Entertainment > Sports (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

2cd5737c59645f7ef23b2842b705edf2-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 02:26:56 GMT

context gt, pred, training image gt, (10 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
North America > United States > Maryland (0.04)
North America > Canada > Newfoundland and Labrador > Newfoundland (0.04)
(3 more...)

Industry: Transportation > Ground (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Fine-TuningOut-of-VocabularyItem RecommendationwithUserSequenceImagination

Neural Information Processing SystemsFeb-8-2026, 01:46:17 GMT

Tobridge thegap,wepropose a novelUserSequenceIMagination (USIM)fine-tuning framework, which first imagines the user sequences and then refines the generated OOV embeddings with the user behavioral embeddings.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre: Research Report (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Add feedback

7480ed13740773505262791131c12b89-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 22:02:32 GMT

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > United Kingdom > England > Staffordshire (0.04)
Asia > China (0.04)
(6 more...)

Genre: Research Report (0.68)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Leisure & Entertainment > Sports (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

Automated Classification of Model Errors on ImageNet

Peychev, Momchil, Müller, Mark Niklas, Fischer, Marc, Vechev, Martin

arXiv.org Artificial IntelligenceNov-13-2023

While the ImageNet dataset has been driving computer vision research over the past decade, significant label noise and ambiguity have made top-1 accuracy an insufficient measure of further progress. To address this, new label-sets and evaluation protocols have been proposed for ImageNet showing that state-of-the-art models already achieve over 95% accuracy and shifting the focus on investigating why the remaining errors persist. Recent work in this direction employed a panel of experts to manually categorize all remaining classification errors for two selected models. However, this process is time-consuming, prone to inconsistencies, and requires trained experts, making it unsuitable for regular model evaluation thus limiting its utility. To overcome these limitations, we propose the first automated error classification framework, a valuable tool to study how modeling choices affect error distributions. We use our framework to comprehensively evaluate the error distribution of over 900 models. Perhaps surprisingly, we find that across model architectures, scales, and pre-training corpora, top-1 accuracy is a strong predictor for the portion of all error types. In particular, we observe that the portion of severe errors drops significantly with top-1 accuracy indicating that, while it underreports a model's true performance, it remains a valuable performance metric.

accuracy, artifact, pred, (15 more...)

arXiv.org Artificial Intelligence

2401.0243

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > United Kingdom > England > Staffordshire (0.04)
Asia > China (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Leisure & Entertainment > Sports (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Meta Semantics: Towards better natural language understanding and reasoning

Hu, Xiaolin

arXiv.org Artificial IntelligenceApr-20-2023

Natural language understanding is the study of making machines understand the daily used informal text. There are two main categories of methods, statistic-based methods and rule-based methods. Benefiting from the blow-up of deep learning algorithms such as transformer[1], the statistic-based methods upgrade from the traditional Bayesian methods and have better robustness. On the hand, the rule-based methods are wildly used in expert systems, which are run by handwritten rules from experts and use the patterns to map the natural language to machine-readable commands such as SQL, the LUNAR system[2], as an example, which is used in the analysis of lunar geology. Although both methods have got great achievements, there still exist some main challenges that we need to resolve. In section 2, we will discuss the success and challenges of the existing natural language understanding models. In section 3, a potential solution to the OOV problem from word embedding which limits the deep neural method to reasoning and understanding will be presented. In section 4, we will propose our semantic model in detail to move the natural language understanding into the next stage.

logic & formal reasoning, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2304.10663

Country:

Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Berlin (0.04)
Asia > China > Heilongjiang Province > Daqing (0.04)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Understanding (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Complete Guide on Feature Extraction Techniques - Analytics Vidhya

#artificialintelligenceMay-31-2022, 04:35:54 GMT

This article was published as a part of the Data Science Blogathon. In Natural Language Processing, Feature Extraction is one of the most important steps to be followed for a better understanding of the context of what we are dealing with. After the initial text is cleaned, we need to transform it into its features to be used for modeling. Document data is not computable so it must be transformed into numerical data such as a vector space model. This transformation task is generally called feature extraction of document data.

analytic vidhya, feature extraction technique, natural language processing, (8 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Feature Extraction (0.90)

Add feedback

Deep learning models for representing out-of-vocabulary words

Lochter, Johannes V., Silva, Renato M., Almeida, Tiago A.

arXiv.org Artificial IntelligenceJul-28-2020

Communication has become increasingly dynamic with the popularization of social networks and applications that allow people to express themselves and communicate instantly. In this scenario, distributed representation models have their quality impacted by new words that appear frequently or that are derived from spelling errors. These words that are unknown by the models, known as out-of-vocabulary (OOV) words, need to be properly handled to not degrade the quality of the natural language processing (NLP) applications, which depend on the appropriate vector representation of the texts. To better understand this problem and finding the best techniques to handle OOV words, in this study, we present a comprehensive performance evaluation of deep learning models for representing OOV words. We performed an intrinsic evaluation using a benchmark dataset and an extrinsic evaluation using different NLP tasks: text categorization, named entity recognition, and part-of-speech tagging. Although the results indicated that the best technique for handling OOV words is different for each task, Comick, a deep learning method that infers the embedding based on the context and the morphological structure of the OOV word, obtained promising results.

experiment, oov, representation, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-030-61377-8_29

2007.07318

Country:

Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)
South America > Brazil > São Paulo > Campinas (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(12 more...)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Services (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Leveraging External Knowledge for Out-Of-Vocabulary Entity Labeling

de Wynter, Adrian, Mathias, Lambert

arXiv.org Machine LearningAug-26-2019

Dealing with previously unseen slots is a challenging problem in a real-world multi-domain dialogue state tracking task. Other approaches rely on predefined mappings to generate candidate slot keys, as well as their associated values. This, however, may fail when the key, the value, or both, are not seen during training. To address this problem we introduce a neural network that leverages external knowledge bases (KBs) to better classify out-of-vocabulary slot keys and values. This network projects the slot into an attribute space derived from the KB, and, by leveraging similarities in this space, we propose candidate slot keys and values to the dialogue state tracker. We provide extensive experiments that demonstrate that our stratagem can improve upon a previous approach, which relies on predefined candidate mappings. In particular, we evaluate this approach by training a state-of-the-art model with candidates generated from our network, and obtained relative increases of 57.7% and 82.7% in F1 score and accuracy, respectively, for the aforementioned model, when compared to the current candidate generation strategy.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1908.09936

Country: North America > United States (0.46)

Genre: Research Report (0.70)

Industry: Media (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Add feedback