AITopics | Nikishina, Irina

Collaborating Authors

Nikishina, Irina

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Do I look like a `cat.n.01` to you? A Taxonomy Image Generation Benchmark

Moskvoretskii, Viktor, Lobanova, Alina, Neminova, Ekaterina, Biemann, Chris, Panchenko, Alexander, Nikishina, Irina

arXiv.org Artificial IntelligenceMar-13-2025

This paper explores the feasibility of using text-to-image models in a zero-shot setup to generate images for taxonomy concepts. While text-based methods for taxonomy enrichment are well-established, the potential of the visual dimension remains unexplored. To address this, we propose a comprehensive benchmark for Taxonomy Image Generation that assesses models' abilities to understand taxonomy concepts and generate relevant, high-quality images. The benchmark includes common-sense and randomly sampled WordNet concepts, alongside the LLM generated predictions. The 12 models are evaluated using 9 novel taxonomy-related text-to-image metrics and human feedback. Moreover, we pioneer the use of pairwise evaluation with GPT-4 feedback for image generation. Experimental results show that the ranking of models differs significantly from standard T2I tasks. Playground-v2 and FLUX consistently outperform across metrics and subsets and the retrieval-based approach performs poorly. These findings highlight the potential for automating the curation of structured data resources.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2503.10357

Country: North America > United States (0.28)

Genre:

Research Report > New Finding (0.87)
Research Report > Experimental Study (0.68)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Self-Taught Self-Correction for Small Language Models

Moskvoretskii, Viktor, Biemann, Chris, Nikishina, Irina

arXiv.org Artificial IntelligenceMar-11-2025

Although large language models (LLMs) have achieved remarkable performance across various tasks, they remain prone to errors. A key challenge is enabling them to self-correct. While prior research has relied on external tools or large proprietary models, this work explores self-correction in small language models (SLMs) through iterative fine-tuning using solely self-generated data. We introduce the Self-Taught Self-Correction (STaSC) algorithm, which incorporates multiple algorithmic design choices. Experimental results on a question-answering task demonstrate that STaSC effectively learns self-correction, leading to significant performance improvements. Our analysis further provides insights into the mechanisms of self-correction and the impact of different design choices on learning dynamics and overall performance. To support future research, we release our user-friendly codebase and lightweight models.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2503.08681

Country:

North America > United States (0.14)
Europe > Middle East > Malta (0.14)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (0.68)
Media > Film (0.47)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)

Add feedback

Argument-Based Comparative Question Answering Evaluation Benchmark

Nikishina, Irina, Anwar, Saba, Dolgov, Nikolay, Manina, Maria, Ignatenko, Daria, Moskvoretskii, Viktor, Shelmanov, Artem, Baldwin, Tim, Biemann, Chris

arXiv.org Artificial IntelligenceFeb-20-2025

In this paper, we aim to solve the problems standing in the way of automatic comparative question answering. To this end, we propose an evaluation framework to assess the quality of comparative question answering summaries. We formulate 15 criteria for assessing comparative answers created using manual annotation and annotation from 6 large language models and two comparative question asnwering datasets. We perform our tests using several LLMs and manual annotation under different settings and demonstrate the constituency of both evaluations. Our results demonstrate that the Llama-3 70B Instruct model demonstrates the best results for summary evaluation, while GPT-4 is the best for answering comparative questions. All used data, code, and evaluation results are publicly available\footnote{\url{https://anonymous.4open.science/r/cqa-evaluation-benchmark-4561/README.md}}.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2502.14476

Country:

North America > Canada (0.14)
Europe > Ireland (0.14)
North America > United States (0.14)
(5 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home

Moskvoretskii, Viktor, Lysyuk, Maria, Salnikov, Mikhail, Ivanov, Nikolay, Pletenev, Sergey, Galimzianova, Daria, Krayko, Nikita, Konovalov, Vasily, Nikishina, Irina, Panchenko, Alexander

arXiv.org Artificial IntelligenceJan-22-2025

Retrieval Augmented Generation (RAG) improves correctness of Question Answering (QA) and addresses hallucinations in Large Language Models (LLMs), yet greatly increase computational costs. Besides, RAG is not always needed as may introduce irrelevant information. Recent adaptive retrieval methods integrate LLMs' intrinsic knowledge with external information appealing to LLM self-knowledge, but they often neglect efficiency evaluations and comparisons with uncertainty estimation techniques. We bridge this gap by conducting a comprehensive analysis of 35 adaptive retrieval methods, including 8 recent approaches and 27 uncertainty estimation techniques, across 6 datasets using 10 metrics for QA performance, self-knowledge, and efficiency. Our findings show that uncertainty estimation techniques often outperform complex pipelines in terms of efficiency and self-knowledge, while maintaining comparable QA performance.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2501.12835

Country:

North America > United States > Hawaii (0.14)
North America > Mexico > Mexico City (0.14)
Europe > Austria > Vienna (0.14)
Asia > Middle East > UAE (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Semantic Tasks

Moskvoretskii, Viktor, Neminova, Ekaterina, Lobanova, Alina, Panchenko, Alexander, Nikishina, Irina

arXiv.org Artificial IntelligenceJun-17-2024

In this paper, we explore the capabilities of LLMs in capturing lexical-semantic knowledge from WordNet on the example of the LLaMA-2-7b model and test it on multiple lexical semantic tasks. As the outcome of our experiments, we present TaxoLLaMA, the everything-in-one model, lightweight due to 4-bit quantization and LoRA. It achieves 11 SotA results, 4 top-2 results out of 16 tasks for the Taxonomy Enrichment, Hypernym Discovery, Taxonomy Construction, and Lexical Entailment tasks. Moreover, it demonstrates very strong zero-shot performance on Lexical Entailment and Taxonomy Construction with no fine-tuning. We also explore its hidden multilingual and domain adaptation capabilities with a little tuning or few-shot learning. All datasets, code, and model are available online at https://github.com/VityaVitalich/TaxoLLaMA

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2403.09207

Country:

Asia (0.93)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Large Language Models Meet Knowledge Graphs to Answer Factoid Questions

Salnikov, Mikhail, Le, Hai, Rajput, Prateek, Nikishina, Irina, Braslavski, Pavel, Malykh, Valentin, Panchenko, Alexander

arXiv.org Artificial IntelligenceOct-3-2023

Recently, it has been shown that the incorporation of structured knowledge into Large Language Models significantly improves the results for a variety of NLP tasks. In this paper, we propose a method for exploring pre-trained Text-to-Text Language Models enriched with additional information from Knowledge Graphs for answering factoid questions. More specifically, we propose an algorithm for subgraphs extraction from a Knowledge Graph based on question entities and answer candidates. Then, we procure easily interpreted information with Transformer-based models through the linearization of the extracted subgraphs. Final re-ranking of the answer candidates with the extracted information boosts Hits@1 scores of the pre-trained text-to-text language models by 4-6%.

large language model, machine learning, subgraph, (18 more...)

arXiv.org Artificial Intelligence

2310.02166

Country:

North America > United States > Texas (0.14)
North America > United States > Maryland (0.14)
Europe > Austria > Vienna (0.14)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

RUSSE'2020: Findings of the First Taxonomy Enrichment Task for the Russian language

Nikishina, Irina, Logacheva, Varvara, Panchenko, Alexander, Loukachevitch, Natalia

arXiv.org Artificial IntelligenceMay-22-2020

Taxonomies are tree structures that organize terms into a semantic hierarchy. Taxonomic relations (or hypernyms) are "isa" relations: cat is-an animal, banana isa fruit, Microsoft isa company, etc. This type of relations is useful in a wide range of Natural Language Processing (NLP) tasks for performing semantic analysis. While substantial interest is drawn to the extraction of hypernyms and taxonomic structures from text [6, 7, 9], the fully automatic taxonomy induction methods are still not widely used for routine construction of lexical resources, such as taxonomies. Nevertheless, the automatic hypernym candidate generation can facilitate and accelerate the manual taxonomy extension. Therefore, it is extremely useful to develop support tools for creation, enrichment, and maintenance of the existing semantic resources as well as their tuning to specific tasks and/or text collections.

artificial intelligence, hypernym, text processing, (19 more...)

arXiv.org Artificial Intelligence

2005.11176

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback