Georgiev, Georgi
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
Wang, Yuxia, Wang, Minghan, Iqbal, Hasan, Georgiev, Georgi, Geng, Jiahui, Nakov, Preslav
The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs. Difficulties lie in assessing the factuality of free-form responses in open domains. Moreover, different papers use disparate evaluation benchmarks and measures, which makes them hard to compare and hampers future progress. To mitigate these issues, we propose OpenFactCheck, a unified factuality evaluation framework for LLMs. OpenFactCheck consists of three modules: (i) CUSTCHECKER, which allows users to easily customize an automatic fact-checker and verify the factual correctness of documents and claims; (ii) LLMEVAL, a unified evaluation framework that fairly assesses an LLM's factuality from multiple perspectives; and (iii) CHECKEREVAL, an extensible solution for gauging the reliability of automatic fact-checkers' verification results using human-annotated datasets. OpenFactCheck is publicly released at https://github.com/yuxiaw/OpenFactCheck.
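To make the three-module design concrete, below is a minimal Python sketch of how such a pipeline could be wired together; the class and method names (CustChecker, LLMEval, CheckerEval, verify, evaluate) are illustrative assumptions for exposition and do not reflect the actual API of the released code.

```python
# Illustrative sketch of the three-module design; names here are assumptions,
# not the actual API of the OpenFactCheck repository.

class CustChecker:
    """(i) A user-customized automatic fact-checker for documents and claims."""
    def __init__(self, claim_extractor, evidence_retriever, verifier):
        self.extract, self.retrieve, self.verify_claim = (
            claim_extractor, evidence_retriever, verifier)

    def verify(self, document: str) -> list[dict]:
        # Split the document into claims, retrieve evidence, and label each claim.
        return [{"claim": c, "label": self.verify_claim(c, self.retrieve(c))}
                for c in self.extract(document)]

class LLMEval:
    """(ii) Evaluates an LLM's factuality over a suite of benchmarks via a checker."""
    def __init__(self, benchmarks: dict, checker: CustChecker):
        self.benchmarks, self.checker = benchmarks, checker

    def evaluate(self, generate) -> dict:
        scores = {}
        for name, prompts in self.benchmarks.items():
            results = [self.checker.verify(generate(p)) for p in prompts]
            supported = [r["label"] == "supported" for res in results for r in res]
            scores[name] = sum(supported) / max(len(supported), 1)
        return scores

class CheckerEval:
    """(iii) Gauges a fact-checker's reliability against human-annotated labels."""
    def evaluate(self, checker_labels: list, gold_labels: list) -> float:
        agree = sum(p == g for p, g in zip(checker_labels, gold_labels))
        return agree / max(len(gold_labels), 1)
```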
Factuality of Large Language Models in the Year 2024
Wang, Yuxia, Wang, Minghan, Manzoor, Muhammad Arslan, Liu, Fei, Georgiev, Georgi, Das, Rocktim Jyoti, Nakov, Preslav
Large language models (LLMs), especially when instruction-tuned for chat, have become part of our daily lives, freeing people from searching, extracting, and integrating information from multiple sources by offering a straightforward answer to a variety of questions in a single place. Unfortunately, LLM responses are often factually incorrect, which limits their applicability in real-world scenarios. As a result, evaluating and improving the factuality of LLMs has recently attracted a lot of research attention. In this survey, we critically analyze existing work with the aim of identifying the major challenges and their underlying causes, pointing out potential solutions for improving the factuality of LLMs, and analyzing the obstacles to automated factuality evaluation for open-ended text generation. We further offer an outlook on where future research should go.
Leaf: Multiple-Choice Question Generation
Vachev, Kristiyan, Hardalov, Momchil, Karadzhov, Georgi, Georgiev, Georgi, Koychev, Ivan, Nakov, Preslav
Testing with quiz questions has proven to be an effective way to assess and improve the educational process. However, manually creating quizzes is tedious and time-consuming. To address this challenge, we present Leaf, a system for generating multiple-choice questions from factual text. In addition to being very well suited for the classroom, Leaf could also be used in an industrial setting, e.g., to facilitate onboarding and knowledge sharing, or as a component of chatbots, question answering systems, or Massive Open Online Courses (MOOCs). The code and the demo are available on GitHub.
Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields
Georgiev, Georgi, Nakov, Preslav, Ganchev, Kuzman, Osenova, Petya, Simov, Kiril Ivanov
The paper presents a feature-rich approach to the automatic recognition and categorization of named entities (persons, organizations, locations, and miscellaneous) in Bulgarian news text. We combine well-established features used for other languages with language-specific lexical, syntactic, and morphological information. In particular, we make use of the rich tagset annotation of the BulTreeBank (680 morpho-syntactic tags), from which we derive suitable task-specific tagsets (local and non-local). We further add domain-specific gazetteers and additional unlabeled data, achieving F1=89.4%, which is comparable to state-of-the-art results for English.
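As a rough illustration of a feature-rich linear-chain CRF tagger of this kind, the sketch below uses the sklearn-crfsuite package with a generic feature template (lexical form, casing, suffix, and a morpho-syntactic tag plus local context); the paper's concrete feature set, derived tagsets, and gazetteer features are not reproduced here.

```python
# Generic illustration of feature-rich CRF-based NER (not the paper's exact setup):
# each token is mapped to a dictionary of lexical and morpho-syntactic features,
# then sequences are tagged with a linear-chain CRF.
import sklearn_crfsuite  # assumes the sklearn-crfsuite package is installed

def token_features(sent, i):
    word, postag = sent[i]          # sent: list of (token, morpho-syntactic tag)
    feats = {
        "word.lower": word.lower(),
        "word.istitle": word.istitle(),
        "word.isdigit": word.isdigit(),
        "suffix3": word[-3:],
        "postag": postag,           # e.g. a BulTreeBank-style tag
    }
    if i > 0:
        prev_word, prev_tag = sent[i - 1]
        feats.update({"-1:word.lower": prev_word.lower(), "-1:postag": prev_tag})
    else:
        feats["BOS"] = True         # beginning of sentence
    if i < len(sent) - 1:
        next_word, next_tag = sent[i + 1]
        feats.update({"+1:word.lower": next_word.lower(), "+1:postag": next_tag})
    else:
        feats["EOS"] = True         # end of sentence
    return feats

def sent2features(sent):
    return [token_features(sent, i) for i in range(len(sent))]

# train_sents: tokenized sentences with tags; y_train: BIO label sequences.
crf = sklearn_crfsuite.CRF(algorithm="lbfgs", c1=0.1, c2=0.1, max_iterations=100)
# crf.fit([sent2features(s) for s in train_sents], y_train)
# y_pred = crf.predict([sent2features(s) for s in test_sents])
```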
Generating Answer Candidates for Quizzes and Answer-Aware Question Generators
Vachev, Kristiyan, Hardalov, Momchil, Karadzhov, Georgi, Georgiev, Georgi, Koychev, Ivan, Nakov, Preslav
In education, open-ended quiz questions have become an important tool for assessing students' knowledge. Yet, manually preparing such questions is tedious, and thus automatic question generation has been proposed as an alternative. So far, the vast majority of research has focused on generating the question text, relying on question answering datasets with readily picked answers, while the problem of how to come up with answer candidates in the first place has been largely ignored. Here, we aim to bridge this gap. In particular, we propose a model that can generate a specified number of answer candidates for a given passage of text, which can then be used by instructors to write questions manually or passed as input to automatic answer-aware question generators. Our experiments show that the proposed answer candidate generation model outperforms several baselines.
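A minimal sketch of how such an answer-candidate generator could be invoked with a generic sequence-to-sequence model is given below; the checkpoint name, prompt prefix, and decoding settings are placeholder assumptions, not the authors' released model or configuration.

```python
# Minimal sketch of answer-candidate generation with a generic seq2seq model;
# checkpoint name and input format are illustrative assumptions only.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

MODEL_NAME = "t5-small"  # placeholder checkpoint, not the paper's model
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)

def generate_answer_candidates(passage: str, num_candidates: int = 5) -> list[str]:
    """Return `num_candidates` candidate answer phrases for a given passage."""
    inputs = tokenizer("extract answers: " + passage,
                       return_tensors="pt", truncation=True, max_length=512)
    outputs = model.generate(
        **inputs,
        num_beams=max(num_candidates, 5),       # beams must cover requested candidates
        num_return_sequences=num_candidates,
        max_new_tokens=16,
    )
    return tokenizer.batch_decode(outputs, skip_special_tokens=True)

# candidates = generate_answer_candidates("Sofia is the capital of Bulgaria.", 3)
```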
Where Classification Fails, Interpretation Rises
Nguyen, Chanh, Georgiev, Georgi, Ji, Yujie, Wang, Ting
An intriguing property of deep neural networks is their inherent vulnerability to adversarial inputs, which significantly hinders their application in security-critical domains. Most existing detection methods attempt to use carefully engineered patterns to distinguish adversarial inputs from their genuine counterparts, which, however, can often be circumvented by adaptive adversaries. In this work, we take a completely different route by leveraging the definition of adversarial inputs: while they deceive deep neural networks, they are barely discernible to human vision. Building upon recent advances in interpretable models, we construct a new detection framework that contrasts an input's interpretation against its classification. We validate the efficacy of this framework through extensive experiments using benchmark datasets and attacks. We believe that this work opens a new direction for designing adversarial input detection methods.
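As a simplified illustration of the general idea of contrasting interpretation with classification (not the paper's actual detection framework), the sketch below computes a gradient-based saliency map, keeps only the most salient evidence, and flags the input when that evidence alone no longer yields the original prediction.

```python
# Simplified illustration of checking whether an input's interpretation
# (a gradient saliency map) is consistent with its classification.
import torch

def saliency_map(model, x, target_class):
    """Per-pixel importance of input x for the given class (C,H,W -> H,W)."""
    x = x.clone().detach().requires_grad_(True)
    score = model(x.unsqueeze(0))[0, target_class]
    score.backward()
    return x.grad.abs().max(dim=0).values

def interpretation_agrees(model, x, keep_fraction=0.1):
    """Return False (suspicious) when the top-salient evidence alone
    no longer supports the model's original prediction."""
    with torch.no_grad():
        pred = model(x.unsqueeze(0)).argmax(dim=1).item()
    sal = saliency_map(model, x, pred)
    threshold = torch.quantile(sal.flatten(), 1.0 - keep_fraction)
    mask = (sal >= threshold).float()           # keep only the most salient pixels
    with torch.no_grad():
        masked_pred = model((x * mask).unsqueeze(0)).argmax(dim=1).item()
    return masked_pred == pred                  # disagreement -> likely adversarial
```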