AITopics | Tuzla Canton

Tuzla Canton

Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm

Bojic, Ljubisa, Zagovora, Olga, Zelenkauskaite, Asta, Vukovic, Vuk, Cabarkapa, Milan, Jerkovic, Selma Veseljević, Jovančevic, Ana

arXiv.org Artificial Intelligence, Jan-5-2025

In the era of rapid digital communication, vast amounts of textual data are generated daily, demanding efficient methods for latent content analysis to extract meaningful insights. Large Language Models (LLMs) offer potential for automating this process, yet comprehensive assessments comparing their performance to human annotators across multiple dimensions are lacking. This study evaluates the reliability, consistency, and quality of seven state-of-the-art LLMs, including variants of OpenAI's GPT-4, Gemini, Llama, and Mixtral, relative to human annotators in analyzing sentiment, political leaning, emotional intensity, and sarcasm detection. A total of 33 human annotators and eight LLM variants assessed 100 curated textual items, generating 3,300 human and 19,200 LLM annotations, with LLMs evaluated across three time points to examine temporal consistency. Inter-rater reliability was measured using Krippendorff's alpha, and intra-class correlation coefficients assessed consistency over time. The results reveal that both humans and LLMs exhibit high reliability in sentiment analysis and political leaning assessments, with LLMs demonstrating higher internal consistency than humans. In emotional intensity, LLMs displayed higher agreement compared to humans, though humans rated emotional intensity significantly higher. Both groups struggled with sarcasm detection, evidenced by low agreement. LLMs showed excellent temporal consistency across all dimensions, indicating stable performance over time. This research concludes that LLMs, especially GPT-4, can effectively replicate human analysis in sentiment and political leaning, although human expertise remains essential for emotional intensity interpretation. The findings demonstrate the potential of LLMs for consistent and high-quality performance in certain areas of latent content analysis.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2501.02532

Country:

North America > United States > Washington > King County > Seattle (0.14)
Europe > Lithuania > Vilnius County > Vilnius (0.04)
Europe > Serbia > Šumadija and Western Serbia > Šumadija District > Kragujevac (0.04)
(23 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments

Cheng, Sitao, Zhuang, Ziyuan, Xu, Yong, Yang, Fangkai, Zhang, Chaoyun, Qin, Xiaoting, Huang, Xiang, Chen, Ling, Lin, Qingwei, Zhang, Dongmei, Rajmohan, Saravan, Zhang, Qi

arXiv.org Artificial Intelligence, Jul-3-2024

Large Language Models (LLMs) have shown potential in reasoning over structured environments, e.g., knowledge graph and table. Such tasks typically require multi-hop reasoning, i.e., match natural language utterance with instances in the environment. Previous methods leverage LLMs to incrementally build a reasoning path, where the LLMs either invoke tools or pick up schemas by step-by-step interacting with the environment. We propose Reasoning-Path-Editing (Readi), a novel framework where LLMs can efficiently and faithfully reason over structured environments. In Readi, LLMs initially generate a reasoning path given a query, and edit the path only when necessary. We instantiate the path on structured environments and provide feedback to edit the path if anything goes wrong. Experimental results on three KGQA and two TableQA datasets show the effectiveness of Readi, significantly surpassing previous LLM-based methods (by 9.1% Hit@1 on WebQSP, 12.4% on MQA-3H and 9.5% on WTQ), comparable with state-of-the-art fine-tuned methods (67% on CWQ and 74.7% on WebQSP) and substantially boosting the vanilla LLMs (by 14.9% on CWQ). Our code will be available on https://aka.ms/readi.

readi, reasoning path, relation, (14 more...)

arXiv.org Artificial Intelligence

2403.08593

Country:

North America > United States > District of Columbia > Washington (0.14)
Europe > France (0.06)
Europe > Netherlands > Gelderland > Nijmegen (0.05)
(22 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

A Long-Short-Term Mixed-Integer Formulation for Highway Lane Change Planning

Reiter, Rudolf, Nurkanovic, Armin, Bernardini, Daniele, Diehl, Moritz, Bemporad, Alberto

arXiv.org Artificial Intelligence, May-5-2024

Abstract: This work considers the problem of optimal lane changing in a structured multi-agent road environment. The long-term decision variables account for selecting gaps between SVs on each lane. These lane transitions are used for [...] transition gaps on consecutive lanes are modeled by disjunctive [...] LTF are formulated consistently, i.e., a transition point constrains the point-mass model trajectory to the corresponding [...] Contrary to strict hierarchical decomposition, the coarser approximation of the high-level plan cannot be infeasible for the low-level planner. The STF aims at optimizing a four-state discrete-time trajectory of a point-mass model including [...] In recent years many approaches have been proposed for vehicle motion planning in structured multi-lane road environments. In fact, even deterministic two-dimensional motion planning problems with rectangular obstacles are NP-hard [1], [2]. This work proposes a novel iterative planning algorithm, referred to as long-short-term motion planner (LSTMP) that [...] Within the formulation of the LTF, the locations of transitions in time and position are continuous.

constraint, transition, vehicle, (17 more...)

arXiv.org Artificial Intelligence

2405.02979

Country:

Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Europe > Austria > Styria > Graz (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(12 more...)

Genre: Research Report (0.63)

Industry:

Transportation > Ground > Road (1.00)
Energy (0.94)
Automobiles & Trucks (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.92)
(2 more...)

Analyzing An After-Sales Service Process Using Object-Centric Process Mining: A Case Study

Park, Gyunam, Aydin, Sevde, Ugur, Cuneyt, van der Aalst, Wil M. P.

arXiv.org Artificial Intelligence, Oct-16-2023

Process mining, a technique turning event data into business process insights, has traditionally operated on the assumption that each event corresponds to a singular case or object. However, many real-world processes are intertwined with multiple objects, making them object-centric. This paper focuses on the emerging domain of object-centric process mining, highlighting its potential yet underexplored benefits in actual operational scenarios. Through an in-depth case study of Borusan Cat's after-sales service process, this study emphasizes the capability of object-centric process mining to capture entangled business process details. Utilizing an event log of approximately 65,000 events, our analysis underscores the importance of embracing this paradigm for richer business insights and enhanced operational improvements.

object-centric process mining, schedule and technician, technician, (10 more...)

arXiv.org Artificial Intelligence

2310.10174

Country:

Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Europe > Bosnia and Herzegovina > Federation of Bosnia and Herzegovina > Tuzla Canton > Tuzla (0.04)

Genre: Research Report (0.40)

Industry: Materials > Metals & Mining (0.47)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Geographic Adaptation of Pretrained Language Models

Hofmann, Valentin, Glavaš, Goran, Ljubešić, Nikola, Pierrehumbert, Janet B., Schütze, Hinrich

arXiv.org Artificial Intelligence, Jan-1-2023

Geographic features are commonly used to improve the performance of pretrained language models (PLMs) on NLP tasks where they are intuitively beneficial (e.g., geolocation prediction, dialect feature prediction). Existing methods, however, leverage geographic information in task-specific fine-tuning and fail to integrate it into the geo-linguistic knowledge encoded by PLMs, which would make it transferable across different tasks. In this paper, we introduce an approach to task-agnostic geoadaptation of PLMs that forces them to learn associations between linguistic phenomena and geographic locations. Geoadaptation is an intermediate training step that couples language modeling and geolocation prediction in a multi-task learning setup. In our main set of experiments, we geoadapt BERTić, a PLM for Bosnian-Croatian-Montenegrin-Serbian (BCMS), using a corpus of geotagged BCMS tweets. Evaluation on three tasks, namely fine-tuned as well as zero-shot geolocation prediction and zero-shot prediction of dialect features, shows that geoadaptation is very effective: e.g., we obtain state-of-the-art performance in supervised geolocation prediction and report massive gains over geographically uninformed PLMs on zero-shot geolocation prediction. Moreover, in follow-up experiments we successfully geoadapt two other PLMs, specifically ScandiBERT on Norwegian, Swedish, and Danish tweets and GermanBERT on Jodel posts in German from Austria, Germany, and Switzerland, proving that the benefits of geoadaptation are not limited to a particular language area and PLM.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2203.08565

Country:

Europe > Switzerland (0.24)
Europe > Austria (0.24)
North America > United States > Wisconsin > Dane County > Madison (0.14)
(13 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Efficient HEX-Program Evaluation Based on Unfounded Sets

Eiter, T., Fink, M., Krennwallner, T., Redl, C., Schüller, P.

Journal of Artificial Intelligence Research, Feb-26-2014

HEX-programs extend logic programs under the answer set semantics with external computations through external atoms. As reasoning from ground Horn programs with nonmonotonic external atoms of polynomial complexity is already on the second level of the polynomial hierarchy, minimality checking of answer set candidates needs special attention. To this end, we present an approach based on unfounded sets as a generalization of related techniques for ASP programs. The unfounded set detection is expressed as a propositional SAT problem, for which we provide two different encodings and optimizations to them. We then integrate our approach into a previously developed evaluation framework for HEX-programs, which is enriched by additional learning techniques that aim at avoiding the reconstruction of the same or related unfounded sets. Furthermore, we provide a syntactic criterion that allows one to skip the minimality check in many cases. An experimental evaluation shows that the new approach significantly decreases runtime.

assignment, atom, external atom, (16 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4175

AI Access Foundation

10865

Country:

Europe > Austria > Vienna (0.14)
Europe > Hungary > Budapest > Budapest (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
(5 more...)

Genre: Research Report (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
