AITopics | text type

Collaborating Authors

text type

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SafeConstellations: Steering LLM Safety to Reduce Over-Refusals Through Task-Specific Trajectory

Maskey, Utsav, Yadav, Sumit, Dras, Mark, Naseem, Usman

arXiv.org Artificial IntelligenceAug-18-2025

LLMs increasingly exhibit over-refusal behavior, where safety mechanisms cause models to reject benign instructions that superficially resemble harmful content. This phenomena diminishes utility in production applications that repeatedly rely on common prompt templates or applications that frequently rely on LLMs for specific tasks (e.g. sentiment analysis, language translation). Through comprehensive evaluation, we demonstrate that LLMs still tend to refuse responses to harmful instructions when those instructions are reframed to appear as benign tasks. Our mechanistic analysis reveal that LLMs follow distinct "constellation" patterns in embedding space as representations traverse layers, with each task maintaining consistent trajectories that shift predictably between refusal and non-refusal cases. We introduce SafeConstellations, an inference-time trajectory-shifting approach that tracks task-specific trajectory patterns and guides representations toward non-refusal pathways. By selectively guiding model behavior only on tasks prone to over-refusal, and by preserving general model behavior, our method reduces over-refusal rates by up to 73% with minimal impact on utility-offering a principled approach to mitigating over-refusals.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2508.1129

Country:

Asia (0.69)
North America > Mexico (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Measuring Copyright Risks of Large Language Model via Partial Information Probing

Zhao, Weijie, Shao, Huajie, Xu, Zhaozhuo, Duan, Suzhen, Zhang, Denghui

arXiv.org Artificial IntelligenceSep-20-2024

Abstracting with credit is permitted.

llm, partial information, rouge-l score, (12 more...)

arXiv.org Artificial Intelligence

2409.13831

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Idaho > Ada County > Boise (0.05)
North America > United States > New Jersey > Hudson County > Hoboken (0.05)
(2 more...)

Genre: Research Report > New Finding (0.94)

Industry:

Law > Intellectual Property & Technology Law (1.00)
Government > Regional Government > North America Government > United States Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

Add feedback

MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction

Dai, Xiang, Karimi, Sarvnaz, Sarker, Abeed, Hachey, Ben, Paris, Cecile

arXiv.org Artificial IntelligenceMay-28-2024

Objective. Active adverse event surveillance monitors Adverse Drug Events (ADE) from different data sources, such as electronic health records, medical literature, social media and search engine logs. Over years, many datasets are created, and shared tasks are organised to facilitate active adverse event surveillance. However, most-if not all-datasets or shared tasks focus on extracting ADEs from a particular type of text. Domain generalisation-the ability of a machine learning model to perform well on new, unseen domains (text types)-is under-explored. Given the rapid advancements in natural language processing, one unanswered question is how far we are from having a single ADE extraction model that are effective on various types of text, such as scientific literature and social media posts}. Methods. We contribute to answering this question by building a multi-domain benchmark for adverse drug event extraction, which we named MultiADE. The new benchmark comprises several existing datasets sampled from different text types and our newly created dataset-CADECv2, which is an extension of CADEC (Karimi, et al., 2015), covering online posts regarding more diverse drugs than CADEC. Our new dataset is carefully annotated by human annotators following detailed annotation guidelines. Conclusion. Our benchmark results show that the generalisation of the trained models is far from perfect, making it infeasible to be deployed to process different types of text. In addition, although intermediate transfer learning is a promising approach to utilising existing resources, further investigation is needed on methods of domain adaptation, particularly cost-effective methods to select useful training instances.

adverse drug event, dataset, text type, (14 more...)

arXiv.org Artificial Intelligence

2405.18015

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > Middle East > Oman (0.04)

Genre: Research Report > New Finding (0.88)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Consumer Health (0.93)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization

Liu, Yang Janet, Zeldes, Amir

arXiv.org Artificial IntelligenceJun-19-2023

Automatic summarization with pre-trained language models has led to impressively fluent results, but is prone to 'hallucinations', low performance on non-news genres, and outputs which are not exactly summaries. Targeting ACL 2023's 'Reality Check' theme, we present GUMSum, a small but carefully crafted dataset of English summaries in 12 written and spoken genres for evaluation of abstractive summarization. Summaries are highly constrained, focusing on substitutive potential, factuality, and faithfulness. We present guidelines and evaluate human agreement as well as subjective judgments on recent system outputs, comparing general-domain untuned approaches, a fine-tuned one, and a prompt-based approach, to human performance. Results show that while GPT3 achieves impressive scores, it still underperforms humans, with varying quality across genres. Human judgments reveal different types of errors in supervised, prompted, and human-generated summaries, shedding light on the challenges of producing a good summary.

computational linguistic, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2306.11256

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(10 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Media > News (0.47)
Education (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Machine translation, no match for humans: machines translate words, humans the underlying message University of Helsinki

#artificialintelligenceDec-11-2019, 13:29:13 GMT

Many of us are familiar with Google Translate, translation applications for travellers' smartphones and the instruction manuals of various devices and products. Professional translators also make use of machines. Training a computer to translate between two specific languages takes millions of sentences or billions of words worth of text. Maarit Koponen, a postdoctoral researcher at the University of Helsinki, is investigating which errors made by machines lead to misunderstandings and how those mistakes could be identified. The learning algorithms behind machine translation are called artificial intelligence, but machines are not intelligent in the way humans or the super AIs of science-fiction films are.

koponen, machine translation, translation, (14 more...)

#artificialintelligence

Country: Europe > Finland > Uusimaa > Helsinki (0.61)

Genre: Research Report (0.35)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback