AITopics | marianne

Collaborating Authors

marianne

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization

Kim, Hyuntak, Kim, Byung-Hak

arXiv.org Artificial IntelligenceJun-2-2025

Summarizing long-form narratives--such as books, movies, and TV scripts--requires capturing intricate plotlines, character interactions, and thematic coherence, a task that remains challenging for existing LLMs. We introduce NexusSum, a multi-agent LLM framework for narrative summarization that processes long-form text through a structured, sequential pipeline--without requiring fine-tuning. Our approach introduces two key innovations: (1) Dialogue-to-Description Transformation: A narrative-specific preprocessing method that standardizes character dialogue and descriptive text into a unified format, improving coherence. (2) Hierarchical Multi-LLM Summarization: A structured summarization pipeline that optimizes chunk processing and controls output length for accurate, high-quality summaries. Our method establishes a new state-of-the-art in narrative summarization, achieving up to a 30.0% improvement in BERTScore (F1) across books, movies, and TV scripts. These results demonstrate the effectiveness of multi-agent LLMs in handling long-form content, offering a scalable approach for structured summarization in diverse storytelling domains.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.24575

Country:

Asia (0.93)
Europe (0.92)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.87)

Industry:

Media (0.92)
Law Enforcement & Public Safety (0.67)
Government > Military (0.67)
Leisure & Entertainment > Social Events (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

End-to-End Long Document Summarization using Gradient Caching

Saxena, Rohit, Tang, Hao, Keller, Frank

arXiv.org Artificial IntelligenceJan-3-2025

Training transformer-based encoder-decoder models for long document summarization poses a significant challenge due to the quadratic memory consumption during training. Several approaches have been proposed to extend the input length at test time, but training with these approaches is still difficult, requiring truncation of input documents and causing a mismatch between training and test conditions. In this work, we propose CachED (Gradient $\textbf{Cach}$ing for $\textbf{E}$ncoder-$\textbf{D}$ecoder models), an approach that enables end-to-end training of existing transformer-based encoder-decoder models, using the entire document without truncation. Specifically, we apply non-overlapping sliding windows to input documents, followed by fusion in decoder. During backpropagation, the gradients are cached at the decoder and are passed through the encoder in chunks by re-computing the hidden vectors, similar to gradient checkpointing. In the experiments on long document summarization, we extend BART to CachED BART, processing more than 500K tokens during training and achieving superior performance without using any additional parameters.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2501.01805

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > Mexico > Mexico City > Mexico City (0.04)
North America > Canada > Ontario > Toronto (0.04)
(11 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Simulating discrimination in virtual reality

#artificialintelligenceJan-5-2023, 18:53:06 GMT

Have you ever been advised to "walk a mile in someone else's shoes?" Considering another person's perspective can be a challenging endeavor -- but recognizing our errors and biases is key to building understanding across communities. By challenging our preconceptions, we confront prejudice, such as racism and xenophobia, and potentially develop a more inclusive perspective about others. To assist with perspective-taking, MIT researchers have developed "On the Plane," a virtual reality role-playing game (VR RPG) that simulates discrimination. In this case, the game portrays xenophobia directed against a Malaysian America woman, but the approach can be generalized.

artificial intelligence, machine learning, passenger, (18 more...)

#artificialintelligence

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.63)
Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

BookSum: A Collection of Datasets for Long-form Narrative Summarization

Kryściński, Wojciech, Rajani, Nazneen, Agarwal, Divyansh, Xiong, Caiming, Radev, Dragomir

arXiv.org Artificial IntelligenceDec-6-2022

The majority of available text summarization datasets include short-form source documents that lack long-range causal and temporal dependencies, and often contain strong layout and stylistic biases. While relevant, such datasets will offer limited challenges for future generations of text summarization systems. We address these issues by introducing BookSum, a collection of datasets for long-form narrative summarization. Our dataset covers source documents from the literature domain, such as novels, plays and stories, and includes highly abstractive, human written summaries on three levels of granularity of increasing difficulty: paragraph-, chapter-, and book-level. The domain and structure of our dataset poses a unique set of challenges for summarization systems, which include: processing very long documents, non-trivial causal and temporal dependencies, and rich discourse structures. To facilitate future work, we trained and evaluated multiple extractive and abstractive summarization models as baselines for our dataset.

dashwood, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2105.08209

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > United Kingdom > England (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(13 more...)

Genre: Research Report > New Finding (0.45)

Industry:

Law (0.67)
Health & Medicine (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.67)

Add feedback

The Medium Doesn't Live Up to Its Best Ideas

WIREDFeb-3-2021, 13:00:00 GMT

Playing The Medium, a new horror game on Xbox and PC from developer Bloober Team, is like watching The Chilling Adventures of Sabrina on Netflix. The Medium has some fun ideas that it executes well, but the overall experience is bland and forgettable. Like Sabrina and a thousand other shows on Netflix, The Medium is inoffensive. It's a pleasant way to pass the time, but you probably won't finish it and you won't remember it a month after you put it down. It's the perfect game for Xbox's Game Pass, the service that seeks to be Netflix but for video games.

best idea, marianne, spirit world, (9 more...)

WIRED

Country: Europe > Poland (0.09)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Games (0.57)

Add feedback

Some Applications of Markov Chain in Python

#artificialintelligenceJan-19-2018, 03:37:10 GMT

In this article a few simple applications of Markov chain are going to be discussed as a solution to a few text processing problems. These problems appeared as assignments in a few courses, the descriptions are taken straightaway from the courses themselves. Use a Markov chain to create a statistical model of a piece of English text. Simulate the Markov chain to generate stylized pseudo-random text. In the 1948 landmark paper A Mathematical Theory of Communication, Claude Shannon founded the field of information theory and revolutionized the telecommunications industry, laying the groundwork for today's Information Age. In this paper, Shannon proposed using a Markov chain to create a statistical model of the sequences of letters in a piece of English text. Markov chains are now widely used in speech recognition, handwriting recognition, information retrieval, data compression, and spam filtering. They also have many scientific computing applications including the genemark algorithm for gene prediction, the Metropolis algorithm for measuring thermodynamical properties, and Google's PageRank algorithm for Web search.

machine learning, markov model, natural language, (18 more...)

#artificialintelligence

Country: North America > United States (0.14)

Industry: Telecommunications (0.88)

Technology: