mtv
Instability in Downstream Task Performance During LLM Pretraining
Nishida, Yuto, Isonuma, Masaru, Oda, Yusuke
When training large language models (LLMs), it is common practice to track downstream task performance throughout the training process and select the checkpoint with the highest validation score. However, downstream metrics often exhibit substantial fluctuations, making it difficult to identify the checkpoint that truly represents the best-performing model. In this study, we empirically analyze the stability of downstream task performance in an LLM trained on diverse web-scale corpora. We find that task scores frequently fluctuate throughout training, both at the aggregate and example levels. To address this instability, we investigate two post-hoc checkpoint integration methods: checkpoint averaging and ensemble, motivated by the hypothesis that aggregating neighboring checkpoints can reduce performance volatility. We demonstrate both empirically and theoretically that these methods improve downstream performance stability without requiring any changes to the training procedure.
- North America > Canada > Ontario > Toronto (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- (5 more...)
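A minimal sketch of the two integration methods described in the abstract above, assuming PyTorch-style checkpoints saved as state dicts; the file names and surrounding setup are hypothetical.

import torch

def average_checkpoints(paths):
    # Checkpoint averaging: element-wise mean of neighboring checkpoints' parameters.
    state_dicts = [torch.load(p, map_location="cpu") for p in paths]
    return {
        key: torch.stack([sd[key].float() for sd in state_dicts]).mean(dim=0)
        for key in state_dicts[0]
    }

def ensemble_predict(models, inputs):
    # Ensembling: average the output distributions of neighboring checkpoints.
    with torch.no_grad():
        probs = torch.stack([m(inputs).softmax(dim=-1) for m in models])
    return probs.mean(dim=0)

# Hypothetical usage: smooth over the three checkpoints nearest the selected step.
# model.load_state_dict(average_checkpoints(
#     ["step_9000.pt", "step_10000.pt", "step_11000.pt"]))

Averaging leaves inference cost unchanged because it yields a single set of weights; ensembling instead keeps every checkpoint at inference time and averages their predictions.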
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Huang, Brandon, Mitra, Chancharik, Arbelle, Assaf, Karlinsky, Leonid, Darrell, Trevor, Herzig, Roei
The recent success of interleaved Large Multimodal Models (LMMs) in few-shot learning suggests that in-context learning (ICL) with many examples can be promising for learning new tasks. However, this many-shot multimodal ICL setting has one crucial problem: it is fundamentally limited by the model's context length set at pretraining. The problem is especially prominent in the multimodal domain, which processes both text and images and therefore requires additional tokens. This motivates the need for a multimodal method to compress many shots into fewer tokens without finetuning. In this work, we enable LMMs to perform multimodal, many-shot in-context learning by leveraging Multimodal Task Vectors (MTV): compact implicit representations of in-context examples compressed in the model's attention heads. Specifically, we first demonstrate the existence of such MTV in LMMs and then leverage these extracted MTV to enable many-shot in-context learning for various vision-and-language tasks. Our experiments suggest that MTV can scale in performance with the number of compressed shots and generalize to similar out-of-domain tasks without additional context length for inference.
- North America > United States > California > Alameda County > Berkeley (0.04)
- Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- (2 more...)
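A minimal sketch of the extract-then-patch idea behind task vectors, assuming a PyTorch model whose selected attention-head modules emit (batch, seq, dim) tensors; the head selection, hook-based patching, and every name below are illustrative stand-ins, not the authors' implementation.

import torch

def extract_task_vectors(model, shot_batches, head_modules):
    # Run many-shot ICL prompts and record each selected head's mean activation.
    sums, count, hooks = {name: None for name in head_modules}, 0, []

    def make_hook(name):
        def hook(module, inputs, output):
            vec = output.detach().mean(dim=(0, 1))  # assumes (batch, seq, dim) output
            sums[name] = vec if sums[name] is None else sums[name] + vec
        return hook

    for name, module in head_modules.items():
        hooks.append(module.register_forward_hook(make_hook(name)))
    with torch.no_grad():
        for batch in shot_batches:
            model(batch)
            count += 1
    for h in hooks:
        h.remove()
    return {name: s / count for name, s in sums.items()}

def patch_task_vectors(head_modules, task_vectors):
    # At inference, overwrite the selected heads' outputs with the stored vectors,
    # so the compressed shots consume no context-window tokens.
    return [
        module.register_forward_hook(
            lambda mod, inp, out, v=task_vectors[name]: torch.zeros_like(out) + v)
        for name, module in head_modules.items()
    ]

Calling remove() on the returned hooks restores normal behavior; the point is only that the many-shot examples end up as a handful of per-head vectors rather than as prompt tokens.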
Optimal Initialization of Batch Bayesian Optimization
Field experiments and computer simulations are effective but time-consuming methods of measuring the quality of engineered systems at different settings. To reduce the total time required, experimenters may employ Bayesian optimization, which is parsimonious with measurements, and take measurements of multiple settings simultaneously, in a batch. In practice, experimenters use very few batches, so it is imperative that each batch be as informative as possible. Typically, the initial batch in Batch Bayesian Optimization (BBO) is constructed from a quasi-random sample of setting values. We propose a batch-design acquisition function, Minimal Terminal Variance (MTV), that designs a batch by optimization rather than random sampling. MTV adapts a design criterion from Design of Experiments, called I-Optimality, which minimizes the variance of the post-evaluation estimates of quality, integrated over the entire space of settings. MTV weights the integral by the probability that a setting is optimal, enabling it to design not only the initial batch but all subsequent batches as well. Applicability to both initialization and subsequent batches is novel among acquisition functions. Numerical experiments on test functions and simulators show that MTV compares favorably to other BBO methods.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
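A minimal sketch of the MTV criterion under stated assumptions: a scikit-learn GP surrogate, a finite grid standing in for the integral over the settings space, and posterior (Thompson) samples to estimate the probability-of-optimality weights. mtv_score, prob_optimal, and the placeholder-y conditioning trick are illustrative, not the paper's implementation.

import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def prob_optimal(gp, X_grid, n_samples=256, maximize=False):
    # Weight w(x) = P(x is the optimum), estimated from posterior samples.
    samples = gp.sample_y(X_grid, n_samples=n_samples, random_state=0)
    idx = samples.argmax(axis=0) if maximize else samples.argmin(axis=0)
    counts = np.bincount(idx, minlength=len(X_grid)).astype(float)
    return counts / counts.sum()

def mtv_score(gp, X_obs, y_obs, X_batch, X_grid, weights):
    # Weighted I-optimality: posterior variance remaining after the batch is
    # evaluated, summed over the grid with probability-of-optimality weights.
    # GP predictive variance depends only on input locations, so placeholder
    # y-values suffice to condition on the not-yet-evaluated batch.
    kernel = getattr(gp, "kernel_", gp.kernel)  # fitted kernel if available
    X_f = np.vstack([X_obs, X_batch])
    y_f = np.concatenate([y_obs, np.zeros(len(X_batch))])
    gp_f = GaussianProcessRegressor(kernel=kernel, optimizer=None).fit(X_f, y_f)
    _, std = gp_f.predict(X_grid, return_std=True)
    return float(np.sum(weights * std ** 2))

# Hypothetical usage: keep the lowest-scoring of many random candidate batches.
# w = prob_optimal(gp, X_grid)
# best = min(candidate_batches, key=lambda B: mtv_score(gp, X, y, B, X_grid, w))

Before any data has been observed the weights are near-uniform and the score reduces to plain I-optimality, which is consistent with the abstract's claim that a single criterion can design both the initial batch and every subsequent one.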
'Bigger than MTV': how video games are helping the music industry thrive
"Video games have not only helped the music industry survive, but thrive on entirely new levels," Steve Schnur tells me. As the worldwide executive and president of music at game publisher EA, his team – many of whom have been professional musicians and singer/songwriters – work with some of the biggest music acts in the world, licensing music for video game series like Fifa, Madden NFL, Need for Speed and NHL. Since the 90s, when licensed music became prevalent in games, series such as Tony Hawk's Pro Skater, Grand Theft Auto and Wipeout have become just as well-known for their soundtracks as they are for their gameplay. For millions of people, video games have been a way to discover new favourite bands or dive into other musical genres. And because people discover this music while playing a game they love, they develop a strong emotional attachment to it.
- Media > Music (1.00)
- Leisure & Entertainment > Games > Computer Games (1.00)
Defying Gravity (Broadcasting & Cable)
Network television is at risk of getting caught in a vicious cycle. As the audience fragments in a million different directions, smaller subsets of that audience see promos for new shows. Then, as new shows draw smaller crowds, even fewer viewers see promos for other programs. The reach of television networks (the total number of viewers who watch for a minute or more at least once a day) is down a daunting 12 percent in one year. Yet a six percent larger audience has seen the promos for Viacom's MTV networks, even though they're running fewer spots.
- Media > Television (1.00)
- Leisure & Entertainment (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (0.76)
- Information Technology > Data Science > Data Mining > Big Data (0.58)