
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning

Akgül, Ömer Faruk, Kalaycı, Yusuf Hakan, Kannan, Rajgopal, Neiswanger, Willie, Prasanna, Viktor

arXiv.org Artificial Intelligence

Large reasoning models achieve strong performance on complex tasks by generating extended chains of thought, but they often "overthink": continuing to reason long after they have enough information to answer correctly. This wastes inference-time compute and can hurt accuracy. Existing attempts to stop early either manipulate decoding with extra sampling and heuristics, rely on auxiliary verifier models, or operate only as post-hoc analysis pipelines without formal guarantees. We introduce LYNX, an online early-exit mechanism that turns a model's own hidden-state awareness into confidence-controlled stopping decisions. LYNX attaches exit decisions to naturally occurring reasoning cues (e.g., "hmm", "wait") during generation, trains a lightweight probe on hidden states at those cue tokens using supervision from forced exits, and wraps the resulting scores in split conformal prediction to obtain distribution-free control over premature exits. Crucially, we train and calibrate this probe once on a generic mathematical corpus and reuse it unchanged across benchmarks, decoding temperatures, and even non-mathematical tasks. Across three model families spanning 1.5B to 32B parameters, a single mathematically trained probe per base model yields strong accuracy–efficiency tradeoffs. On GSM8K, LYNX matches or improves baseline accuracy while reducing tokens by 40–65%; on MATH-500 it improves accuracy by up to 12 points with roughly 35–60% fewer tokens; on AIME 2024 it recovers baseline accuracy with more than 50% token savings; and on CommonsenseQA, a non-math benchmark, it transfers zero-shot with modest accuracy gains and up to 70% fewer tokens. Compared to state-of-the-art early-exit methods, LYNX offers competitive or superior Pareto frontiers while remaining fully online, requiring no proxy models at inference, and providing explicit, user-tunable confidence guarantees.
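The abstract does not spell out LYNX's calibration details, but the split conformal prediction step it names has a standard recipe: compute a nonconformity score on a held-out calibration set, take a finite-sample-corrected quantile as a threshold, and exit only when a new score clears it. The sketch below is a generic illustration of that recipe, not LYNX's implementation; the calibration values, the choice of `1 - probe_score` as the nonconformity score, and the `should_exit` rule are all illustrative assumptions.

```python
import numpy as np

def conformal_threshold(cal_scores, alpha=0.1):
    # Split conformal prediction: take the (1 - alpha) empirical quantile
    # of calibration nonconformity scores, with the (n + 1)/n
    # finite-sample correction that gives the distribution-free guarantee.
    n = len(cal_scores)
    q = np.ceil((n + 1) * (1 - alpha)) / n
    return np.quantile(cal_scores, min(q, 1.0), method="higher")

def should_exit(probe_score, tau):
    # Exit only when the nonconformity (here, 1 - probe score) is within
    # the calibrated threshold, so premature exits stay below rate alpha.
    return (1.0 - probe_score) <= tau

# Hypothetical calibration scores, e.g. 1 - probe confidence at cue
# tokens where a forced exit was checked against the final answer.
cal = np.array([0.2, 0.35, 0.1, 0.5, 0.05, 0.3, 0.25, 0.4])
tau = conformal_threshold(cal, alpha=0.2)
```

Under exchangeability of calibration and test scores, this construction bounds the premature-exit rate by `alpha`, which is the kind of explicit, user-tunable guarantee the abstract describes.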


Video models are zero-shot learners and reasoners

Wiedemer, Thaddäus, Li, Yuxuan, Vicol, Paul, Gu, Shixiang Shane, Matarese, Nick, Swersky, Kevin, Kim, Been, Jaini, Priyank, Geirhos, Robert

arXiv.org Artificial Intelligence

The remarkable zero-shot capabilities of Large Language Models (LLMs) have propelled natural language processing from task-specific models to unified, generalist foundation models. This transformation emerged from simple primitives: large, generative models trained on web-scale data. Curiously, the same primitives apply to today's generative video models. Could video models be on a trajectory towards general-purpose vision understanding, much like LLMs developed general-purpose language understanding? We demonstrate that Veo 3 can solve a broad variety of tasks it wasn't explicitly trained for: segmenting objects, detecting edges, editing images, understanding physical properties, recognizing object affordances, simulating tool use, and more. These abilities to perceive, model, and manipulate the visual world enable early forms of visual reasoning like maze and symmetry solving. Veo's emergent zero-shot capabilities indicate that video models are on a path to becoming unified, generalist vision foundation models.


FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE

Le, Khiem, Tran, Tuan, Hua, Ting, Chawla, Nitesh V.

arXiv.org Artificial Intelligence

Existing resource-adaptive LoRA federated fine-tuning methods enable clients to fine-tune models using compressed versions of global LoRA matrices in order to accommodate varying compute resources across clients. This compression leads to suboptimal performance due to information loss. To address this, we propose FLAME, a novel federated learning framework based on the Sparse Mixture-of-Experts (SMoE) architecture. Unlike prior approaches, FLAME retains full (uncompressed) global LoRA matrices and achieves client-side adaptability by varying the number of activated experts per client. However, incorporating SMoE into federated learning introduces unique challenges: a mismatch in output magnitude caused by partial expert activation, and an imbalance in expert training quality across clients. FLAME tackles these challenges through a lightweight rescaling mechanism and an activation-aware aggregation scheme. Empirical results across diverse computational settings demonstrate that FLAME consistently outperforms existing methods, providing a robust and effective solution for resource-adaptive federated learning.
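The magnitude-mismatch problem from partial expert activation can be made concrete with a toy SMoE forward pass: if a client activates fewer experts, the selected gate weights no longer sum to the mass the model was trained with, so renormalizing them keeps output magnitude comparable across clients. The snippet below is a minimal illustration of that idea; it is a stand-in for, not a reproduction of, FLAME's rescaling mechanism, and the expert/gate setup is invented.

```python
import numpy as np

def smoe_output(x, experts, gate_logits, k_active):
    # Softmax over gate logits, then route to the top-k_active experts.
    gates = np.exp(gate_logits - gate_logits.max())
    gates /= gates.sum()
    top = np.argsort(gates)[::-1][:k_active]
    # Activating fewer experts shrinks the total selected gate mass;
    # renormalizing the selected gates rescales the output so clients
    # with different k_active produce comparable magnitudes.
    w = gates[top] / gates[top].sum()
    return sum(wi * experts[i](x) for wi, i in zip(w, top))
```

Because the renormalized weights always sum to one, a client activating 2 experts and a client activating 4 produce outputs on the same scale, which is the invariant the rescaling is meant to preserve.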


Action is the primary key: a categorical framework for episode description and logical reasoning

Fukada, Yoshiki

arXiv.org Artificial Intelligence

This research presents a computational framework for describing and recognizing episodes and for logical reasoning. This framework, named cognitive-logs, consists of a set of relational and graph databases. Cognitive-logs record knowledge, particularly in episodes that consist of "actions" represented by verbs in natural languages and "participants" who perform the actions. These objects are connected by arrows (morphisms) that link each action to its participant and link cause to effect. Operations based on category theory enable comparisons between episodes and deductive inferences, including abstractions of stories. One of the goals of this study is to develop a database-driven artificial intelligence. This artificial intelligence thinks like a human but possesses the accuracy and rigour of a machine. The vast capacities of databases (up to petabyte scales in current technologies) enable the artificial intelligence to store a greater volume of knowledge than neural-network-based artificial intelligences. Cognitive-logs serve as a model of human cognition and are designed with reference to cognitive linguistics. Cognitive-logs also have the potential to model various human mind activities.
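The episode structure described above (actions as verb nodes, participants as nodes, morphisms as arrows linking action to participant and cause to effect) can be sketched as a tiny labeled graph. The names, labels, and query below are illustrative assumptions for exposition, not the paper's actual database schema.

```python
# A minimal episode record in the spirit of cognitive-logs: nodes for
# actions (verbs) and participants, with labeled arrows (morphisms).
episode = {
    "nodes": {
        "a1": {"kind": "action", "verb": "open"},
        "a2": {"kind": "action", "verb": "enter"},
        "p1": {"kind": "participant", "name": "Alice"},
    },
    "arrows": [
        ("a1", "p1", "agent"),  # link each action to its participant
        ("a2", "p1", "agent"),
        ("a1", "a2", "cause"),  # link cause to effect
    ],
}

def effects_of(ep, action):
    # Follow outgoing 'cause' arrows from an action node.
    return [t for s, t, lbl in ep["arrows"] if s == action and lbl == "cause"]
```

Comparing two such graphs (e.g., by matching arrow patterns) is the graph-level analogue of the episode comparisons and abstractions the abstract attributes to category-theoretic operations.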


Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT -- A Text-to-SQL Parsing Comparison

Sun, Shuo, Zhang, Yuchen, Yan, Jiahuan, Gao, Yuze, Ong, Donovan, Chen, Bin, Su, Jian

arXiv.org Artificial Intelligence

The success of ChatGPT has ignited an AI race, with researchers striving to develop new large language models (LLMs) that can match or surpass the language understanding and generation abilities of commercial ones. In recent times, a number of models have emerged, claiming performance near that of GPT-3.5 or GPT-4 through various instruction-tuning methods. As practitioners of Text-to-SQL parsing, we are grateful for their valuable contributions to open-source research. However, it is important to approach these claims with a sense of scrutiny and ascertain the actual effectiveness of these models. Therefore, we pit six popular large language models against each other, systematically evaluating their Text-to-SQL parsing capability on nine benchmark datasets with five different prompting strategies, covering both zero-shot and few-shot scenarios. Regrettably, the open-sourced models fell significantly short of the performance achieved by closed-source models like GPT-3.5, highlighting the need for further work to bridge the performance gap between these models.
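The paper's five prompting strategies are not enumerated in the abstract, but the zero-shot vs. few-shot distinction it mentions is easy to make concrete: a zero-shot prompt supplies only the schema and question, while a few-shot prompt prepends solved (question, SQL) pairs. The builder below is a hypothetical illustration; its format and example names are assumptions, not the paper's templates.

```python
def build_prompt(schema, question, examples=None):
    # Zero-shot: schema + question only. Few-shot: also prepend solved
    # (question, SQL) demonstration pairs before the target question.
    parts = [f"-- Database schema:\n{schema}"]
    if examples:
        for q, sql in examples:
            parts.append(f"-- Question: {q}\n{sql}")
    parts.append(f"-- Question: {question}\nSELECT")
    return "\n\n".join(parts)
```

Ending the prompt with `SELECT` is a common trick that nudges the model to continue with a SQL completion rather than free-form prose.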


Hello Dolly: Democratizing the magic of ChatGPT with open models

#artificialintelligence

Update Apr 12, 2023: We have released Dolly 2.0, licensed for both research and commercial use. See the new blog post here. We show that anyone can take a dated off-the-shelf open source large language model (LLM) and give it magical ChatGPT-like instruction following ability by training it in 30 minutes on one machine, using high-quality training data. Surprisingly, instruction-following does not seem to require the latest or largest models: our model is only 6 billion parameters, compared to 175 billion for GPT-3. We open source the code for our model (Dolly) and show how it can be re-created on Databricks.


Databricks open-sources its Dolly large language AI model

#artificialintelligence

In an attempt to open up its technology to a wider audience, enterprise software company Databricks has released Dolly, a large language model and its associated training code under an open-source licence. Despite being based on a much smaller underlying model, the company says it has ChatGPT-like functionality and can be run "in-house". The move was inspired by the success of OpenAI's natural language platform ChatGPT, which became one of the fastest-growing consumer apps within a couple of months of its release in November last year. It has since caused some of the world's largest companies including Microsoft and Google to pivot and release generative and natural language AI tools. "We show that anyone can take a dated off-the-shelf open source LLM and give it magical ChatGPT-like instruction-following ability by training it in 30 minutes on one machine, using high-quality training data," Databricks wrote in a blog post explaining the decision.


This AI newsletter is all you need #40

#artificialintelligence

With the surging demand for generative AI, this week saw preparatory developments for the next wave of AI. Companies are fast-tracking the development of AI products, and generative AI tools are closer to becoming consumer products than ever before. They are already becoming powerful assistants for writers and programmers and rapidly taking on more challenges. The open-source community is also making significant progress in running local LLMs. For instance, Facebook's LLaMA model has continued to be a focal point for building in the academic and open-source community following the leak of its weights on 4chan.


'Killer robots' will be nothing like the movies show – here's where the real threats lie

#artificialintelligence

You might suppose Hollywood is good at predicting the future. Indeed, Robert Wallace, head of the CIA's Office of Technical Service and the US equivalent of MI6's fictional Q, has recounted how Russian spies would watch the latest Bond movie to see what technologies might be coming their way. Hollywood's continuing obsession with killer robots might therefore be of significant concern. The newest such movie is Apple TV's forthcoming sex robot courtroom drama Dolly. I never thought I'd write the phrase "sex robot courtroom drama", but there you go.

