firework
Twelve killed in China fireworks shop blast during Lunar New Year
An explosion at a fireworks shop in central China's Hubei province has killed at least 12 people, state media reported, marking the second deadly blast linked to fireworks as the country celebrates the Lunar New Year. The explosion tore through the shop in Xiangyang on Wednesday afternoon. Officials said five children and seven adults died in the explosion. The victims included the shop owner and customers who had been buying fireworks for holiday celebrations. Some had travelled from other areas to visit relatives during the festive period.
- North America > United States (0.53)
- South America (0.42)
- North America > Central America (0.42)
- (9 more...)
- Government (0.54)
- Media (0.34)
A Property Proofs
Section 3, which are analogous to those of [35]: Proposition. Hanabi is a cooperative card game that can be played with 2 to 5 people. In Hanabi, players can see all other players' hands but not their own. Hanabi (花火) means 'fireworks' in Japanese. Hint - The active agent chooses another player to grant a hint to.
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models
Chiang, Cheng-Han, Wang, Xiaofei, Li, Linjie, Lin, Chung-Ching, Lin, Kevin, Liu, Shujie, Wang, Zhendong, Yang, Zhengyuan, Lee, Hung-yi, Wang, Lijuan
Spoken Language Models (SLMs) are designed to take speech inputs and produce spoken responses. However, current SLMs lack the ability to perform an internal, unspoken thinking process before responding. In contrast, humans typically engage in complex mental reasoning internally, enabling them to communicate ideas clearly and concisely. Thus, integrating an unspoken thought process into SLMs is highly desirable. While naively generating a complete chain-of-thought (CoT) reasoning before starting to talk can enable thinking for SLMs, this induces additional latency for the speech response, as the CoT reasoning can be arbitrarily long. To solve this issue, we propose Stitch, a novel generation method that alternates between the generation of unspoken reasoning chunks and spoken response chunks. Since the audio duration of a chunk of spoken response is much longer than the time to generate the tokens in a chunk of spoken response, we use the remaining free time to generate the unspoken reasoning tokens. When a chunk of audio is played to the user, the model continues to generate the next unspoken reasoning chunk, achieving simultaneous thinking and talking. Remarkably, Stitch matches the latency of baselines that cannot generate unspoken CoT by design while outperforming those baselines by 15% on math reasoning datasets; Stitch also performs equally well on non-reasoning datasets as those baseline models. Some animations and demonstrations are on the project page: https://d223302.github.io/STITCH.
- North America > Canada > Ontario > Toronto (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (5 more...)
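The latency argument in the STITCH abstract above can be sketched with back-of-envelope arithmetic: while one chunk of audio plays, decoding the next speech chunk takes far less time than the playback, and the slack can be spent on unspoken reasoning tokens. All numbers below (chunk size, audio duration, decoding speed) are illustrative assumptions, not figures from the paper.

```python
from dataclasses import dataclass

@dataclass
class ChunkBudget:
    """Illustrative numbers only; real chunk sizes and decoding
    throughput are model- and hardware-dependent."""
    speech_tokens: int = 25        # tokens in one spoken-response chunk
    audio_seconds: float = 2.0     # playback duration of that chunk's audio
    tokens_per_second: float = 50  # model decoding throughput

def free_reasoning_tokens(b: ChunkBudget) -> int:
    """While one chunk's audio plays, the model must generate the *next*
    speech chunk; whatever playback time is left over can be spent
    decoding unspoken reasoning tokens -- the core observation behind
    simultaneous thinking and talking."""
    time_for_next_speech_chunk = b.speech_tokens / b.tokens_per_second
    slack = max(0.0, b.audio_seconds - time_for_next_speech_chunk)
    return int(slack * b.tokens_per_second)

def total_free_reasoning(n_chunks: int, b: ChunkBudget) -> int:
    """Reasoning tokens available across a whole response at no extra
    latency cost relative to a baseline that never thinks."""
    return n_chunks * free_reasoning_tokens(b)

budget = ChunkBudget()
per_chunk = free_reasoning_tokens(budget)  # 1.5 s of slack -> 75 tokens
```

With these assumed numbers, generating the next 25-token speech chunk takes 0.5 s of a 2.0 s playback window, leaving room for 75 reasoning tokens per chunk; a full CoT generated up front would instead delay the first audio by its entire length.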
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers
Orhon, Atila, Okan, Arda, Durmus, Berkin, Nagengast, Zach, Pacheco, Eduardo
Real-time Automatic Speech Recognition (ASR) is a fundamental building block for many commercial applications of ML, including live captioning, dictation, meeting transcriptions, and medical scribes. Accuracy and latency are the most important factors when companies select a system to deploy. We present WhisperKit, an optimized on-device inference system for real-time ASR that significantly outperforms leading cloud-based systems. We benchmark against server-side systems that deploy a diverse set of models, including a frontier model (OpenAI gpt-4o-transcribe), a proprietary model (Deepgram nova-3), and an open-source model (Fireworks large-v3-turbo). Our results show that WhisperKit matches the lowest latency at 0.46s while achieving the highest accuracy (2.2% WER). The optimizations behind the WhisperKit system are described in detail in this paper.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > Canada (0.04)
No more fireworks? Big change coming to 4th of July at Pasadena's Rose Bowl
Marking the end of a longtime tradition, the Fourth of July celebration at the Rose Bowl in Pasadena will not feature a fireworks show this year. Instead, there will be a drone show. The move comes as some venues have switched from fireworks to drone shows -- in which a fleet of drones performs a choreographed light show -- to celebrate the 4th of July. But drone shows have fallen flat for some. Notably Redondo Beach and Laguna Beach switched back to fireworks after trying out drone shows, and some promoters of fireworks shows have voiced criticism over efforts to transition to drone shows.
- North America > United States > California > Los Angeles County > Redondo Beach (0.26)
- North America > United States > California > San Diego County > San Diego (0.07)
- North America > United States > California > San Francisco County > San Francisco (0.05)
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
Kim, Jaemin, Chang, Hangeol, Hwang, Hyunmin, Kim, Choonghan, Ye, Jong Chul
Large Language Models (LLMs) have demonstrated remarkable general capabilities, but enhancing skills such as reasoning often demands substantial computational resources and may compromise their generalization. While Parameter-Efficient Fine-Tuning (PEFT) methods offer a more resource-conscious alternative, they typically require retraining for each LLM backbone due to architectural dependencies. To address these challenges, here we propose Universal Reasoner (UniR) - a single, lightweight, composable, and plug-and-play reasoning module that can be used with any frozen LLM to endow it with specialized reasoning capabilities. Specifically, UniR decomposes the reward into a standalone reasoning module that is trained independently using predefined rewards, effectively translating trajectory-level signals into token-level guidance. Once trained, UniR can be combined with any frozen LLM at inference time by simply adding its output logits to those of the LLM backbone. This additive structure naturally enables modular composition: multiple UniR modules trained for different tasks can be jointly applied by summing their logits, enabling complex reasoning via composition. Experimental results on mathematical reasoning and machine translation tasks show that UniR significantly outperforms existing baseline fine-tuning methods using the Llama3.2 model. Furthermore, UniR demonstrates strong weak-to-strong generalization: reasoning modules trained on smaller models effectively guide much larger LLMs. This makes UniR a cost-efficient, adaptable, and robust solution for enhancing reasoning in LLMs without compromising their core capabilities. Code is open-sourced at https://github.com/hangeol/UniR
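The additive composition described in the abstract above is simple enough to sketch directly: at each decoding step, the module's logits are added to the frozen backbone's logits before sampling. The following minimal NumPy illustration uses a toy 4-token vocabulary with invented logit values; it shows only the logit-summing mechanism, not the reward-based training of the module itself.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over a 1-D logit vector."""
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()

def compose_logits(backbone_logits, module_logits_list, weights=None):
    """Combine a frozen backbone's next-token logits with one or more
    reasoning-module logits by simple addition, as the abstract describes.
    `weights` (optional) scales each module's contribution."""
    combined = np.asarray(backbone_logits, dtype=float).copy()
    if weights is None:
        weights = [1.0] * len(module_logits_list)
    for w, m in zip(weights, module_logits_list):
        combined += w * np.asarray(m, dtype=float)
    return combined

# Toy vocabulary of 4 tokens: the backbone slightly prefers token 0,
# while a (hypothetical) math-reasoning module strongly prefers token 2.
backbone = np.array([2.0, 1.0, 1.5, 0.5])
math_module = np.array([0.0, 0.0, 3.0, 0.0])

probs = softmax(compose_logits(backbone, [math_module]))
```

Because composition happens purely in logit space, multiple modules can be stacked by extending `module_logits_list`, and the backbone's weights are never touched.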
Model Equality Testing: Which Model Is This API Serving?
Gao, Irena, Liang, Percy, Guestrin, Carlos
Users often interact with large language models through black-box inference APIs, both for closed- and open-weight models (e.g., Llama models are popularly accessed via Amazon Bedrock and Azure AI Studio). In order to cut costs or add functionality, API providers may quantize, watermark, or finetune the underlying model, changing the output distribution -- often without notifying users. We formalize detecting such distortions as Model Equality Testing, a two-sample testing problem, where the user collects samples from the API and a reference distribution and conducts a statistical test to see if the two distributions are the same. We find that tests based on the Maximum Mean Discrepancy between distributions are powerful for this task: a test built on a simple string kernel achieves a median of 77.4% power against a range of distortions, using an average of just 10 samples per prompt. We then apply this test to commercial inference APIs for four Llama models, finding that 11 out of 31 endpoints serve different distributions than reference weights released by Meta.
- Africa > Middle East > Libya (0.14)
- North America > United States > New York (0.04)
- Europe > United Kingdom > England > Greater London > London > City of London (0.04)
- (17 more...)
- Transportation (1.00)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- (7 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
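The Model Equality Testing abstract above describes a kernel two-sample test on API outputs. A minimal sketch of that idea follows, using a character n-gram cosine similarity as an assumed stand-in for the paper's string kernel, a biased MMD estimate, and a permutation test to calibrate the p-value; the sample strings are invented.

```python
import random
from collections import Counter

def ngram_kernel(s, t, n=3):
    """Cosine similarity between character n-gram count vectors.
    (An illustrative stand-in for the paper's string kernel.)"""
    a = Counter(s[i:i + n] for i in range(len(s) - n + 1))
    b = Counter(t[i:i + n] for i in range(len(t) - n + 1))
    dot = sum(a[g] * b[g] for g in a)
    na = sum(v * v for v in a.values()) ** 0.5
    nb = sum(v * v for v in b.values()) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

def mmd2(xs, ys, kernel):
    """Biased estimate of squared Maximum Mean Discrepancy."""
    kxx = sum(kernel(a, b) for a in xs for b in xs) / len(xs) ** 2
    kyy = sum(kernel(a, b) for a in ys for b in ys) / len(ys) ** 2
    kxy = sum(kernel(a, b) for a in xs for b in ys) / (len(xs) * len(ys))
    return kxx + kyy - 2 * kxy

def permutation_test(xs, ys, kernel, n_perm=200, seed=0):
    """p-value: fraction of random relabelings whose MMD is at least
    as large as the observed one."""
    rng = random.Random(seed)
    observed = mmd2(xs, ys, kernel)
    pooled = list(xs) + list(ys)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if mmd2(pooled[:len(xs)], pooled[len(xs):], kernel) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Invented samples: a 'reference' set, an API serving the same distribution,
# and an API whose outputs have drifted (e.g., after quantization).
ref = [f"fireworks show {i} lit up the harbor sky" for i in range(0, 12, 2)]
api_same = [f"fireworks show {i} lit up the harbor sky" for i in range(1, 12, 2)]
api_drift = [f"quantized model reply {i} reads very differently" for i in range(6)]

p_same = permutation_test(ref, api_same, ngram_kernel)
p_drift = permutation_test(ref, api_drift, ngram_kernel)
```

A small p-value flags the API's output distribution as different from the reference; in practice the paper draws multiple samples per prompt, which this toy example does not model.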
Drones carrying fireworks: why the world's most famous gunpowder artist is collaborating with AI
For decades, Cai Guo-Qiang has been the world's foremost fine artist of explosions. He is famous for his massive fireworks displays, from his glowing footsteps in the sky at the opening of the 2008 Beijing Olympics, to his 2015 Sky Ladder, a 1,650-foot flaming ladder to heaven featured in a Netflix documentary. Recently, the gunpowder artist has become obsessed with a new threatening technology: artificial intelligence. AI "brings me more anxiety, but also, freshness", the 66-year-old Chinese artist told me last week at the historic Los Angeles Memorial Coliseum, where he was preparing for his newest "explosion event", which would be the kickoff of a major arts festival opening in southern California this month. "It's similar to why I use gunpowder," Cai told me.
- North America > United States > California > Los Angeles County > Los Angeles (0.28)
- Asia > China > Beijing > Beijing (0.25)
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments
Chen, Junzhe, Hu, Xuming, Liu, Shuodi, Huang, Shiyu, Tu, Wei-Wei, He, Zhaofeng, Wen, Lijie
Recent advancements in large language models (LLMs) have revealed their potential for achieving autonomous agents possessing human-level intelligence. However, existing benchmarks for evaluating LLM agents either use static datasets, potentially leading to data leakage, or focus only on single-agent scenarios, overlooking the complexities of multi-agent interactions. There is a lack of a benchmark that evaluates the diverse capabilities of LLM agents in multi-agent, dynamic environments. To this end, we introduce LLMArena, a novel and easily extensible framework for evaluating the diverse capabilities of LLMs in multi-agent dynamic environments. LLMArena encompasses seven distinct gaming environments, employing TrueSkill scoring to assess crucial abilities in LLM agents, including spatial reasoning, strategic planning, numerical reasoning, risk assessment, communication, opponent modeling, and team collaboration. We conduct an extensive experiment and human evaluation among different sizes and types of LLMs, showing that LLMs still have a significant journey ahead in their development towards becoming fully autonomous agents, especially in opponent modeling and team collaboration. We hope LLMArena could guide future research towards enhancing these capabilities in LLMs, ultimately leading to more sophisticated and practical applications in dynamic, multi-agent settings. The code and data will be available.
- Europe > Austria > Vienna (0.14)
- North America > United States > Texas (0.05)
- Europe > Middle East (0.04)
- (9 more...)
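The LLMArena abstract above scores agents from pairwise match outcomes with TrueSkill. The sketch below uses the simpler Elo update as an assumed stand-in (TrueSkill additionally tracks a per-agent uncertainty that Elo lacks); the agent names and match results are invented.

```python
def elo_update(r_a, r_b, score_a, k=32):
    """One pairwise rating update using the Elo rule, a simplified
    stand-in for TrueSkill. score_a is 1.0 for a win by agent A,
    0.5 for a draw, and 0.0 for a loss."""
    expected_a = 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))
    r_a_new = r_a + k * (score_a - expected_a)
    r_b_new = r_b + k * ((1.0 - score_a) - (1.0 - expected_a))
    return r_a_new, r_b_new

def run_league(results, start=1000.0):
    """Fold a sequence of (agent_a, agent_b, score_a) outcomes from
    repeated games into a rating per agent."""
    ratings = {}
    for a, b, s in results:
        ra = ratings.setdefault(a, start)
        rb = ratings.setdefault(b, start)
        ratings[a], ratings[b] = elo_update(ra, rb, s)
    return ratings

# Invented outcomes: agent_a wins twice, then draws.
matches = [
    ("agent_a", "agent_b", 1.0),
    ("agent_a", "agent_b", 1.0),
    ("agent_a", "agent_b", 0.5),
]
ratings = run_league(matches)
```

One property worth noting: each update moves the two ratings by equal and opposite amounts, so the total rating mass in the league is conserved, which is what makes the resulting numbers comparable across agents.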