
ML-Tool-Bench: Tool-Augmented Planning for ML Tasks

Chittepu, Yaswanth, Addanki, Raghavendra, Mai, Tung, Rao, Anup, Kveton, Branislav

arXiv.org Artificial Intelligence

The development of autonomous machine learning (ML) agents capable of end-to-end data science workflows represents a significant frontier in artificial intelligence. These agents must orchestrate complex sequences of data analysis, feature engineering, model selection, and hyperparameter optimization, tasks that require sophisticated planning and iteration. While recent work on building ML agents has explored using large language models (LLMs) for direct code generation, tool-augmented approaches offer greater modularity and reliability. However, existing tool-use benchmarks focus primarily on task-specific tool selection or argument extraction for tool invocation, failing to evaluate the sophisticated planning capabilities required for ML agents. In this work, we introduce a comprehensive benchmark for evaluating tool-augmented ML agents using a curated set of 61 specialized tools and 15 tabular ML challenges from Kaggle. Our benchmark goes beyond traditional tool-use evaluation by incorporating in-memory named-object management, allowing agents to flexibly name, save, and retrieve intermediate results throughout their workflows. We demonstrate that standard ReAct-style approaches struggle to generate valid tool sequences for complex ML pipelines, and that tree search methods with LLM-based evaluation underperform due to inconsistent state scoring. To address these limitations, we propose two simple approaches: 1) using shaped deterministic rewards with structured textual feedback, and 2) decomposing the original problem into a sequence of sub-tasks, which significantly improves trajectory validity and task performance. Using GPT-4o, our approach improves over ReAct by 16.52 percentile positions in the median across all Kaggle challenges. We believe our work provides a foundation for developing more capable tool-augmented planning ML agents.
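The named-object idea described in the abstract can be sketched in a few lines of Python. This is an illustrative toy, not the benchmark's actual API: the class and tool names are hypothetical, and it only shows how a planner might chain sub-tasks by reading and writing intermediate results under user-chosen names instead of passing raw objects through the LLM.

```python
# Hypothetical sketch of an in-memory named-object store for a tool-augmented
# agent. All names and signatures here are illustrative assumptions, not the
# interface of ML-Tool-Bench itself.

class ObjectStore:
    """In-memory registry mapping chosen names to intermediate results."""
    def __init__(self):
        self._objects = {}

    def save(self, name, obj):
        self._objects[name] = obj
        return f"saved '{name}' ({type(obj).__name__})"

    def load(self, name):
        if name not in self._objects:
            # A structured error message doubles as textual feedback for the agent.
            raise KeyError(f"no object named '{name}'")
        return self._objects[name]

def run_subtask(store, tool, input_names, output_name):
    """Run one tool on named inputs and save its result under output_name."""
    args = [store.load(n) for n in input_names]
    result = tool(*args)
    return store.save(output_name, result)

# Toy two-step pipeline: "load data" then "engineer a feature" as sub-tasks.
store = ObjectStore()
store.save("raw_rows", [1.0, 2.0, 3.0])
msg = run_subtask(store, lambda xs: [x * x for x in xs], ["raw_rows"], "squared")
print(store.load("squared"))  # [1.0, 4.0, 9.0]
```

Decomposing the pipeline this way keeps each tool call small and checkable, which is the property the paper exploits to improve trajectory validity.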


A Compliance-Preserving Retrieval System for Aircraft MRO Task Search

Jo, Byungho

arXiv.org Artificial Intelligence

Aircraft Maintenance Technicians (AMTs) spend up to 30% of work time searching manuals, a documented efficiency bottleneck in MRO operations where every procedure must be traceable to certified sources. We present a compliance-preserving retrieval system that adapts LLM reranking and semantic search to aviation MRO environments by operating alongside, rather than replacing, certified legacy viewers. The system constructs revision-robust embeddings from ATA chapter hierarchies and uses vision-language parsing to structure certified content, allowing technicians to preview ranked tasks and access verified procedures in existing viewers. Evaluation on 49k synthetic queries achieves >90% retrieval accuracy, while bilingual controlled studies with 10 licensed AMTs demonstrate 90.9% top-10 success rate and 95% reduction in lookup time, from 6-15 minutes to 18 seconds per task. These gains provide concrete evidence that semantic retrieval can operate within strict regulatory constraints and meaningfully reduce operational workload in real-world multilingual MRO workflows.


Human-Adversarial Visual Question Answering (Supplementary Material): A. Training Details

Neural Information Processing Systems

We use a batch size of 64 for 236K updates with a multi-step learning rate scheduler with steps at 180K and 216K, a learning rate ratio of 0.2, and a warmup for 54K updates. Training takes an average of 8 hours; for another configuration, it takes an average of 17 hours. We set the batch size to 8 and the weight decay to 1e-4, and train the model on 8 GPUs for 2 days. We train with an MLM loss using a batch size of 64, which takes an average of 13 hours.
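The schedule described above (warmup for 54K updates, then multiplicative decay by 0.2 at updates 180K and 216K) can be sketched as a small Python function. The base learning rate is an assumed placeholder, since the supplement does not state it here; this is a minimal illustration of the shape of the schedule, not the authors' training code.

```python
# Minimal sketch of the described schedule: linear warmup for 54K updates,
# then step decay by a ratio of 0.2 at the 180K and 216K milestones.
# base_lr is an assumed placeholder value.

def lr_at(step, base_lr=1e-4, warmup=54_000,
          milestones=(180_000, 216_000), ratio=0.2):
    if step < warmup:
        return base_lr * step / warmup           # linear warmup
    scale = ratio ** sum(step >= m for m in milestones)
    return base_lr * scale                       # step decay at each milestone

print(lr_at(27_000))   # mid-warmup: half the base LR
print(lr_at(100_000))  # plateau at the base LR
print(lr_at(200_000))  # after the first milestone: base_lr * 0.2
```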


Preview, Accept or Discard? A Predictive Low-Motion Interaction Paradigm

Berengueres, Jose

arXiv.org Artificial Intelligence

Repetitive strain injury (RSI) affects roughly one in five computer users and remains largely unresolved despite decades of ergonomic mouse redesign. All such devices share a fundamental limitation: they still require fine-motor motion to operate. This work investigates whether predictive, AI-assisted input can reduce that motion by replacing physical pointing with ranked on-screen suggestions. To preserve user agency, we introduce Preview Accept Discard (PAD), a zero-click interaction paradigm that lets users preview predicted GUI targets, cycle through a small set of ranked alternatives, and accept or discard them via key-release timing. We evaluate PAD in two settings: a browser-based email client and an ISO 9241-9 keyboard-prediction task under varying top-3 accuracies. Across both studies, PAD substantially reduces hand motion relative to trackpad use, while matching trackpad task times only when prediction accuracies are similar to those of the best spell-checkers.


iOS 26 adds a new app to your iPhone. Here's how to use it.

Popular Science

You're not imagining it--there is a new app on your iPhone. Apple's big iOS 26 software update for 2025 has now reached millions of iPhones, and brought with it a bunch of new features and an updated visual interface.


Unlock this AI feature in Firefox and never fall for a scam link again

PCWorld

AI-powered link previews are a great way to see ahead so you don't end up clicking on malicious links. Starting with version 138 (released back in April), Firefox has had a new-yet-still-deactivated option that uses "artificial intelligence" to display a mini preview of the destination page for a link. The feature determines the content of the page in question and displays a pop-up, and this preview can help to avoid potential scams and malware when navigating unsolicited links. The AI feature works locally on your PC and, according to Mozilla, doesn't use a cloud service.


Oops! Google's unannounced new Nest Cams spotted in Google Home app

PCWorld

The big smart home manufacturers have been leaking like sieves of late, giving us juicy early previews of their super-secret upcoming releases. Philips Hue recently fell victim to its own leak that revealed its entire fall product lineup, and now Google appears to have unwittingly shared images of its new Nest cam hardware. First, a quick recap: Google had already teased--intentionally--a new Gemini smart speaker during its Pixel event a couple of weeks back, and just days ago it promised an upcoming Google Home update on October 1, complete with a partial image of what appears to be a new Nest camera. Instead, it seems Google may have inadvertently left images of the new Nest hardware in the Google Home app following a recent update. The images, which were spotted by Android Authority and appear to have been subsequently yanked from the app, don't reveal anything startlingly new about the Nest cams, aside from the fact that they exist.


AHELM: A Holistic Evaluation of Audio-Language Models

Lee, Tony, Tu, Haoqin, Wong, Chi Heem, Wang, Zijun, Yang, Siwei, Mai, Yifan, Zhou, Yuyin, Xie, Cihang, Liang, Percy

arXiv.org Artificial Intelligence

Evaluations of audio-language models (ALMs) -- multimodal models that take interleaved audio and text as input and output text -- are hindered by the lack of standardized benchmarks; most benchmarks measure only one or two capabilities and omit evaluative aspects such as fairness or safety. Furthermore, comparison across models is difficult as separate evaluations test a limited number of models and use different prompting methods and inference parameters. To address these shortfalls, we introduce AHELM, a benchmark that aggregates various datasets -- including 2 new synthetic audio-text datasets called PARADE, which evaluates the ALMs on avoiding stereotypes, and CoRe-Bench, which measures reasoning over conversational audio through inferential multi-turn question answering -- to holistically measure the performance of ALMs across 10 aspects we have identified as important to the development and usage of ALMs: audio perception, knowledge, reasoning, emotion detection, bias, fairness, multilinguality, robustness, toxicity, and safety. We also standardize the prompts, inference parameters, and evaluation metrics to ensure equitable comparisons across models. We test 14 open-weight and closed-API ALMs from 3 developers and 3 additional simple baseline systems each consisting of an automatic speech recognizer and a language model. Our results show that while Gemini 2.5 Pro ranks top in 5 out of 10 aspects, it exhibits group unfairness ($p=0.01$) on ASR tasks whereas most of the other models do not. We also find that the baseline systems perform reasonably well on AHELM, with one ranking 6th overall despite having only speech-to-text capabilities. For transparency, all raw prompts, model generations, and outputs are available on our website at https://crfm.stanford.edu/helm/audio/v1.0.0. AHELM is intended to be a living benchmark and new datasets and models will be added over time.



SciDA: Scientific Dynamic Assessor of LLMs

Zhou, Junting, Miao, Tingjia, Liao, Yiyan, Wang, Qichao, Wen, Zhoufutu, Wang, Yanqin, Huang, Yunjie, Yan, Ge, Wang, Leqi, Xia, Yucheng, Gao, Hongwan, Zeng, Yuansong, Zheng, Renjie, Dun, Chen, Liang, Yitao, Yang, Tong, Huang, Wenhao, Zhang, Ge

arXiv.org Artificial Intelligence

Advances in the reasoning capabilities of Large Language Models (LLMs) enable them to solve scientific problems with greater efficacy. A high-quality benchmark for comprehensive and appropriate assessment is therefore important, yet existing ones either face the risk of data contamination or lack disciplinary coverage. Specifically, because the data sources for LLM training and static benchmarks overlap, answer keys or number patterns can be inadvertently memorized (i.e., data contamination), leading to systematic overestimation of reasoning capabilities, especially numerical reasoning. We propose SciDA, a multidisciplinary benchmark consisting exclusively of over 1k Olympiad-level numerical computation problems, which allows randomized numerical initializations for each inference round to avoid reliance on fixed numerical patterns. We conduct a series of experiments with top-performing closed-source and open-source LLMs, and observe that their performance drops significantly under random numerical initialization. Thus, we provide truthful and unbiased assessments of the numerical reasoning capabilities of LLMs. The data is available at https://huggingface.co/datasets/m-a-p/SciDA
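The randomized-initialization idea can be illustrated with a toy problem template. This sketch is an assumption about the general technique, not SciDA's actual implementation: each evaluation round samples fresh parameter values for a templated question and recomputes the ground-truth answer with a solver, so a model cannot score well by recalling memorized answer values.

```python
# Illustrative sketch (not SciDA's code) of randomized numerical
# initialization for a benchmark problem template.
import random

def instantiate(template, param_ranges, rng):
    """Fill a problem template with freshly sampled integer parameters."""
    params = {k: rng.randint(lo, hi) for k, (lo, hi) in param_ranges.items()}
    return template.format(**params), params

def reference_answer(params):
    """Ground-truth solver for this toy template: a*b + c."""
    return params["a"] * params["b"] + params["c"]

rng = random.Random(0)  # seed per evaluation round
template = "Compute a*b + c for a={a}, b={b}, c={c}."
question, params = instantiate(
    template, {"a": (2, 9), "b": (2, 9), "c": (1, 99)}, rng
)
print(question, "->", reference_answer(params))
```

Because the answer is recomputed from the sampled parameters each round, any gap between a model's score on fixed and on randomized instances directly measures reliance on memorized numbers.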