AITopics | powerful model

Collaborating Authors

powerful model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Download: a startup has a solution for AI's groupthink problem

MIT Technology ReviewJul-2-2026, 12:10:00 GMT

The Download: a startup has a solution for AI's groupthink problem Plus: Scientists say they have built a cell from scratch for the first time. LLMs are stuck in a groupthink groove. This startup is trying to get them out. Open up your chatbot of choice--Claude, ChatGPT, Gemini--and type "Give me a random number between 1 and 10." You're going to get 7. Almost always. That won't work every time--but if it did for you, you may wonder if I have superpowers. The truth is that most large language models are stuck in a rut.

large language model, machine learning, natural language, (18 more...)

MIT Technology Review

Country:

North America > United States (0.51)
Asia (0.31)

Genre: Research Report (0.56)

Industry: Government > Regional Government (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning

Neural Information Processing SystemsJun-18-2026, 18:47:34 GMT

Large language models (LLMs) have achieved impressive results on complex reasoning tasks, but their high inference cost remains a major barrier to real-world deployment. A promising solution is to use cascaded inference, where small, cheap models handle easy queries, and only the hardest examples are escalated to more powerful models. However, existing cascade methods typically rely on supervised training with labeled data, offer no theoretical generalization guarantees, and provide limited control over test-time computational cost. We introduce C3PO (Cost Controlled Cascaded Prediction Optimization), a self-supervised framework for optimizing LLM cascades under probabilistic cost constraints. By focusing on minimizing regret with respect to the most powerful model (MPM), C3PO avoids the need for labeled data by constructing a cascade using only unlabeled model outputs. It leverages conformal prediction to bound the probability that inference cost exceeds a user-specified budget. We provide theoretical guarantees on both cost control and generalization error, and show that our optimization procedure is effective even with small calibration sets. Empirically, C3PO achieves stateof-the-art performance across a diverse set of reasoning benchmarks including GSM8K, MATH-500, BigBench-Hard and AIME, outperforming strong LLM cascading baselines in both accuracy and cost-efficiency. Our results demonstrate that principled, label-free cascade optimization can enable scalable LLM deployment.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Asia (0.92)
Europe (0.67)
North America > Canada (0.67)
North America > United States (0.46)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning

Neural Information Processing SystemsJun-12-2026, 22:00:27 GMT

artificial intelligence, large language model, natural language, (8 more...)

Neural Information Processing Systems

Genre: Research Report (0.56)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.72)

Add feedback

03255088ed63354a54e0e5ed957e9008-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-7-2026, 07:55:33 GMT

mage, multi-step rollout, td-error, (13 more...)

Neural Information Processing Systems

Genre: Summary/Review (0.41)

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

03255088ed63354a54e0e5ed957e9008-AuthorFeedback.pdf

Neural Information Processing SystemsOct-1-2025, 22:11:44 GMT

artificial intelligence, mage, td-error, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

The Download: meet Cathy Tie, and Anthropic's new AI models

MIT Technology ReviewMay-23-2025, 12:10:00 GMT

Since the Chinese biophysicist He Jiankui was released from prison in 2022, he has sought to make a scientific comeback and to repair his reputation after a three-year incarceration for illegally creating the world's first gene-edited children. One area of visible success on his come-back trail has been his X.com account. Over the past few years, his account has evolved from sharing mundane images of his daily life to spreading outrageous, antagonistic messages. This has left observers unsure what to take seriously. Last month, in reply to MIT Technology Review's questions about who was responsible for the account's transformation into a font of clever memes, He emailed us back: "It's thanks to Cathy Tie." Tie is no stranger to the public spotlight.

anthropic, meet cathy tie, new ai model, (5 more...)

MIT Technology Review

Technology: Information Technology > Artificial Intelligence (0.80)

Add feedback

eufy launches the world's first robot vacuum with a portable deep cleaner (plus other powerful model)

PCWorldApr-8-2025, 16:59:25 GMT

Are you ready to completely overhaul your floor cleaning routines? The eufy E28 and E25 have just landed, and we bet you're going to want one of these robotic cleaners delivered to your home sooner rather than later. Under pre-sale now, be sure to order and save your spot in line with these powerful units. The E25 and E28 are two brand-new models unveiled by the company. Both feature eufy's award-winning HydroJet mopping technology and deliver a jaw-dropping 20,000Pa suction power for a deep clean.

eufy launch, powerful model, robot vacuum, (4 more...)

PCWorld

Technology: Information Technology > Artificial Intelligence > Robots (0.87)

Add feedback

How Do You Measure AI?

Communications of the ACMMar-20-2025, 15:34:32 GMT

Millions of people use artificial intelligence (AI) tools like ChatGPT daily to do everything from generating code to drawing images to creating business ideas. Those AI tools appear to be getting better. Back in November 2022 when it was launched, ChatGPT was powered by GPT-3.5, at the time the most powerful model offered by OpenAI. Yet GPT-3.5 was quickly eclipsed by GPT-4 just a few months later. GPT-4 crushed GPT-3.5 on a range of benchmarks, including its performance on the bar exam (GPT-4 scored in the 90th percentile; GPT-3.5 in the 10th).

large language model, machine learning, measure ai, (10 more...)

Communications of the ACM

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Under Trump, AI Scientists Are Told to Remove 'Ideological Bias' From Powerful Models

WIREDMar-14-2025, 23:29:46 GMT

The National Institute of Standards and Technology (NIST) has issued new instructions to scientists that partner with the US Artificial Intelligence Safety Institute (AISI) that eliminate mention of "AI safety," "responsible AI," and "AI fairness" in the skills it expects of members and introduces a request to prioritize "reducing ideological bias, to enable human flourishing and economic competitiveness." The information comes as part of an updated cooperative research and development agreement for AI Safety Institute consortium members, sent in early March. Previously, that agreement encouraged researchers to contribute technical work that could help identify and fix discriminatory model behavior related to gender, race, age, or wealth inequality. Such biases are hugely important because they can directly affect end users and disproportionately harm minorities and economically disadvantaged groups. The new agreement removes mention of developing tools "for authenticating content and tracking its provenance" as well as "labeling synthetic content," signaling less interest in tracking misinformation and deep fakes.

artificial intelligence, institute, natural language, (14 more...)

WIRED

Country: North America > United States (0.74)

Industry: Government > Regional Government > North America Government > United States Government (0.74)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.42)

Add feedback

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Anagnostidis, Sotiris, Bachmann, Gregor, Kim, Yeongmin, Kohler, Jonas, Georgopoulos, Markos, Sanakoyeu, Artsiom, Du, Yuming, Pumarola, Albert, Thabet, Ali, Schönfeld, Edgar

arXiv.org Artificial IntelligenceFeb-27-2025

Despite their remarkable performance, modern Diffusion Transformers are hindered by substantial resource requirements during inference, stemming from the fixed and large amount of compute needed for each denoising step. In this work, we revisit the conventional static paradigm that allocates a fixed compute budget per denoising iteration and propose a dynamic strategy instead. Our simple and sample-efficient framework enables pre-trained DiT models to be converted into \emph{flexible} ones -- dubbed FlexiDiT -- allowing them to process inputs at varying compute budgets. We demonstrate how a single \emph{flexible} model can generate images without any drop in quality, while reducing the required FLOPs by more than $40$\% compared to their static counterparts, for both class-conditioned and text-conditioned image generation. Our method is general and agnostic to input and conditioning modalities. We show how our approach can be readily extended for video generation, where FlexiDiT models generate samples with up to $75$\% less compute without compromising performance.

arxiv preprint arxiv, patch size, weak model, (14 more...)

arXiv.org Artificial Intelligence

2502.20126

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback