AITopics | second half

Collaborating Authors

second half

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

eae3af0f5868f0a2eceb74208966d55b-Paper-Conference.pdf

Neural Information Processing SystemsJun-23-2026, 01:21:53 GMT

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

Asia (1.00)
Europe (0.67)
North America > United States > California > Los Angeles County (0.27)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)

Add feedback

On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection

Neural Information Processing SystemsJun-15-2026, 08:42:46 GMT

Large language models (LLMs) raise concerns about content authenticity and integrity because they can generate human-like text at scale. Text watermarks, which embed detectable statistical signals into generated text, offer a provable way to verify content origin. Many detection methods rely on pivotal statistics that are i.i.d.

gof test, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Leisure & Entertainment > Sports > Basketball (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

BayesTune: Bayesian Sparse Deep Model Fine-tuning

Neural Information Processing SystemsApr-29-2026, 19:48:41 GMT

Deep learning practice is increasingly driven by powerful foundation models (FM), pre-trained at scale and then fine-tuned for specific tasks of interest. A key property of this workflow is the efficacy of performing sparse or parameter-efficient finetuning, meaning that by updating only a tiny fraction of the whole FM parameters on a downstream task can lead to surprisingly good performance, often even superior to a full model update. However, it is not clear what is the optimal and principled way to select which parameters to update. Although a growing number of sparse fine-tuning ideas have been proposed, they are mostly not satisfactory, relying on hand-crafted heuristics or heavy approximation. In this paper we propose a novel Bayesian sparse fine-tuning algorithm: we place a (sparse) Laplace prior for each parameter of the FM, with the mean equal to the initial value and the scale parameter having a hyper-prior that encourages small scale.

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Genre:

Workflow (0.48)
Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

BayesTune: Bayesian Sparse Deep Model Fine-tuning

Neural Information Processing SystemsFeb-17-2026, 04:42:08 GMT

Deep learning practice is increasingly driven by powerful foundation models (FM), pre-trained at scale and then fine-tuned for specific tasks of interest.

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

50c1f44e426560f3f2cdcb3e19e39903-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-12-2026, 04:01:26 GMT

dataset, naomi, sequence, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.75)

Add feedback

69469da823348084ca8933368ecbf676-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 13:56:17 GMT

In this section, we examine three algorithms via four numerical examples. The first algorithm is the Sliding Window-UCB (SW-UCB) algorithm presented in our paper. The second algorithm is the naive UCB algorithm without any sliding windows (Agrawal and Devanur, 2014). The third algorithm is LagrangeBwK presented in (Immorlica et al., 2019), which is originally proposed for the adversarial BwK problem. Note that the LagrangeBwK requires an approximation of the static best distribution benchmark. For simplicity, we put the exact value of the benchmark into the algorithm. All the regret performances are reported based on the average over 100 simulation trials.

algorithm, artificial intelligence, probability, (17 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)

Add feedback

Jensen Huang Says Nvidia's New Vera Rubin Chips Are in 'Full Production'

WIREDJan-5-2026, 23:05:32 GMT

Jensen Huang Says Nvidia's New Vera Rubin Chips Are in'Full Production' The chip giant says Vera Rubin will sharply cut the cost of training and running AI models, strengthening the appeal of its integrated computing platform. Nvidia CEO Jensen Huang says that the company's next-generation AI superchip platform, Vera Rubin, is on schedule to begin arriving to customers later this year. "Today, I can tell you that Vera Rubin is in full production," Huang said during a press event on Monday at the annual CES technology trade show in Las Vegas. Rubin will cut the cost of running AI models to about one-tenth of Nvidia's current leading chip system, Blackwell, the company told analysts and journalists during a call on Sunday. Nvidia also said Rubin can train certain large models using roughly one-fourth as many chips as Blackwell requires.

jensen huang say nvidia, nvidia, rubin, (10 more...)

WIRED

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.25)
North America > United States > California (0.15)
North America > United States > Wisconsin (0.05)
(5 more...)

Industry: Information Technology > Hardware (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.51)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

OpenAI thought to be preparing for 1tn stock market float

The GuardianOct-30-2025, 18:22:08 GMT

A float would support Sam Altman's ambitions to splash trillions of dollars on building datacentres. A float would support Sam Altman's ambitions to splash trillions of dollars on building datacentres. OpenAI is reportedly gearing up for a stock market listing valuing the company at $1tn (£760bn) as soon as next year, in what would be one of the biggest ever initial public offerings. The developer behind the hit AI chatbot ChatGPT is considering whether to file for an IPO as soon as the second half of 2026, according to Reuters, which cited people familiar with the matter. The company is thought to be looking to raise at least $60bn.

building datacentre, openai, stock market float, (11 more...)

The Guardian

Country:

North America > United States (0.16)
Oceania > Australia (0.05)
Europe > United Kingdom > England (0.05)
Europe > Ukraine (0.05)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.73)

Add feedback

OpenAI lays groundwork for juggernaut IPO at up to 1 trillion valuation

The Japan TimesOct-30-2025, 00:17:00 GMT

OpenAI is considering filing with securities regulators as soon as the second half of 2026, some people familiar with the matter said. SAN FRANCISCO - OpenAI is laying the groundwork for an initial public offering that could value the company at up to $1 trillion, three people familiar with the matter said, in what could be one of the biggest IPOs of all time. OpenAI is considering filing with securities regulators as soon as the second half of 2026, some of the people said. In preliminary discussions, the company has looked at raising $60 billion at the low end and likely more, the people said. They cautioned that talks are early and plans -- including the figures and timing -- could change depending on business growth and market conditions.

groundwork, juggernaut ipo, openai, (9 more...)

The Japan Times

Country:

North America > United States > California > San Francisco County > San Francisco (0.25)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.12)
Asia > Japan > Kyūshū & Okinawa > Kyūshū (0.05)
Asia > Japan > Honshū > Kansai > Osaka Prefecture > Osaka (0.05)

Industry:

Law (1.00)
Government (0.93)
Banking & Finance > Trading (0.71)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Do Language Models Use Their Depth Efficiently?

Csordás, Róbert, Manning, Christopher D., Potts, Christopher

arXiv.org Artificial IntelligenceOct-29-2025

Modern LLMs are increasingly deep, and depth correlates with performance, albeit with diminishing returns. However, do these models use their depth efficiently? Do they compose more features to create higher-order computations that are impossible in shallow models, or do they merely spread the same kinds of computation out over more layers? To address these questions, we analyze the residual stream of the Llama 3.1, Qwen 3, and OLMo 2 family of models. We find: First, comparing the output of the sublayers to the residual stream reveals that layers in the second half contribute much less than those in the first half, with a clear phase transition between the two halves. Second, skipping layers in the second half has a much smaller effect on future computations and output predictions. Third, for multihop tasks, we are unable to find evidence that models are using increased depth to compose subresults in examples involving many hops. Fourth, we seek to directly address whether deeper models are using their additional layers to perform new kinds of computation. To do this, we train linear maps from the residual stream of a shallow model to a deeper one. We find that layers with the same relative depth map best to each other, suggesting that the larger model simply spreads the same computations out over its many layers. All this evidence suggests that deeper models are not using their depth to learn new kinds of computation, but only using the greater depth to perform more fine-grained adjustments to the residual. This may help explain why increasing scale leads to diminishing returns for stacked Transformer architectures.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.13898

Country:

Europe > Austria > Vienna (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > Singapore (0.04)
(11 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback