Chūgoku
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > Italy > Tuscany > Florence (0.04)
- (6 more...)
- Transportation (1.00)
- Information Technology (1.00)
- Law (0.92)
- (2 more...)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (4 more...)
- Information Technology > Game Theory (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
How hibernating hamsters could help astronauts
Special cells can repair muscles, even when some animals are dormant. A hibernating Syrian hamster that was part of the study. Breakthroughs, discoveries, and DIY tips sent six days a week. With the freezing temperatures that have recently pummeled parts of the northeastern United States, the idea of curling up for the winter and snoozing until spring sounds very appealing. There's just one problem for our species--well, actually, there would be many.
- North America > United States (0.25)
- North America > Greenland (0.05)
- Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.05)
Language Model Tokenizers Introduce Unfairness Between Languages
Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different languages can have drastically different tok-enization lengths, with differences up to 15 times in some cases. These disparities persist even for tokenizers that are intentionally trained for multilingual support.
- North America > Haiti (0.14)
- Asia > Philippines > Luzon > Ilocos Region > Province of Pangasinan (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
- (38 more...)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
- Asia > Japan > Kyūshū & Okinawa > Kyūshū (0.04)
- North America > United States > Illinois (0.04)
- (2 more...)
Warsh set to face early reality check as Trump's man at the Fed
Warsh set to face early reality check as Trump's man at the Fed Kevin Warsh has pledged to shrink the Fed's balance sheet and argued that a productivity boom driven by artificial intelligence will keep inflation low. Kevin Warsh waited almost a decade before finally clinching U.S. President Donald Trump's nomination to be chair of the Federal Reserve. He won't need to wait as long before his first big test in the job. Having won the race with a promise of regime change" at the Fed, suggesting he would make significant changes, Warsh has pledged to shrink the Fed's balance sheet and argued that a productivity boom driven by artificial intelligence will keep inflation low. While that prognosis was enough to convince Trump, his Fed pick will now need to convince fellow policymakers and investors.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.08)
- North America > United States > Minnesota (0.05)
- Europe > Ukraine (0.05)
- (5 more...)
Clustering in Deep Stochastic Transformers
Fedorov, Lev, Sander, Michaël E., Elie, Romuald, Marion, Pierre, Laurière, Mathieu
Transformers have revolutionized deep learning across various domains but understanding the precise token dynamics remains a theoretical challenge. Existing theories of deep Transformers with layer normalization typically predict that tokens cluster to a single point; however, these results rely on deterministic weight assumptions, which fail to capture the standard initialization scheme in Transformers. In this work, we show that accounting for the intrinsic stochasticity of random initialization alters this picture. More precisely, we analyze deep Transformers where noise arises from the random initialization of value matrices. Under diffusion scaling and token-wise RMS normalization, we prove that, as the number of Transformer layers goes to infinity, the discrete token dynamics converge to an interacting-particle system on the sphere where tokens are driven by a \emph{common} matrix-valued Brownian noise. In this limit, we show that initialization noise prevents the collapse to a single cluster predicted by deterministic models. For two tokens, we prove a phase transition governed by the interaction strength and the token dimension: unlike deterministic attention flows, antipodal configurations become attracting with positive probability. Numerical experiments confirm the predicted transition, reveal that antipodal formations persist for more than two tokens, and demonstrate that suppressing the intrinsic noise degrades accuracy.
- North America > United States (0.14)
- Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)
- Europe > Denmark (0.04)
Generative Modeling of Discrete Data Using Geometric Latent Subspaces
Gonzalez-Alvarado, Daniel, Cassel, Jonas, Petra, Stefania, Schnörr, Christoph
We introduce the use of latent subspaces in the exponential parameter space of product manifolds of categorial distributions, as a tool for learning generative models of discrete data. The low-dimensional latent space encodes statistical dependencies and removes redundant degrees of freedom among the categorial variables. We equip the parameter domain with a Riemannian geometry such that the spaces and distances are related by isometries which enables consistent flow matching. In particular, geodesics become straight lines which makes model training by flow matching effective. Empirical results demonstrate that reduced latent dimensions suffice to represent data for generative modeling.
- North America > United States > California > Alameda County > Hayward (0.04)
- Asia > Middle East > Jordan (0.04)
- Europe > United Kingdom > North Sea > Southern North Sea (0.04)
- (2 more...)
U.K. proposes letting websites refuse being included in Google's AI search
U.K. proposes letting websites refuse being included in Google's AI search Website publishers argue that Google's artificial intelligence-generated summaries discourage clicks to their original pages, reducing traffic to their sites and, in turn, cutting their advertising revenue. LONDON - Britain's competition watchdog proposed Wednesday that websites be allowed to opt out of having their content be used by Google's AI Overviews feature as it tackles the technology giant's dominance in online search. The Competition and Markets Authority (CMA) in October paved the way for tougher regulation on the matter, under new targeted measures focused on technology giants. Last year, it designated Google with strategic market status (SMS), subjecting it to special requirements, following a nine-month investigation. In a time of both misinformation and too much information, quality journalism is more crucial than ever. By subscribing, you can help us get the story right.
- Europe > United Kingdom (0.25)
- Asia > China (0.18)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.08)
- (5 more...)
- Media > News (0.72)
- Government > Regional Government > Asia Government > Japan Government (0.52)
Nvidia helped DeepSeek hone AI models later used by China's military
Nvidia helped DeepSeek hone AI models later used by China's military China's DeepSeek received extensive technical assistance from Nvidia as a legitimate commercial partner hone artificial intelligence models that were later used by the Chinese military, it has been revealed. SAN FRANCISCO - U.S. chipmaker Nvidia helped China's DeepSeek hone artificial intelligence models that were later used by the Chinese military, the chairman of a U.S. House of Representatives committee said in a letter on Wednesday. DeepSeek shook markets early last year with a set of AI models that rivaled some of the best offerings from the United States but were developed with far less computing power, fueling concerns in Washington that China could catch up with the U.S. in AI despite U.S. restrictions on the sale of high-powered computing chips to China. In a letter to U.S. Commerce Secretary Howard Lutnick, Rep. John Moolenaar, a Michigan Republican who chairs the House Select Committee on China, said documents obtained by the committee from Nvidia showed the achievement came after extensive technical assistance from Nvidia. In a time of both misinformation and too much information, quality journalism is more crucial than ever. By subscribing, you can help us get the story right.
- Asia > China (1.00)
- North America > United States > Michigan (0.25)
- North America > United States > California > San Francisco County > San Francisco (0.25)
- (6 more...)
- Information Technology > Hardware (1.00)
- Government > Regional Government > Asia Government > Japan Government (0.67)