Yann LeCun's new venture is a contrarian bet against large language models
In an exclusive interview, the AI pioneer shares his plans for his new Paris-based company, AMI Labs. Yann LeCun is a Turing Award recipient and a top AI researcher, but he has long been a contrarian figure in the tech world. He believes the industry's current obsession with large language models is wrong-headed and will ultimately fail to solve many pressing problems. Instead, he thinks we should be betting on world models, a different type of AI that accurately reflects the dynamics of the real world. He is also a staunch advocate for open-source AI and criticizes the closed approach of frontier labs like OpenAI and Anthropic. Perhaps it's no surprise, then, that he recently left Meta, where he had served as chief scientist for FAIR (Fundamental AI Research), the company's influential research lab that he founded. Meta has struggled to gain much traction with its open-source AI model Llama and has seen internal shake-ups, including the controversial acquisition of Scale AI. LeCun sat down for an exclusive online interview from his Paris apartment to discuss his new venture, life after Meta, the future of artificial intelligence, and why he thinks the industry is chasing the wrong ideas.
Why an AI 'godfather' is quitting Meta after 12 years
Just a couple of weeks ago, one of the godfathers of artificial intelligence was in St James's Palace being handed an award from King Charles for his work in artificial intelligence (AI). Professor Yann LeCun was being honoured along with six other recipients for his contributions to the field, which have been credited with advancing deep learning. But Prof LeCun is at odds with some of the AI world over the future of the generation-defining technology. And now he is going all-in on his idea of advanced machine intelligence after announcing he is leaving his role as Meta's chief AI scientist to start a new firm. During his 12 years at the company, Prof LeCun won the prestigious Turing Award and witnessed several flurries of excitement around AI - not least the most recent boom in generative AI accelerated by rival OpenAI's launch of ChatGPT in late 2022.
Meta Poaches Key Google AI Researcher
Upon its release earlier this month, OpenAI's Sora 2 model took the Internet by storm, thanks to its ability to generate realistic videos from just a text prompt. But Sora is about more than just capturing eyeballs with viral content. "On the surface, Sora, for example, does not look like it is AGI-relevant," OpenAI CEO Sam Altman said on a podcast earlier this month. "But I would bet that if we can build really great world models, that will be much more important to AGI than people think." Altman was speaking to a growing belief inside the AI industry at large: that if you can simulate the world with enough accuracy, you could drop AI agents into those simulations. There, they could learn more skills than they currently can from just text, photos, and videos, because they could interact with a simulated world. That form of training could be highly efficient, in part because simulated time can be accelerated, and because many simulations can be run in parallel.
Dual Perspectives on Non-Contrastive Self-Supervised Learning
Ponce, Jean, Terver, Basile, Hebert, Martial, Arbel, Michael
The stop gradient and exponential moving average iterative procedures are commonly used in non-contrastive approaches to self-supervised learning to avoid representation collapse, with excellent performance in downstream applications in practice. This presentation investigates these procedures from the dual viewpoints of optimization and dynamical systems. We show that, in general, although they do not optimize the original objective, or any other smooth function, they do avoid collapse. Following Tian et al. (2021), but without any of the extra assumptions used in their proofs, we then show using a dynamical-systems perspective that, in the linear case, minimizing the original objective function without the use of a stop gradient or exponential moving average always leads to collapse. Conversely, we characterize explicitly the equilibria of the dynamical systems associated with these two procedures in this linear setting as algebraic varieties in their parameter space, and show that they are, in general, asymptotically stable. Our theoretical findings are illustrated by empirical experiments with real and synthetic data. Self-supervised learning (SSL) is an approach to representation learning that exploits the internal consistency of training data without requiring expensive annotations. However, non-contrastive approaches to SSL (Assran et al., 2023; Bardes et al., 2022) that take as input different views of the same data samples and learn to predict one view from the other are susceptible to representational collapse, where a constant embedding is learned for all data points (LeCun, 2022). We use in this presentation the dual viewpoints of optimization and dynamical systems to study theoretically and empirically the well-known stop gradient (Chen and He, 2021) and exponential moving average (Grill et al., 2020) training procedures that are specifically designed to avoid this problem.
Here C is the global minimum of E(θ,ψ) (shown as negative instead of zero for readability) associated with a collapse of the training process; B is a nontrivial local minimum one may reach using an appropriate regularization to avoid collapse; and A is a limit point of the stop gradient (SG) training procedure associated with parameters θ and ψ at convergence. In general, it is not a minimum of E and thus does not correspond to a collapse of the training process, but it is a minimum with respect to ψ of E(θ,ψ).
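The two procedures the abstract studies can be sketched concretely. Below is a minimal NumPy toy in the linear setting the authors analyze: an online encoder W, a target encoder W_t updated by exponential moving average, and a linear predictor P trained with a stop gradient on the target branch (gradients are computed only through the online branch). All names (W, W_t, P, tau) and the toy data are illustrative assumptions, not the paper's code.

```python
import numpy as np

rng = np.random.default_rng(0)

d, k = 8, 4                       # observation and embedding dimensions
W = rng.normal(size=(k, d)) * 0.1  # online encoder
W_t = W.copy()                     # EMA target encoder
P = np.eye(k)                      # linear predictor

def sg_gradient_step(x1, x2, lr=0.1):
    """One stop-gradient step on L = 0.5 * ||P W x1 - W_t x2||^2.
    The target embedding z2 is treated as a constant (the 'stop
    gradient'), so gradients flow only through the online branch."""
    global W, P
    z1 = W @ x1                    # online embedding
    z2 = W_t @ x2                  # target embedding (detached)
    err = P @ z1 - z2              # prediction error
    grad_P = np.outer(err, z1)     # dL/dP
    grad_W = np.outer(P.T @ err, x1)  # dL/dW through z1 only
    P -= lr * grad_P
    W -= lr * grad_W

def ema_update(tau=0.99):
    """Target encoder tracks the online encoder by an exponential
    moving average instead of receiving gradients."""
    global W_t
    W_t = tau * W_t + (1 - tau) * W

for _ in range(200):
    x = rng.normal(size=d)                       # one data sample
    x1 = x + 0.01 * rng.normal(size=d)           # two augmented views
    x2 = x + 0.01 * rng.normal(size=d)
    sg_gradient_step(x1, x2)
    ema_update()
```

Collapse in this setting would mean W being driven to zero, so every input receives the same (constant) embedding; the abstract's claim is that the SG/EMA dynamics converge instead to nontrivial equilibria.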
'I have to do it': Why one of the world's most brilliant AI scientists left the US for China
In 2020, after spending half his life in the US, Song-Chun Zhu took a one-way ticket to China. By the time Song-Chun Zhu was six years old, he had encountered death more times than he could count. This was the early 1970s, the waning years of the Cultural Revolution, and his father ran a village supply store in rural China. There was little to do beyond till the fields and study Mao Zedong at home, and so the shop became a refuge where people could rest, recharge and share tales. Zhu grew up in that shop, absorbing a lifetime's worth of tragedies: a family friend lost in a car crash, a relative lost to an untreated illness, stories of suicide or starvation. "That was really tough," Zhu recalled recently. The young Zhu became obsessed with what people left behind after they died. One day, he came across a book that contained his family genealogy. When he asked the bookkeeper why it included his ancestors' dates of birth and death but nothing about their lives, the man told him matter-of-factly that they were peasants, so there was nothing worth recording. He resolved that his fate would be different. Today, at 56, Zhu is one of the world's leading authorities in artificial intelligence. In 1992, he left China for the US to pursue a PhD in computer science at Harvard. Later, at the University of California, Los Angeles (UCLA), he led one of the most prolific AI research centres in the world, won numerous major awards, and attracted prestigious research grants from the Pentagon and the National Science Foundation. He was celebrated for his pioneering research into how machines can spot patterns in data, which helped lay the groundwork for modern AI systems such as ChatGPT and DeepSeek. He and his wife, and their two US-born daughters, lived in a hilltop home on Los Angeles's Mulholland Drive. He thought he would never leave.
But in August 2020, after 28 years in the US, Zhu astonished his colleagues and friends by suddenly moving back to China, where he took up professorships at two top Beijing universities and a directorship in a state-sponsored AI institute.
Learning State-Space Models of Dynamic Systems from Arbitrary Data using Joint Embedding Predictive Architectures
Ulmen, Jonas, Sundaram, Ganesh, Görges, Daniel
Abstract: With the advent of Joint Embedding Predictive Architectures (JEPAs), which appear to be more capable than reconstruction-based methods, this paper introduces a novel technique for creating world models using continuous-time dynamic systems from arbitrary observation data. The proposed method integrates sequence embeddings with neural ordinary differential equations (neural ODEs). It employs loss functions that enforce contractive embeddings and Lipschitz constants in state transitions to construct a well-organized latent state space. The approach's effectiveness is demonstrated through the generation of structured latent state-space models for a simple pendulum system using only image data. This opens up a new technique for developing more general control algorithms and estimation techniques with broad applications in robotics.
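The architecture the abstract describes can be illustrated with a forward-pass-only sketch: an encoder maps image-like observations to latent states, a neural ODE (here integrated with forward Euler) rolls the latent state forward in continuous time, and a JEPA-style loss compares the predicted latent against the embedding of the next observation, whose gradient would be stopped during training. A spectral-norm penalty stands in for the Lipschitz constraint on state transitions. All names, dimensions, and the exact penalty form are assumptions for illustration, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(1)

d_obs, d_lat = 16, 4
W_enc = rng.normal(size=(d_lat, d_obs)) * 0.1  # toy linear-tanh encoder
W_f = rng.normal(size=(d_lat, d_lat)) * 0.1    # latent dynamics weights

def encode(o):
    """Map an observation to a latent state z."""
    return np.tanh(W_enc @ o)

def latent_ode_step(z, dt=0.05, n=20):
    """Forward-Euler integration of the latent ODE dz/dt = tanh(W_f z)
    over n steps of size dt, approximating the continuous-time rollout."""
    for _ in range(n):
        z = z + dt * np.tanh(W_f @ z)
    return z

o_t = rng.normal(size=d_obs)       # observation at time t
o_next = rng.normal(size=d_obs)    # observation at time t + dt * n

z_pred = latent_ode_step(encode(o_t))
z_target = encode(o_next)          # gradients would be stopped here (JEPA)
pred_loss = np.sum((z_pred - z_target) ** 2)

# Lipschitz-style regularizer: penalize a spectral norm above 1 so that
# the linearized transition map is contractive.
lip_penalty = max(0.0, np.linalg.norm(W_f, 2) - 1.0)
print(f"prediction loss: {pred_loss:.4f}, Lipschitz penalty: {lip_penalty:.4f}")
```

In training, pred_loss plus the contraction/Lipschitz penalties would be minimized jointly over the encoder and dynamics parameters; the sketch only shows how the pieces fit together.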
AI 'godfather' predicts another revolution in the tech in the next five years
One of the "godfathers" of modern artificial intelligence has predicted a further revolution in the technology by the end of the decade, and says current systems are too limited to create domestic robots and fully automated cars. Yann LeCun, the chief AI scientist at Mark Zuckerberg's Meta, said new breakthroughs are needed in order for the systems to understand and interact with the physical world. LeCun spoke as one of seven engineers who were awarded the £500,000 Queen Elizabeth prize for engineering on Tuesday for their contributions to machine learning, a cornerstone of AI. Recent breakthroughs in the sector, led by the launch of OpenAI's ChatGPT chatbot, have heightened expectations – and fears – of systems gaining human levels of intelligence. However, LeCun said there was some way to go before AIs matched humans or animals, with the current cutting-edge technology excelling at "manipulating language" but not at understanding the physical world.