recent progress
Mind the Gap: Assessing Temporal Generalization in Neural Language Models
Our world is open-ended, non-stationary, and constantly evolving; thus what we talk about and how we talk about it change over time. This inherent dynamic nature of language contrasts with the current static language modelling paradigm, which trains and evaluates models on utterances from overlapping time periods. Despite impressive recent progress, we demonstrate that Transformer-XL language models perform worse in the realistic setup of predicting future utterances from beyond their training period, and that model performance becomes increasingly worse with time. We find that, while increasing model size alone--a key driver behind recent progress--does not solve this problem, having models that continually update their knowledge with new information can indeed mitigate this performance degradation over time. Hence, given the compilation of ever-larger language modelling datasets, combined with the growing list of language-model-based NLP applications that require up-to-date factual knowledge about the world, we argue that now is the right time to rethink the static way in which we currently train and evaluate our language models, and develop adaptive language models that can remain up-to-date with respect to our ever-changing and non-stationary world. We publicly release our dynamic, streaming language modelling benchmarks for WMT and arXiv to facilitate language model evaluation that takes temporal dynamics into account.
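As a way to picture the evaluation this abstract describes, the sketch below (illustrative only, not the paper's code) scores held-out documents month by month after a training cutoff, so any degradation over time shows up directly. Here `score_log_prob` and the toy unigram model are hypothetical stand-ins for a real language model's scoring API.

```python
# A minimal sketch of time-stratified LM evaluation: only documents written
# after the training cutoff are scored, and perplexity is reported per month
# so that degradation further from the cutoff becomes visible.
import math
from collections import defaultdict
from datetime import date

def score_log_prob(model, text):
    # Hypothetical stand-in: total log-probability and token count for `text`.
    # Replace with your language model's own scoring call.
    tokens = text.split()
    return sum(model.get(t, math.log(1e-6)) for t in tokens), len(tokens)

def perplexity_by_month(model, corpus, cutoff):
    """corpus: iterable of (date, text) pairs; only post-cutoff docs are scored."""
    logp, count = defaultdict(float), defaultdict(int)
    for day, text in corpus:
        if day <= cutoff:
            continue  # inside the training period; skip
        lp, n = score_log_prob(model, text)
        key = (day.year, day.month)
        logp[key] += lp
        count[key] += n
    # Perplexity per month: exp of the negative mean token log-probability.
    return {k: math.exp(-logp[k] / count[k]) for k in sorted(count)}

# Toy usage: a unigram "model" and two documents from after the cutoff.
model = {"the": math.log(0.1), "market": math.log(0.01)}
corpus = [(date(2019, 10, 2), "the market moved"),
          (date(2019, 11, 5), "the new variant emerged")]
print(perplexity_by_month(model, corpus, cutoff=date(2019, 9, 30)))
```

Under a split like this, the paper's finding corresponds to the per-month perplexities rising as the evaluation month moves further past the cutoff.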
Translating single-cell genomics into cell types
Data are the new gold, and single-cell genomics is a good match for data-hungry machine-learning algorithms. Machine learning has become increasingly crucial in single-cell genomics. Much of the recent progress in machine learning, primarily in image classification, has been driven by convolutional neural networks. The trick is to focus on local patches of an image and then build up the whole image step by step -- similar to, and inspired by, the hierarchies of receptive fields discovered in the human brain. Such convolutional neural networks have become state-of-the-art tools for several prediction problems in genomics and bioinformatics, such as the prediction of transcription-factor binding sites, the analysis of genetic variants, sequence analysis, and protein-conformation prediction.
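To make the "local patches" idea concrete, here is a minimal sketch (not from the article) of a single convolutional filter sliding over a one-hot-encoded DNA sequence, the same basic operation used in binding-site prediction models. The hand-written motif filter is an illustrative stand-in for learned weights.

```python
# A single 1D convolution over a one-hot-encoded DNA sequence: score every
# local patch against a motif filter, the building block of CNNs used for
# transcription-factor binding-site prediction.
import numpy as np

BASES = "ACGT"

def one_hot(seq):
    # Encode a DNA string as a (len(seq), 4) matrix of 0/1 indicators.
    return np.array([[float(b == base) for base in BASES] for b in seq])

def conv1d(x, kernel):
    # Slide the kernel over each local patch and record how strongly the
    # patch matches; this sliding dot product is the convolution step.
    k = kernel.shape[0]
    return np.array([np.sum(x[i:i + k] * kernel) for i in range(len(x) - k + 1)])

# A hand-written filter that responds to the motif "TATA".
kernel = one_hot("TATA")
scores = conv1d(one_hot("GGTATACCGT"), kernel)
print(scores.argmax(), scores.max())  # position 2, perfect-match score 4.0
```

A real network stacks many such learned filters and pools their responses, building up from local patches to whole-sequence predictions, exactly as the excerpt describes for images.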
GeoAI for Large-Scale Image Analysis and Machine Vision: Recent Progress of Artificial Intelligence in Geography
GeoAI, or geospatial artificial intelligence, has become a trending topic and the frontier of spatial analytics in Geography. Although much progress has been made in integrating AI and Geography, there is as yet no clear definition of GeoAI or its scope of research, nor a broad discussion of how it enables new ways of problem solving across the social and environmental sciences. This paper provides a comprehensive overview of GeoAI research for large-scale image analysis: its methodological foundations, its most recent progress in geospatial applications, and its comparative advantages over traditional methods. We organize this review of GeoAI research according to the different kinds of image or structured data it uses, including satellite and drone images, street views, and geo-scientific data, as well as their applications in a variety of image analysis and machine vision tasks. While different applications tend to use diverse types of data and models, we summarize six major strengths of GeoAI research: (1) enablement of large-scale analytics; (2) automation; (3) high accuracy; (4) sensitivity in detecting subtle changes; (5) tolerance of noise in data; and (6) rapid technological advancement. As GeoAI remains a rapidly evolving field, we also describe current knowledge gaps and discuss future research directions.
Fusing Stretchable Sensing Technology with Machine Learning for Human–Machine Interfaces
Sensors and algorithms are the two fundamental elements of intelligent systems. Recent progress in machine learning (ML) has produced great advances in such systems, owing to the powerful data-analysis capability of ML algorithms. However, the performance of most systems is still limited by sensing techniques that rely on rigid and bulky devices, which cannot conform to irregularly curved and dynamic surfaces for high-quality data acquisition. Skin-like stretchable sensing technology, with unique characteristics such as high conformability, low modulus, and light weight, has recently been developed to address this issue. Here, recent progress in fusing emerging stretchable electronics with ML technology for bioelectrical signal recognition, tactile perception, and multimodal integration is summarized, and the remaining challenges and future developments are discussed.
GPT-3: The First Artificial General Intelligence?
If you had asked me a year or two ago when Artificial General Intelligence (AGI) would be invented, I'd have told you that we were a long way off. Most experts were saying that AGI was decades away, and some were saying it might not happen at all. The consensus is -- was? -- that all the recent progress in AI concerns so-called "narrow AI," meaning systems that can only perform one specific task. An AGI, or "strong AI," which could perform any task as well as a human being, is a much harder problem. It is so hard that there is no clear roadmap for achieving it, and few researchers openly work on the topic. GPT-3, the latest language model from the OpenAI team, is the first model to seriously shake that status quo.
The KnowRef Coreference Corpus: a resource for training and evaluating common sense in AI
AI has made major strides in the last decade, from beating the world champion of Go, to learning how to program, to telling fantastical short stories. However, a basic human trait continues to elude machines: common sense. Common sense is a big term with plenty of baggage, but it typically includes shared background knowledge (I know certain facts about the world, like "the sky is blue," and I know that you know them too), elements of logic, and the ability to infer what is plausible. It looms large as one of the hardest and most central problems in AI. Machines can seem glaringly unintelligent when they lack common sense.
Marvelous models
The availability of computational resources enables the simulation of increasingly intricate models in many fields of science. Scientists learn about the world by observing, manipulating, measuring, and abstracting. To make sure that they truly understand their system, and to gain insight beyond what experimental data can provide, many also turn to building mathematical models. Some models are based directly on fundamental physical laws, but most rely on approximations. The computational costs vary widely--from exactly solvable models to those that require all the computer power you can get.
How to Spot a Machine Learning Opportunity, Even If You Aren't a Data Scientist
Having an intuition for how machine learning algorithms work -- even in the most general sense -- is becoming an important business skill. As Andrew Ng has written: "Almost all of AI's recent progress is through one type, in which some input data (A) is used to quickly generate some simple response (B)." But how does this work? As you might imagine, many exciting machine learning problems can't be reduced to a simple equation like y = mx + b. But at their essence, supervised machine learning algorithms are solving for complex versions of m, based on labeled values for x and y, so that they can predict future y's from future x's.
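To make that sentence concrete, here is a minimal sketch (not from the article) that fits exactly that equation from labeled (x, y) pairs by ordinary least squares, then predicts a future y from a future x.

```python
# Supervised learning in miniature: fit the slope m and intercept b of
# y = m*x + b from labeled data, then use them to predict unseen y's.
def fit_line(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Ordinary least squares: slope is covariance over variance.
    m = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) \
        / sum((x - mean_x) ** 2 for x in xs)
    b = mean_y - m * mean_x
    return m, b

# Labeled training data: inputs A and responses B, in Ng's terms.
xs, ys = [1, 2, 3, 4], [2.1, 3.9, 6.2, 7.8]
m, b = fit_line(xs, ys)
print(f"y ~= {m:.2f}x + {b:.2f}")  # the learned slope and intercept
print(m * 5 + b)                   # predict a future y from a future x
```

Real models replace the single slope m with millions of learned parameters, but the workflow -- fit on labeled pairs, then predict on new inputs -- is the same.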