AITopics | Large Language Model

Large pretrained models can be used as annotators, helping replace or augment crowdworkers and enabling distilling generalist models into smaller specialist models.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry:

Information Technology (0.93)
Health & Medicine > Therapeutic Area > Oncology (0.67)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

Add feedback

726ab29b61a749b36d2593648716ae3c-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 20:27:52 GMT

Hence, the performance of LLMs in various NLP tasks depends significantly onthecrucial roleplayedbytheattention mechanism with thesoftmaxunit.

large language model, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Tuscany > Florence (0.04)
Asia > China (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.98)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Add feedback

724be4472168f31ba1c9ac630f15dec8-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 20:27:39 GMT

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Virginia (0.04)
North America > United States > Texas (0.04)
(3 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology (0.68)

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

An Analysis of Tokenization: Transformers under Markov Data

Neural Information Processing SystemsFeb-15-2026, 20:27:17 GMT

The training of language models is typically not an end-to-end process.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > North Carolina > Wake County > Raleigh (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Germany > Berlin (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

8e9bdc23f169a05ea9b72ccef4574551-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 20:27:10 GMT

correspondence, dataset, dino feature, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.41)

Add feedback

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Neural Information Processing SystemsFeb-15-2026, 20:09:36 GMT

Linear RNN architectures, like Mamba, can be competitive with Transformer models in language modeling while having advantageous deployment characteristics. Given the focus on training large-scale Transformer models, we consider the challenge of converting these pretrained models for deployment.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country: