AITopics | mdl

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

TrueFew-ShotLearningwithLanguageModels

Neural Information Processing SystemsFeb-8-2026, 21:01:27 GMT

Here, we evaluate the few-shot ability ofLMs when such held-out examples are unavailable, a setting we calltrue few-shot learning. We test two model selection criteria, cross-validation and minimum description length, for choosing LM prompts and hyperparameters in the true few-shot setting. Onaverage, both marginally outperform random selection and greatlyunderperform selection basedonheld-out examples.

artificial intelligence, dtrain, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Tuscany > Florence (0.04)
Asia > China (0.04)
Asia > Japan > Kyūshū & Okinawa > Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

4f693c15f189efd888b6782a5f4eccb1-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 16:28:40 GMT

artificial intelligence, machine learning, probability, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

4f693c15f189efd888b6782a5f4eccb1-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 16:28:36 GMT

artificial intelligence, bidder, mechanism, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.98)

Add feedback

Refactoring Codebases through Library Design

Kovacic, Ziga, Chiu, Justin T., Lee, Celine, Zhao, Wenting, Ellis, Kevin

arXiv.org Artificial IntelligenceOct-7-2025

Maintainable and general software allows developers to build robust applications efficiently, yet achieving these qualities often requires refactoring specialized solutions into reusable components. This challenge becomes particularly relevant as code agents become used to solve isolated one-off programming problems. We investigate code agents' capacity to refactor code in ways that support growth and reusability. We first investigate what makes a good refactoring, finding via simulation results and a human study that Minimum Description Length best correlates with preferable refactorings. We then present both a benchmark and a method for refactoring: MiniCode, a benchmark where multiple files must be refactored into a shared library, and Librarian, a sample-and-rerank method for generating reusable libraries. We compare Librarian to state-of-the-art library generation methods, and study it on real-world code bases.

large language model, machine learning, programming language, (23 more...)

arXiv.org Artificial Intelligence

2506.11058

Genre: Research Report (0.82)

Industry: Education (0.47)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

A Minimum Description Length Approach to Regularization in Neural Networks

Abudy, Matan, Well, Orr, Chemla, Emmanuel, Katzir, Roni, Lan, Nur

arXiv.org Artificial IntelligenceSep-9-2025

State-of-the-art neural networks can be trained to become remarkable solutions to many problems. But while these architectures can express symbolic, perfect solutions, trained models often arrive at approximations instead. We show that the choice of regularization method plays a crucial role: when trained on formal languages with standard regularization ($L_1$, $L_2$, or none), expressive architectures not only fail to converge to correct solutions but are actively pushed away from perfect initializations. In contrast, applying the Minimum Description Length (MDL) principle to balance model complexity with data fit provides a theoretically grounded regularization method. Using MDL, perfect solutions are selected over approximations, independently of the optimization algorithm. We propose that unlike existing regularization techniques, MDL introduces the appropriate inductive bias to effectively counteract overfitting and promote generalization.

artificial intelligence, machine learning, regularization, (17 more...)

arXiv.org Artificial Intelligence

2505.13398

Country: North America > United States (0.46)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.68)

Technology: