AITopics | South America

From an optimization perspective, our SLED framework leverages the latent knowledge embedded within the LLM by contrasting the output logits from the final layer with those from early layers.

arxiv preprint arxiv, logit, sled, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
North America > Canada > Ontario > Toronto (0.04)
Asia > Middle East > Jordan (0.04)
(3 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

09265e2568cf7a6ff47b506acbc2c6eb-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 18:01:01 GMT

causal query, computational linguistic, proceedings, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > Canada > Ontario > Toronto (0.04)
(20 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Consumer Products & Services (1.00)
Transportation > Air (0.46)
Transportation > Passenger (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

08a9e28c96d016dd63903ab51cd085b0-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 18:00:17 GMT

experiment, knowledge, neuron, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)
(16 more...)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Add feedback

Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers

Neural Information Processing SystemsOct-9-2025, 17:58:49 GMT

Interestingly, the scaling performance of structured matrices is explored, revealing steeper curves in scaling training FLOPs, along with a favorable scaling trend in the overtraining regime. Specifically, we show that wide and structured networks can utilize training FLOPs more efficiently, with fewer parameters and lower loss than dense models at their optimal trade-off.

arxiv preprint arxiv, experiment, matrix, (14 more...)

Neural Information Processing Systems

Country: