Goto

Collaborating Authors

 Large Language Model


DAPE: Data-AdaptivePositionalEncodingforLength Extrapolation

Neural Information Processing Systems

Positional encoding plays a crucial role in transformers, significantly impactingmodel performance andlength generalization. Prior research hasintroduced absolute positional encoding (APE) and relative positional encoding (RPE) to distinguish token positions in given sequences.


Decoupled Context Processing for Context Augmented Language Modeling Zonglin Li

Neural Information Processing Systems

Language models can be augmented with a context retriever to incorporate knowledge from large external databases. By leveraging retrieved context, the neural network does not have to memorize the massive amount of world knowledge within its internal parameters, leading to better parameter efficiency, interpretability and mod-ularity.


The Download: Making AI Work, and why the Moltbook hype is similar to Pokรฉmon

MIT Technology Review

Are you interested in learning more about the ways in which AI is being used? We've launched a new weekly newsletter series exploring just that: digging into how generative AI is being used and deployed across sectors and what professionals need to know to apply it in their everyday work. Each edition of Making AI Work begins with a case study, examining a specific use case of AI in a given industry. Then we'll take a deeper look at the AI tool being used, with more context about how other companies or sectors are employing that same tool or system. Finally, we'll end with action-oriented tips to help you apply the tool. The first edition takes a look at how AI is changing health care, digging into the future of medical note-taking by learning about the Microsoft Copilot tool used by doctors at Vanderbilt University Medical Center.




LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Neural Information Processing Systems

However, such inputs impose a substantial burden on users when compared to simple text inputs. To address the issue, we study how Large Language Models (LLMs) can serve as visual planners by generating layouts from text conditions, and thus collaborate with visual generative models. We propose LayoutGPT, a method to compose in-context visual demonstrations in style sheet language to enhance the visual planning skills of LLMs.