Goto

Collaborating Authors

 Large Language Model




DiscoveringSparsityAllocationforLayer-wise PruningofLargeLanguageModels

Neural Information Processing Systems

In this paper, we present DSA, the first automated framework for discovering sparsity allocation schemes for layer-wise pruning in Large Language Models (LLMs). LLMs have become increasingly powerful, but their large parameter counts make them computationally expensive. Existing pruning methods for compressing LLMs primarily focus on evaluating redundancies and removing element-wise weights. However, these methods fail to allocate adaptive layerwise sparsities, leading to performance degradation in challenging tasks.


ImOV3D: LearningOpen-VocabularyPointClouds 3DObjectDetectionfromOnly2DImages

Neural Information Processing Systems

Open-vocabulary 3D object detection (OV-3Det) aims to generalize beyond the limited number ofbasecategories labeled during thetraining phase. Thebiggest bottleneck is the scarcity of annotated 3D data, whereas 2D image datasets are abundantandrichlyannotated.



BAKU: AnEfficientTransformerfor Multi-TaskPolicyLearning

Neural Information Processing Systems

Inthiswork,wepresentBAKU,asimple transformer architecture that enables efficient learning of multi-task robot policies.BAKU builds upon recent advancements in offline imitation learning and meticulously combines observation trunks, action chunking, multi-sensory observations, and action heads tosubstantially improveupon prior work.




Multi-modalSituated Reasoningin3DScenes

Neural Information Processing Systems

Comprehensiveevaluationson MSQA andMSNN highlight thelimitations ofexisting vision-language models and underscore the importance of handling multi-modal interleaved inputs and situation modeling.