CoLLA T: On Adding Fine-grained Audio Understanding to Language Models using Token-Level Locked-Language Tuning

Nov-19-2025, 19:36:17 GMT–Neural Information Processing Systems

Humans can easily understand various audio concepts, but conventional audio classification models fail due to their inability to predict unseen classes during training.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Nov-19-2025, 19:36:17 GMT

Conferences PDF

Add feedback

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks (0.96)

Duplicate Docs Excel Report

Title
CoLLA T: On Adding Fine-grained Audio Understanding to Language Models using Token-Level Locked-Language Tuning

Similar Docs Excel Report more

Title	Similarity	Source
None found