Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge
–Neural Information Processing Systems
To unveil the hidden mechanisms under LM's incapability to learn long-tail domain-specific knowledge, we investigate the behaviors of a GPT on the Wikipedia dataset.
Neural Information Processing Systems
Nov-19-2025, 23:58:16 GMT
- Country:
- North America
- United States
- Michigan (0.04)
- Washington > King County
- Seattle (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- Canada > Quebec
- Montreal (0.04)
- United States
- Asia > China
- Shanghai > Shanghai (0.04)
- Jiangsu Province (0.04)
- North America
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Research Report
- Industry:
- Health & Medicine (1.00)
- Information Technology (0.67)
- Technology: