Spectral Probing
Müller-Eberstein, Max, van der Goot, Rob, Plank, Barbara
–arXiv.org Artificial Intelligence
Linguistic information is encoded at varying timescales (subwords, phrases, etc.) and communicative levels, such as syntax and semantics. Contextualized embeddings have analogously been found to capture these phenomena at distinctive layers and frequencies. Leveraging these findings, we develop a fully learnable frequency filter to identify spectral profiles for any given task. It enables vastly more granular analyses than prior handcrafted filters, and improves on efficiency. After demonstrating the informativeness of spectral probing over manual filters in a monolingual setting, we investigate its multilingual characteristics across seven diverse NLP tasks in six languages. Our analyses identify distinctive spectral profiles which quantify cross-task similarity in a linguistically intuitive manner, while remaining consistent across languages-highlighting their potential as robust, lightweight task descriptors.
arXiv.org Artificial Intelligence
Oct-21-2022
- Country:
- North America
- Canada (0.04)
- United States > Minnesota
- Hennepin County > Minneapolis (0.14)
- Europe
- Spain > Aragón (0.04)
- France (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Italy > Tuscany
- Florence (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- China (0.04)
- Japan
- Kyūshū & Okinawa > Kyūshū
- Miyazaki Prefecture > Miyazaki (0.04)
- Honshū
- Tōhoku > Iwate Prefecture
- Morioka (0.04)
- Kantō > Tokyo Metropolis Prefecture
- Tokyo (0.04)
- Tōhoku > Iwate Prefecture
- Kyūshū & Okinawa > Kyūshū
- North America
- Genre:
- Research Report (0.50)
- Technology: