UDApter -- Efficient Domain Adaptation Using Adapters
Malik, Bhavitvya, Kashyap, Abhinav Ramesh, Kan, Min-Yen, Poria, Soujanya
–arXiv.org Artificial Intelligence
We propose two methods to make unsupervised domain adaptation (UDA) more parameter efficient using adapters, small bottleneck layers interspersed with every layer of the large-scale pre-trained language model (PLM). The first method deconstructs UDA into a two-step process: first by adding a domain adapter to learn domain-invariant information and then by adding a task adapter that uses domain-invariant information to learn task representations in the source domain. The second method jointly learns a supervised classifier while reducing the divergence measure. Compared to strong baselines, our simple methods perform well in natural language inference (MNLI) and the cross-domain sentiment classification task. We even outperform unsupervised domain adaptation methods such as DANN and DSN in sentiment classification, and we are within 0.85% F1 for natural language inference task, by fine-tuning only a fraction of the full model parameters. We release our code at https://github.com/declare-lab/domadapter
arXiv.org Artificial Intelligence
Feb-16-2023
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America > United States
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York > New York County
- Europe
- France (0.04)
- Czechia > Prague (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- China > Hong Kong (0.04)
- Middle East > Jordan (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.50)
- Technology: