Stable and low-precision training for large-scale vision-language models Mitchell Wortsman 1 Tim Dettmers 1 Luke Zettlemoyer
–Neural Information Processing Systems
Our main focus is int8 as GPU support for float8 is rare, though we also analyze float8 training through simulation.
Neural Information Processing Systems
Oct-8-2025, 06:32:19 GMT
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia > Middle East
- Jordan (0.04)
- Europe > France (0.04)
- North America > Canada
- British Columbia > Vancouver (0.04)
- Quebec > Montreal (0.04)
- Africa > Ethiopia
- Genre:
- Research Report (0.47)
- Technology: