Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference
Neural Information Processing Systems
Large Transformer networks are increasingly used in settings where low inference latency is necessary to enable new applications and improve the end-user experience.
Feb-7-2026, 23:23:39 GMT
- Country:
  - Europe
    - France (0.04)
    - Italy > Calabria
      - Catanzaro Province > Catanzaro (0.04)
  - North America
    - Canada > Ontario
      - Toronto (0.04)
    - United States > New York
      - New York County > New York City (0.04)
- Genre:
  - Research Report (0.46)
- Industry:
  - Government (0.94)
- Technology: