CogL TX: Applying BERT to Long Texts
–Neural Information Processing Systems
BERT is incapable of processing long texts due to its quadratically increasing memory and time consumption. The most natural ways to address this problem, such as slicing the text by a sliding window or simplifying transformers, suffer from insufficient long-range attentions or need customized CUDA kernels.
Neural Information Processing Systems
Nov-14-2025, 14:19:39 GMT
- Country:
- Europe > Belgium
- Brussels-Capital Region > Brussels (0.04)
- North America > Canada (0.04)
- Europe > Belgium
- Industry:
- Health & Medicine (0.30)
- Technology: