Funnel-Transformer: FilteringoutSequential RedundancyforEfficientLanguageProcessing
–Neural Information Processing Systems
With the success of language pretraining, it is highly desirable to develop more efficient architectures ofgood scalability thatcanexploit theabundant unlabeled dataatalowercost.
Neural Information Processing Systems
Feb-7-2026, 22:54:55 GMT
- Country:
- Technology: