MaxPoolBERT: Enhancing BERT Classification via Layer- and Token-Wise Aggregation

Open in new window