Beam Tree Recursive Cells
Chowdhury, Jishnu Ray, Caragea, Cornelia
–arXiv.org Artificial Intelligence
We propose Beam Tree Recursive Cell (BT-Cell) - a backpropagation-friendly framework to extend Recursive Neural Networks (RvNNs) with beam search for latent structure induction. We further extend this framework by proposing a relaxation of the hard top-k operators in beam search for better propagation of gradient signals. We evaluate our proposed models in different out-of-distribution splits in both synthetic and realistic data. Our experiments show that BTCell achieves near-perfect performance on several challenging structure-sensitive synthetic tasks like ListOps and logical inference while maintaining comparable performance in realistic data against other RvNN-based models. Additionally, we identify a previously unknown failure case for neural models in generalization to unseen number of arguments in ListOps. The code is available at: https://github.com/JRC1995/BeamTreeRecursiveCells.
arXiv.org Artificial Intelligence
Jun-20-2023
- Country:
- Asia
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- France > Hauts-de-France
- Germany > Berlin (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Belgium > Brussels-Capital Region
- North America > United States
- California
- Los Angeles County > Los Angeles (0.04)
- San Diego County > San Diego (0.04)
- Washington > King County
- Seattle (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Colorado > Denver County
- Denver (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California
- Oceania > Australia
- Genre:
- Research Report (0.64)
- Technology: