Simple Hardware-Efficient PCFGs with Independent Left and Right Productions
Liu, Wei, Yang, Songlin, Kim, Yoon, Tu, Kewei
–arXiv.org Artificial Intelligence
Scaling dense PCFGs to thousands of nonterminals via a low-rank parameterization of the rule probability tensor has been shown to be beneficial for unsupervised parsing. However, PCFGs scaled this way still perform poorly as a language model, and even underperform similarly-sized HMMs. This work introduces \emph{SimplePCFG}, a simple PCFG formalism with independent left and right productions. Despite imposing a stronger independence assumption than the low-rank approach, we find that this formalism scales more effectively both as a language model and as an unsupervised parser. As an unsupervised parser, our simple PCFG obtains an average F1 of 65.1 on the English PTB, and as a language model, it obtains a perplexity of 119.0, outperforming similarly-sized low-rank PCFGs. We further introduce \emph{FlashInside}, a hardware IO-aware implementation of the inside algorithm for efficiently scaling simple PCFGs.
arXiv.org Artificial Intelligence
Oct-23-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Nevada (0.04)
- Washington > King County
- Seattle (0.04)
- Texas > Travis County
- Austin (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Maryland > Prince George's County
- College Park (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Arizona > Maricopa County
- Phoenix (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- Asia
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.05)
- China
- Middle East > UAE
- North America
- Genre:
- Research Report (0.82)
- Technology: