From Partial to Strictly Incremental Constituent Parsing

Ezquerro, Ana, Gómez-Rodríguez, Carlos, Vilares, David

arXiv.org Artificial Intelligence 

However, parsing architectures, but that does not adhere with the rise of bidirectional LSTMs (Hochreiter to a definition of strong incrementality. More recently, and Schmidhuber, 1997) and Transformers Kitaev et al. (2022) introduced a span-based (Vaswani et al., 2017), recent research has focused model that incrementally encodes input sentences on non-incremental solutions. These models process into discrete elements using vectors from GPT-2 the full input for contextualization before they mapped into a codebook. Despite this, it relied on start generating any output. Therefore, this approach bidirectional Transformers and a CYK architecture does not capture the progressive unfolding (Kitaev and Klein, 2018) for decoding these vectors of input over time, giving the sense that all into trees. Complementarily, Yang and Deng of it is available all of a sudden (Madureira and (2020) proposed an incremental decoder based on Schlangen, 2020). This is not an issue for most graph neural networks. Although they referred to NLP tasks, but it is relevant for others, such as their parser as strongly incremental, sentences were real-time NLP, e.g., instant machine translation or encoded with bidirectional architectures like BERT real-time speech.