NLP from Scratch: Key Insights

#artificialintelligence 

A frequency count of all the names in a document; think of a dictionary containing the frequency of all words in a document. Often use for autocompleting and resolving word ambiguity (e.g. is watch in the document a verb or a noun?) "Also called shallow parsing, chunking aims at labeling segments of a sentence with syntactic constituents such as noun or verb phrases (NP or VP). Each word is assigned only one unique tag, often encoded as a begin-chunk (e.g., B-NP) or inside-chunk tag (e.g., I-NP)" (Collobert et al.). An entity is a thing that belongs to a noun family (e.g. the car, the chef, the plant). A named entity is an entity that has a proper name (e.g.