The imbNLP library introduces quite unique data structure designed to facilitate robust regular expression queries over different facets of tokenized content graph model. It is called textual map, a textual representation pivoted collection, keeping record on references to the graph elements on char position level. When the textual map, shown below in conceptual example, is queried for range from 5 to 9, it will return: words T1 and T2, sentence TS1 and phrase P1.
|Putting out the current form of the token|
|Putting out the initial form of the token|
|The Part-of-speech, is very frequently used to provide linguistic information to NER and CR in form of features in statistical approaches... type tag form: |A V|N|Part|Conj||
|The Part-of-speech, is very frequently used to provide linguistic information to NER and CR in form of features in statistical approaches... type and grammatical tag form: [Amspf:Nmsps]|ADJ[fs1f]||
|The flags form: |phoneOfficeNeedle|symbol|number phone phoneNumber||
|The flags form: |dat_business.phoneOfficeNeedle|tkn_contains.symbols|tkn_contains.number dat_business.phone dat_business.phoneNumber||
|The descriptive form: |”kompanijom”:”kompanija”:N,common,f,s,instrumental:lowerCase,letter,onlyLetters||