Sequences Corpus

We call sequences corpusWe call sequences corpus or qualified corpus a list of sequences of one or several words that we want to be recognized by only one local grammar graph. This sequences corpus is stored in one single file wich must be from one of the following formats :  raw text files in which sequences are delimited by end of line... or qualified corpusIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.... a list of sequences of one or several words that
we want to be recognized by only one local grammar graph.
This sequences corpusWe call sequences corpus or qualified corpus a list of sequences of one or several words that we want to be recognized by only one local grammar graph. This sequences corpus is stored in one single file wich must be from one of the following formats :  raw text files in which sequences are delimited by end of line... is stored in one single file wich must be from one of the following
formats :
 raw text files in which sequences are delimited by end of line
 SNT files already processed with this menu : sequences will be delimited by the STOP
tag.
 TEILite files in which sequences are delimited by the following xml tag :
<seg type=”sequence”>example</seg>