Glossary

Outlier

In statistics, an outlier is an observation point that is distant from other observations.[1][2] An outlier may be due to variability in the measurement or it may indicate experimental error; the latter are sometimes excluded from the data set.[3] An outlier can cause serious problems in statistical analyses. From: https://en.wikipedia.org/wiki/Outlier

Pattern Specificity

The specificity of a pattern X is defined as power function of the pattern length with the exponent less than 1, denoted as spe(X). spe(X)m = a|X| where: a and m are constant real numbers e.g. a=1, m=0.5 [1] Y. Gao, Y. Xu, Y. Li, Pattern-Based Topic Models for Information Filtering, in: 2013 IEEE 13th…

Perplexity

[Topic Model] Perplexity is a standard performance measure used to evaluate models of text data. It measures a model’s ability to generalise and predict new documents: the perplexity is an indication of the number of equally likely words that can occur at an arbitrary position in a document. A lower perplexity therefore indicates better generalisation. We calculate…

POS

Part-of-speech, is very frequently used to provide linguistic information to NER and CR in form of features in statistical approaches