Glossary

Correlated Topic Model (CTM)

Correlated Topic Model (CTM) [2] extends LDA using a logistic-normal prior which explicitly models correlation patterns with a Gaussian covariance matrix. [1] J. He, Z. Hu, T. Berg-Kirkpatrick, Y. Huang, E.P. Xing, Efficient Correlated Topic Modeling with Topic Embedding, Proc. 23rd ACM SIGKDD Int. Conf. Knowl. Discov. Data Min. – KDD ’17. (2017) 225–233. doi:10.1145/3097983.3098074.…

DELAF

All the dictionaries conform to the DELAF formalism. A DELAF dictionary is a text file, each line representing an entry. The line representing a word contains the inflected form of the word, the lemma of the word and some grammatical, semantic and inflectional information. Here is a sample of the English simple word dictionary: acidize,.V:W:P1s:P2s:P1p:P2p:P3p…

DELAS

The DELAS format is very similar to the one used in the DELAF. The only difference is that there is only a canonical form followed by grammatical and/or semantic codes. The canonical form is separated from the different codes by a comma. There is an example: horse,N4+Anl The first grammatical or semantic code will be…

DFC

Document Frequency Factor – multiplier that is applied to Number of Documents variable in TF-cIDF term weight model. If DFC = 0, then it is TF model, if DFC=1, then it is typical TF-IDF.

Distributional Hypothesis

says that words that occur in the same contexts tend to have similar meaning. Z.Harris, 1954. Distributional Structure. In J. A. Fodor and J. J. Katz, editor, The Structure of Language, pages 33-49, Prentice-Hall