Glossary

GlossaNet

GlossaNet is a specialized search engine and also watch engine. It lets you make searches in every published texts on the Internet in the form of RSS feeds : press, media, blogs, forum, firms, etc. You specify a RSS publication list, you register a query and the system will analyse these sources and will search…

GNU General Public License v3.0

GNU GENERAL PUBLIC LICENSE Version 3, 29 June 2007 Copyright (C) 2007 Free Software Foundation, Inc. Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. Preamble The GNU General Public License is a free, copyleft license for software and other kinds of works. The licenses…

Gold Standard

Ground truth, truth table or gold standard refers to a set of predefined correct results used for evaluation purposes. Usually in evaluation scenarios involving: Precision, Recall and F1 score.

Hungarian method

The Hungarian method is a Topic Alignment method. It searches for the match with maximum weight, i.e., the set of edges that touches each topic in the two sets exactly once, so that sum of weights is maximized [1]. [1] A. De Waal, E. Barnard, Evaluating topic models with stability, 19th Annu. Symp. Pattern Recognit.…

IndustryTermModel

Industry Term Model is working title for the Web Classification algorithm, and it refers to particular namespace within imbWBI (documentation). The namespace contains few classes that are just connecting different parts of imbWBI.Core (documentation), imbNLP.PartOfSpeech (documentation) and imbWEM.Core (documentation) libraries, together to perform classification of business entities, actually their web sites, using natural language processing, ontology…