imbVeles tools and development guide

The page enumerates possible ways to make use of the imbVeles Framework.

  • Download libraries from NuGet
  • Download source from GitHub

For non-developers, researchers in Natural Language Processing and Web Content Mining / Information Retrieval fields:

  • imbWBI: Experimental Console Tool (imbWBI Console Tool v0.3.1)
    • Perform web site classification using Industry Term Model (itmPlugin), embedded in the console application
    • Define and perform your experiments, with customized project configuration
    • Generate secondary reports on performed experiments
  • imbWEM: Web Exploration Model – Application Tools
    • Design your own web crawler model
    • Define your Crawl Job and experiment setup
    • Perform experiments and create benchmark reports
    • Retrieve web documents with your or existing crawler model design
  • imbNLP: Natural Language Processing – Application Tools
    • Extract corporaIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.... from documents
    • Construct Term Frequency – Inverse Document Frequency table
    • Construct domain-specific lexicon
    • Convert different lexical and corpusIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.... resources
    • Create reports
    • Extract facts from corporaIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory....
    • Construct semantic knowledge on entities

For developers, researchers in other fields:

  • imbACE: Advanced Console Environment
    • Develop your research-specific console application using one of the imbACE: Application Templates
  • imbSCI: Coding for Science Foundation
    • Use reporting and data annotation features for your application
    • Use data structures and serialization tools for your application

For developers, researchers in Natural Language Processing and Web Content Mining / Information Retrieval fields:

  • imbNLP: Natural Language Processing
    • Develop a NLP parser and/or plugin
    • Develop facts extraction plugin
    • Develop knowledge extraction rules / plugin
  • imbWEM: Web Exploration Model
    • Develop a Crawler module
    • Develop a Crawler frontier rule

 

Visual Studio Project and Item templates and snippets

Item Templates

Attachments

Spread the love