The page enumerates possible ways to make use of the imbVeles Framework.
For non-developers, researchers in Natural Language Processing and Web Content Mining / Information Retrieval fields:
- imbWBI: Experimental Console Tool (imbWBI Console Tool v0.3.1)
- Perform web site classification using Industry Term Model (itmPlugin), embedded in the console application
- Define and perform your experiments, with customized project configuration
- Generate secondary reports on performed experiments
- imbWEM: Web Exploration Model – Application Tools
- Design your own web crawler model
- Define your Crawl Job and experiment setup
- Perform experiments and create benchmark reports
- Retrieve web documents with your or existing crawler model design
- imbNLP: Natural Language Processing – Application Tools
- Extract corporaIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.... from documents
- Construct Term Frequency – Inverse Document Frequency table
- Construct domain-specific lexicon
- Convert different lexical and corpusIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.... resources
- Create reports
- Extract facts from corporaIn linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed). They are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory....
- Construct semantic knowledge on entities
For developers, researchers in other fields:
- imbACE: Advanced Console Environment
- Develop your research-specific console application using one of the imbACE: Application Templates
- imbSCI: Coding for Science Foundation
- Use reporting and data annotation features for your application
- Use data structures and serialization tools for your application
For developers, researchers in Natural Language Processing and Web Content Mining / Information Retrieval fields:
- imbNLP: Natural Language Processing
- Develop a NLP parser and/or plugin
- Develop facts extraction plugin
- Develop knowledge extraction rules / plugin
- imbWEM: Web Exploration Model
- Develop a Crawler module
- Develop a Crawler frontier rule