Rubryx 2.1
Rubryx is a software application of pattern classification of web sites.
|
Rubryx is a software application of pattern classification of web sites. It allows classifying a large bulk of specialized textual information and generating web-catalogs, electronic libraries, reference systems on account of expert information and full-text analysis.
The aim of the program is to classify the documents most efficiently. For a successful solution of the task, an accurate selection of the class and threshold value of index K is required. The classes should be selected so that their intersection is minimized and the most bulk of documents is covered. Index K should be chosen so that odd documents are not included into the class (K value is too small) and suitable documents are not sorted away (K value is too big). A number of preliminary classifications may be required.
For preliminary classifications, make approx 1 per cent sample of the general bulk of documents. For example, for 100 thousand web sites to classify, 1 sites is enough for preliminary experiments. On the one hand, 1 sites is a representative sample, on the other hand, classification of such a sample on up-to-date computers will take a few moments.
During classification a part of documents can be excluded from all classes. These documents should be carefully studied. It is possible that new classes should be added to the list. Part of the residual documents may not suit for the generated catalog. Including of a large amount of the same documents into different classes means that the subject matter of the catalog has been poorly divided into classes.
Having obtained good results in sample classification, the whole bulk of documents can be classified. Consequently, you get a number of web sites of qualitative information corresponding to the number of classes.
Limitations
tags web sites preliminary classifications for preliminary documents can are not documents are information and the class classes should
Download Rubryx 2.1
Purchase: Buy Rubryx 2.1
Similar software
Rubryx 2.1
Vladimir Polyakov
Rubryx is a software application of pattern classification of web sites.
Advanced Bulk PDF Merger 1.0
Advanced Reliable Software, In
Advanced Bulk PDF Merger is an affordable utility that automates the task of merging multiple PDF documents into one PDF document.
All-Purpose Legal Documents 1.02
Rodent Software
All-Purpose Legal Documents provides 56 legal documents in 8 categories.
Personal Documents 1.7
Alexis Rios Software
Personal Documents is a useful program that will help you to organize all your documents in a way where whenever you need to look for any of them, you will be able to get to it almost instantly and will never ever will lose a document once again.
X2Net Recent Documents 3.0.2.0
X2Net Limited
X2Net Recent Documents will help you to ever lose track of a file again!
X2Net Recent Documents automatically keeps track of every document you use.
ProLibra 2.0
Sharpeware Ltd
ProLibra is an application that keeps teams in control of their documents, without the need for complex databases or maintenance.
MagicDoc 1.00
TDOC Projects Ltd
MagicDoc is a document scanning, encryption, archiving and retrieval solution suitable for home and business use.
MSD Documents 3.30
MSD Soft
MSD Documents is a documentation manager with two main modules:
A Document manager that allows to catalog any kind of document, allowing also to associate documents to projects and clients.
MSD Documents Multiuser 1.70
MSD Soft
MSD Documents Multiuser is a documentation manager with two main modules:
A Document manager that allows to catalog any kind of document, allowing also to associate documents to projects and clients.
GUIPDFTK 0.48
Dirk Paehl
If PDF is electronic paper, then pdftk is an electronic staple-remover, hole-punch, binder, secret-decoder-ring, and X-Ray-glasses.