CRM114 Discriminator 20060704a review

Download
by rbytes.net on

CRM114 is a system to examine incoming e-mail, data files or other data streams, system log streams and to sort, filter, or alter the

License: GPL (GNU General Public License)
File size: 0K
Developer: Crah the Merciless
0 stars award from rbytes.net

CRM114 is a system to examine incoming e-mail, data files or other data streams, system log streams and to sort, filter, or alter the incoming files or data streams according to the user's wildest desires.

Criteria for categorization of data can be by satisfaction of regexes, by sparse binary polynomial matching with a Bayesian Chain Rule evaluator, a Hidden Markov Model, or by other means. Accuracy of the SBPH/BCR classifier has been seen in excess of 99 per cent, for 1/4 megabyte of learning text. In other words, CRM114 learns, and it learns fast .

CRM114 is compatible with SpamAssassin or other spam-flagging software; it can also be pipelined in front of or behind procmail. CRM114 is also useful as a syslog or firewall log filter, to alert you to important events but ignore the ones that aren't meaningful.

People have been able to run CRM114 on Linux, BSD, Mac OS-X, and Windows (natively and with Cygwin), and it has even been integrated with Microsoft Outlook and QUALCOMM Eudora. See the "Cool Things" link below for details. I can't help on any of these except Linux, though if you ask on the mailing list, someone might be able to assist you.

You can get at all of these exciting interconnects (including the Outlook macros) below in "Other CRM114 Cool Things", below.

CRM114 is licensed under the GPL; it is WITHOUT WARRANTY of ANY KIND, and it is BETA/FIELD TEST QUALITY. It's still experimental, be warned.

Use at your own risk, and send me bug reports! Or even better, send me improvements! If your code is substantial, I prefer to dual-license the code (i.e. we both get full rights to it, including the right to reuse and relicense under other licenses).

Not every user gets great results with the default classifier; that's why CRM114 has six different classifiers available. It's easy to switch classifiers to see what the tradeoffs are.

What's New in This Release:
This version makes OSB classification, mailreaver, and reavercacheing the default configuration, along with an improved learning system (mailtrainer.crm) for faster convergence and higher accuracy.
There have been a few edge-condition bugs squashed, and the documentation has been updated as well.
This is the recommended version for all new and upgrade installs.

CRM114 Discriminator 20060704a keywords