CRM114 is a system to examine incoming e-mail, data files or other data streams.
Data can be categorized by matching regexes, by sparse binary polynomial matching with a Bayesian Chain Rule evaluator (SBPH/BCR), by a Hidden Markov Model, or by other means. The SBPH/BCR classifier has shown accuracy in excess of 99 percent with 1/4 megabyte of learning text. In other words, CRM114 learns, and it learns fast.
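
To give a feel for the SBPH/BCR idea, here is a rough Python sketch. It is not CRM114's own code or its exact formula: the window length of 5, the `<skip>` marker, the 0.5 smoothing constant, and all function and class names are assumptions made for illustration. Each new token is combined with every keep-or-skip pattern of the tokens just before it, and the class estimate is then updated one feature at a time with a Bayesian chain rule (done in log space here to avoid underflow).

```python
import math
import re
from collections import Counter

WINDOW = 5  # assumed sliding-window length, in tokens


def tokenize(text):
    """Split text into lowercase whitespace-delimited tokens."""
    return re.findall(r"\S+", text.lower())


def sbph_features(tokens, window=WINDOW):
    """Build sparse phrases: the newest token is always kept, and each of
    the preceding (window - 1) tokens is either kept or replaced by a
    skip marker, giving up to 2**(window - 1) features per token."""
    feats = []
    for i in range(len(tokens)):
        prev = tokens[max(0, i - window + 1):i]
        for mask in range(1 << len(prev)):
            parts = [prev[j] if (mask >> j) & 1 else "<skip>"
                     for j in range(len(prev))]
            parts.append(tokens[i])
            feats.append(" ".join(parts))
    return feats


class ChainRuleClassifier:
    """Toy classifier: per-class feature counts plus a chain-rule update."""

    def __init__(self):
        self.counts = {}  # label -> Counter of feature occurrences

    def learn(self, label, text):
        bucket = self.counts.setdefault(label, Counter())
        bucket.update(sbph_features(tokenize(text)))

    def classify(self, text):
        labels = sorted(self.counts)
        # Start from equal priors (an assumption of this sketch).
        log_post = {c: math.log(1.0 / len(labels)) for c in labels}
        for feat in sbph_features(tokenize(text)):
            seen = {c: self.counts[c][feat] for c in labels}
            total = sum(seen.values())
            if total == 0:
                continue  # feature never learned: carries no evidence
            for c in labels:
                # Smoothed per-feature class probability (assumed form).
                p = (seen[c] + 0.5) / (total + 0.5 * len(labels))
                log_post[c] += math.log(p)
        # Renormalize so the class probabilities sum to 1.
        peak = max(log_post.values())
        z = sum(math.exp(v - peak) for v in log_post.values())
        return {c: math.exp(log_post[c] - peak) / z for c in labels}
```

After calling `learn("spam", ...)` and `learn("good", ...)` on a few sample texts, `classify(text)` returns a dictionary of class probabilities. Because every token contributes many overlapping phrase features, even a small amount of learning text produces a lot of evidence, which is one plausible reading of why this style of classifier converges quickly.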

CRM114 is compatible with SpamAssassin or other spam-flagging software; it can also be pipelined in front of or behind procmail. CRM114 is also useful as a syslog or firewall log filter, to alert you to important events but ignore the ones that aren't meaningful.
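
The log-filtering use can be pictured with a small regex-only sketch in Python. This is not CRM114 syntax, and the patterns and the `important_lines` name are hypothetical, chosen only to illustrate the "alert on important events, ignore the rest" idea.

```python
import re

# Hypothetical patterns: events worth an alert, and routine noise to drop.
ALERT = [re.compile(p) for p in (r"Failed password", r"DROP .*DPT=22\b")]
IGNORE = [re.compile(p) for p in (r"CRON\[\d+\]", r"session (opened|closed)")]


def important_lines(lines, alert=ALERT, ignore=IGNORE):
    """Return only the lines that match an alert pattern and no ignore pattern."""
    kept = []
    for line in lines:
        if any(p.search(line) for p in ignore):
            continue  # known noise: skip even if it also looks alarming
        if any(p.search(line) for p in alert):
            kept.append(line)
    return kept
```

A learning classifier goes further than fixed patterns like these, since it can be trained on examples of meaningful and meaningless events instead of being told the rules up front.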

People have been able to run CRM114 on Linux, BSD, Mac OS X, and Windows (natively and with Cygwin), and it has even been integrated with Microsoft Outlook and QUALCOMM Eudora. See the "Cool Things" link below for details. I can't help with any of these except Linux, though if you ask on the mailing list, someone might be able to assist you.

CRM114 is licensed under the GPL; it is WITHOUT WARRANTY of ANY KIND, and it is BETA/FIELD TEST QUALITY. It's still experimental; be warned.

Use at your own risk, and send me bug reports! Or even better, send me improvements! If your code is substantial, I prefer to dual-license the code (i.e. we both get full rights to it, including the right to reuse and relicense under other licenses).

Not every user gets great results with the default classifier; that's why CRM114 has six different classifiers available. It's easy to switch classifiers to see what the tradeoffs are.

This version makes OSB classification, mailreaver, and reavercacheing the default configuration, along with an improved learning system (mailtrainer.crm) for faster convergence and higher accuracy.
A few edge-condition bugs have been squashed, and the documentation has been updated as well.
This is the recommended version for all new and upgrade installs.

July 10th, 2006, 17:25 GMT
GPL (GNU General Public License) 
Crah the Merciless
CRM114 Discriminator
