crm114 - classification made easy
DESCRIPTION
Presented to Bmore on Rails on December 14, 2010. bmoreonrails.org hatesitlovesit.com jonathanjulian.com
TRANSCRIPT
CRM114: Classification made easy
Tuesday, December 14, 2010
“CRM114 is a system to examine incoming e-mail, system log streams, data files or other data streams, and to sort, filter, or alter the
incoming files or data streams according to the user's wildest desires.”
“Ruby interface to the CRM114 Controllable Regex Mutilator, an advanced and fast text classifier that
uses sparse binary polynomial matching with a Bayesian Chain Rule evaluator and a hidden
Markov model to categorize data with up to a 99.87% accuracy”
Classification?
Bayesian Filtering
Paul Graham: A Plan for Spam
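To make the idea of Bayesian filtering concrete, here is a minimal sketch of a word-counting naive Bayes classifier in plain Ruby. It is an illustration only: `TinyBayes` and its methods are hypothetical names, the add-one smoothing is an assumption chosen for simplicity, and this is neither the exact scheme from "A Plan for Spam" nor what CRM114 does internally.

```ruby
# Toy naive Bayes text classifier with add-one (Laplace) smoothing.
# TinyBayes is a hypothetical name for illustration; it is not part of
# CRM114 or the crm114 gem.
class TinyBayes
  def initialize(categories)
    @word_counts = categories.map { |c| [c, Hash.new(0)] }.to_h # word frequencies per category
    @doc_counts  = categories.map { |c| [c, 0] }.to_h           # training documents per category
    @vocabulary  = {}                                           # every word seen in training
  end

  # Record the words of one training document under the given category.
  def train(category, text)
    @doc_counts[category] += 1
    tokenize(text).each do |word|
      @word_counts[category][word] += 1
      @vocabulary[word] = true
    end
  end

  # Return the category with the highest log-posterior score.
  def classify(text)
    words = tokenize(text)
    total_docs = @doc_counts.values.sum.to_f
    scores = @word_counts.map do |category, counts|
      total_words = counts.values.sum
      # Smoothed prior for the category, then smoothed likelihood of each word.
      score = Math.log((@doc_counts[category] + 1) / (total_docs + @doc_counts.size))
      words.each do |word|
        score += Math.log((counts[word] + 1.0) / (total_words + @vocabulary.size))
      end
      [category, score]
    end
    scores.max_by { |_, score| score }.first
  end

  private

  # Lowercase the text and split it into simple word tokens.
  def tokenize(text)
    text.downcase.scan(/[a-z']+/)
  end
end

bayes = TinyBayes.new([:spam, :ham])
bayes.train(:spam, "Buy a Watch as a gift at a discount!")
bayes.train(:ham, "I'll see you all tomorrow morning.")
puts bayes.classify("It seems I have to be at the New York office tomorrow.") # prints "ham"
```

With only two training sentences the result rests on very little evidence (here, mostly the shared word "tomorrow"), so a real filter needs far more training data than this.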
• every email client
• replyz.com
• hatesitlovesit
apt-get install crm114
gem install crm114
require 'crm114'

crm = Classifier::CRM114.new([:spam, :ham])
crm.train!(:spam, "Buy a Watch as a gift at a discount!")
crm.train!(:ham, "I'll see you all tomorrow morning.")
crm.classify("It seems I have to be at the New York office tomorrow.")
-rwxr-xr-x 1 jjulian staff 588696 Jul 8 17:41 /usr/local/bin/crm
spam.css
ham.css
live coding demo?
hatesitlovesit.com
Jonathan Julian
@jonathanjulian
jonathanjulian.com
Resources
• http://crm114.sourceforge.net/
• https://github.com/bendiken/crm114
• http://www.elegantchaos.com/node/132
• http://www.paulgraham.com/spam.html
• http://replyz.com
• http://hatesitlovesit.com
• http://www.youtube.com/watch?v=dm7yAWpX1Mc