I have recently finished a book, Data
Science for Business: What You Need to Know About Data Mining and Data-analytic Thinking, with Foster Provost, published by O'Reilly.
(technically a C.V.)
I am on the editorial board of the Machine
I was a program chair of ICML-03
(The Twentieth International Conference on Machine Learning).
- I guest
edited a special issue of Machine
Learning journal on Data
Mining Lessons Learned, which appeared as volume 15, issues 1-2.
- I gues edited (with Bart Baesens and David
Marten) a special issue of Machine Learning on Swarm
Intelligence for Knowledge Discovery in Data.
CV, Publications, etc.
One of my interests is
the problem of spam detection and filtering, as well as general email
I studied spam and
spam filtering for a while, and then wrote a paper: "In
vivo" spam filtering: A
challenge problem for data mining.
The audience is data mining researchers, but I recommend the paper to
anyone interested in studying the problem. The real spam filtering problem
has a lot of interesting aspects that most people ignore.
I have reviewed for the
Conference on Email and Anti-Spam (CEAS-04,
was program chair of
the Second Conference on Email and Anti-Spam (CEAS-05).
on the Technical
Advisory Council of Proofpoint,
a spam filtering company.