I am currently finishing up a book, Data
Science for Business: Fundamental Principles of Data Mining and Data-analytic Thinking, with Foster Provost, to be published by O'Reilly.
It should be officially released by O'Reilly in July 2013.
In the meantime, it's already being used in draft form in courses worldwide.
(technically a C.V.)
I am on the editorial board of the Machine
I was a program chair of ICML-03
(The Twentieth International Conference on Machine Learning).
- I guest
edited a special issue of Machine
Learning journal on Data
Mining Lessons Learned, which appeared as volume 15, issues 1-2.
- I am currently guest editing (with Bart Baesens and David
Marten) a special issue of Machine Learning on Swarm
Intelligence for Knowledge Discovery in Data. The Call for
Papers is here.
CV, Publications, etc.
One of my interests is
the problem of spam detection and filtering, as well as general email
I studied spam and
spam filtering for a while, and then wrote a paper: "In
vivo" spam filtering: A
challenge problem for data mining.
The audience is data mining researchers, but I recommend the paper to
anyone interested in studying the problem. The real spam filtering problem
has a lot of interesting aspects that most people ignore.
I have reviewed for the
Conference on Email and Anti-Spam (CEAS-04,
was program chair of
the Second Conference on Email and Anti-Spam (CEAS-05).
on the Technical
Advisory Council of Proofpoint,
a spam filtering company.