![]() Assistant Professor | Address:
Department of Computer Science University of California - Davis Kemper Hall 1 Shields Avenue, CA 95616 | Contact Details: Telephone: (530) 752-5764 Fax: (530) 752-4767 Email: davidson replace-with-at cs dot ucdavis dot edu Office: 3025 Kemper Hall |
Hello, I'm an assistant Professor in the computer science department working on mainly data mining algorithm developent. Thanks to the NSF and Google for supporting my research projects and my wife for these two projects. In my spare time I enjoy history, am an avid movie buff and playing golf.
We are running a workshop at KDD this year on Data Mining with Constraints: Workshop Webpage and CFP
New Journal of DMKD Paper on Constraints and Agglomerative clustering: PDF
The Book has been sent to the publishers, check it out! Click here for book details, Buy from Amazon
I work in the fields of data
mining & machine learning and enjoy both formulating problems and applying
rigorous mathematical techniques to address them. I particularly like applying
these techniques to problems of social importance (typically scientific data), my current focus is mining
the results of pandemic micro-simulation data for pandemic preparation (with
Virginia Techs VBI). Ive divided my
papers by problem area below. I have made use of techniques from information
theory, probability, statistics and worst-case complexity analysis and am
interested in investigating the application of new techniques to DM and AI
problems.
I have several research assistantship slots available for students
who are motivated to do research in data mining. My research is a mix
of rigorous algorithm design, implementation and application to novel
areas of social importance. Please read some of my work and contact me
if your genuinely interested. Do NOT send form letters.
I bought the constrained-clustering.org
domain name and its being set up as a central repository for things related to clustering with constraints.
So far it only contains my work but it will be expanded. Feel free to email me
for suggestions.
If you would like to read about
me and my personal interests then click here.
Best research paper award at SIAM Data Mining Conference 2005
Best application paper award IEEE Data Mining Conference 2006.
I am most thankful for the NSF for supporting my work via a NSF CAREER Award 2007-2011.
I like working with undergraduates and graduate students, so email me if your
interested in my research areas.
Ashwin Satyanarayana (SUNY) Graduated
Ke Yin. (SUNY) Graduated
Constrained Clustering: Advances in
Algorithms, Applications and Theory, In Preparation Due out 2008
co-edited with Sugato Basu and Kiri Wagstaff. CRC Press.
Click here for book details,
Buy from Amazon
Knowledge Discovery and Data Mining: Challenges and Realities of Mining Real
World Data with co-edited with Hill Zhu, Ideal Press 2007.
Click here for ToC
IEEE ICDM 2005 Clustering with Constraints Theory and
Practice (with S.Basu SRI) ACM KDD 2006
Clustering with Constraints Theory and Practice (with S.Basu SRI) PDF of
Slides
PC Co-Chair Workshop on Data Mining with Constraints - co-located with ACM KDD Conference, 2008.
Workshop Webpage
Area Chair: ECML/PKDD 2008 Co-Workshop Chair, 9th Area Chair: ECML/PKDD 2008 Tutorial Chair, 8th Publicity Chair, 6th Co-Guest Editor, Special Issue on Mining Low Quality Data,
KAIS: Click
here for CFP PC Member: ACM SIGKDD
Conference (KDD07, 08),
ACM CIKM Conference, 2006,2007 I also
review for Pattern Recognition Letters, IEEE KDE, KAIS, IEEE PAMI and JMLR
amongst others.
Pacific
AAAI Conference (AAAI07,06) IEEE DM Conference, (ICDM07)
Many papers have a journal or technical report analogs
which contain proofs and derivations that could not fit into the conference
version as well as additional results. Please email me for pointers to these
extended versions. Constrained
Clustering (In collaboration with S.
Basu (SRI), S.S. Ravi (SUNY) and K. Wagstaff (NASA-JPL)) Davidson I. Ester M. and Ravi, S.S., Efficient Incremental Clustering with
Constraints, 13th ACM Knowledge Discovery and Data Mining Conference
2007
PDF
Ge, R., Ester, M., Jin W., and Davidson I., Constraint Driven Clustering,
13th ACM Knowledge Discovery and Data Mining Conference 2007
Davidson I., Ravi, S.S., Intractability and Clustering with Constraints, To Appear in the
Proceeding of ICML 2007
PDF
Davidson I., Basu S.S., Clustering with Constraints Bibliography, To Appear,
PDF
Davidson I., Wagstaff, K., Basu, Sugato., Measuring
Constraint-Set Utility for Partitional Clustering Algorithms , To Appear in the
Proceeding of ECML/PKDD 2006 (acceptance rate 9%) PDF,
Learning
With Biased Training Sets (In collaboration with Wei Fan (IBM Watson)) Wei Fan and Ian Davidson
"On Sample Selection Bias and Its Efficient Correction via Model Averaging
and Unlabeled Examples", SIAM Data Mining Conference 2007 (acceptance rate
9%). Kun Zhang, Wei Fan,
Xiaojing Yuan, Ian Davidson, and Xiangshang Li, "Forecasting Skewed Biased
Stochastic Ozone Days", IEEE ICDM 2006 (acceptance rate 9%). , Winner Best Paper Award, PDF,
Fan W.,
Davidson I., Zadrozny B. and Yu P., An Improved Categorization of Classifier's
Sensitivity on Sample Selection Bias, 5th
IEEE International Conference on Data Mining, ICDM 2005. (acceptance rate
18%) PDF Applications Davidson
I., Paul, Goutam, Locating Secret Messages in Images, Research Track: 10th ACM KDD Conference 2004: Research Track,
Seattle (acceptance rate 29%) PDF Yin, K.
and Berg G., Davidson Davidson, Foundations
of Mining and Learning Using Information Theory, Probability and Statistics Davidson I., Ensemble Approaches for Stable Learners with
Convergence Bounds, 19th AAAI Conference
2004, San Jose (acceptance rate 26%) PDF. Davidson, Davidson, Yin, K., Davidson I., Bayesian Model Averaging Across
Model Spaces via Compact Encoding, The 8th International Symposium on A.I. and
Math pdf
Davidson, I., "Minimum Message Length Clustering
Using Gibbs Sampling", 16th International Conference on Uncertainty in
A.I., 2000 PDF
Davidson
I., Fan, Wei., When Efficient Model Averaging Out-Performs Boosting and Bagging
, To Appear in the Proceeding of ECML/PKDD 2006 (acceptance rate 18%) PDF,
Fan, W. and Davidson I.,
ReverseTesting: An Efficient Framework to Select Amongst Classifiers under Sample
Selection Bias, To Appear 12th
ACM KDD Conference, Philadelphia, 2006. (acceptance rate 11%) PDF
Davidson I. et al, A
General Approach to Incorporate Data Quality Matrices into Data Mining
Algorithms, 10th ACM KDD Conference 2004:
Industrial Track, 2004. (Email me for Journal/TR version). Seattle. PDF
I mainly teach in the sub-fields
of data mining and machine learning which are a very active area of research
used extensively in the high tech industry though often under different titles.
Data mining, speech recognition, natural language processing, machine learning
and expert systems are all sub-fields of A.I. In addition applications of A.I.
to specific domains are also popular such as bio-informatics, data mining and
machine vision. Spring 2005 CSI 445/661 Data Mining and CSI 535
Introduction to Artificial Intelligence Fall 2005 CSI 635 A.I. II Machine Learning Spring 2006 CSI 431/531 Data Mining and CSI 535
Introduction to Artificial Intelligence For courses at U.C. Davis please see myucdavis course web pages. This
page is always under construction davidson at cs dot ucdavis dot edu