10 Challenging Problems in Data Mining
Research
In October 2005, we took an initiative to identify 10 challenging
problems in data mining research, by consulting some of the most
active researchers in data mining and machine learning for their
opinions on what are considered important and worthy topics for future
research in data mining. We hope their insights will inspire new
research efforts, and give young researchers (including PhD students)
a high-level guideline as to where the hot problems are located in
data mining.
The identification results were presented at the fifth IEEE International
Conference on Data Mining (ICDM '05).
- Presentation slides: in PDF.
- A companion article in PDF
(reprinted
from the following journal):
Qiang Yang and Xindong Wu (Contributors: Pedro
Domingos, Charles Elkan, Johannes Gehrke, Jiawei Han, David Heckerman,
Daniel Keim, Jiming Liu, David Madigan, Gregory Piatetsky-Shapiro,
Vijay V. Raghavan, Rajeev Rastogi, Salvatore J. Stolfo, Alexander
Tuzhilin, and Benjamin W. Wah), 10 Challenging Problems in Data Mining
Research, International Journal of Information Technology &
Decision Making, Vol. 5, No. 4, 2006, 597-604.
The 10 challenging problems are listed below (where the order of the
listing does not reflect their level of importance):
- Developing a Unifying Theory of Data Mining
- Scaling Up
for High Dimensional Data and High Speed Data Streams
- Mining
Sequence Data and Time Series Data
- Mining Complex Knowledge
from Complex Data
- Data Mining in a Network Setting
- Distributed Data Mining and Mining Multi-agent Data
- Data
Mining for Biological and Environmental Problems
- Data-Mining-Process Related Problems
- Security, Privacy and
Data Integrity
- Dealing with Non-static, Unbalanced and
Cost-sensitive Data
Qiang Yang and Xindong Wu
This page has been accessed times since November 29, 2006.
Last updated: January 18, 2007.