skip to main content
research-article

Domain-Driven Data Mining: Challenges and Prospects

Published: 01 June 2010 Publication History

Abstract

Traditional data mining research mainly focus]es on developing, demonstrating, and pushing the use of specific algorithms and models. The process of data mining stops at pattern identification. Consequently, a widely seen fact is that 1) many algorithms have been designed of which very few are repeatable and executable in the real world, 2) often many patterns are mined but a major proportion of them are either commonsense or of no particular interest to business, and 3) end users generally cannot easily understand and take them over for business use. In summary, we see that the findings are not actionable, and lack soft power in solving real-world complex problems. Thorough efforts are essential for promoting the actionability of knowledge discovery in real-world smart decision making. To this end, domain-driven data mining (D^3M) has been proposed to tackle the above issues, and promote the paradigm shift from “data-centered knowledge discovery” to “domain-driven, actionable knowledge delivery.” In D^3M, ubiquitous intelligence is incorporated into the mining process and models, and a corresponding problem-solving system is formed as the space for knowledge discovery and delivery. Based on our related work, this paper presents an overview of driving forces, theoretical frameworks, architectures, techniques, case studies, and open issues of D^3M. We understand D^3M discloses many critical issues with no thorough and mature solutions available for now, which indicates the challenges and prospects for this new topic.

Cited By

View all

Index Terms

  1. Domain-Driven Data Mining: Challenges and Prospects

    Recommendations

    Reviews

    Russel Pears

    Cao highlights the need to incorporate domain information into the data mining process. The paper raises a number of important issues: improving the actionability of knowledge generated by data mining; improving the interpretability of knowledge discovered by data mining; and the conversion of models to a form that can be easily executed in production systems. Although these issues have surfaced from time to time in the data mining community, they have not been given the attention they deserve. The paper presents domain-driven data mining, a new methodology that uses explicit knowledge from domain experts at an early stage of the data mining process, rather than involving them in a post-mining phase. It discusses how technical and business interest measures can be combined in order to produce knowledge that has both statistical rigor and business value. The paper acknowledges that there is still much work to be done-not merely in producing better data mining algorithms, but also in building the necessary infrastructure to support mining, providing support for different stakeholders, and, the most difficult issue of all, measuring the actionability of knowledge. This paper is most relevant to researchers who are attempting to devise interest measures to validate the patterns discovered by data mining, as well as to industry practitioners who are engaged in the business intelligence arena. Online Computing Reviews Service

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Information & Contributors

    Information

    Published In

    cover image IEEE Transactions on Knowledge and Data Engineering
    IEEE Transactions on Knowledge and Data Engineering  Volume 22, Issue 6
    June 2010
    158 pages

    Publisher

    IEEE Educational Activities Department

    United States

    Publication History

    Published: 01 June 2010

    Author Tags

    1. Data mining
    2. actionable knowledge discovery and delivery.
    3. domain-driven data mining (D^3M)

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 04 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media