skip to main content
10.1145/3152494.3152504acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
research-article

MineAr: using crowd knowledge for mining association rules in the health domain

Published: 11 January 2018 Publication History

Abstract

Crowdsourcing, where the power of the human thinking is harnessed to answer queries that are otherwise difficult for computers to answer, has been successfully used in many applications. A particularly interesting application of crowdsourcing is crowd mining, where given a dataset, patterns are learned by asking questions to the crowd. Crowd mining is extremely useful in situations where either the information is complex or it is not available in a systematic manner. In this paper, we target one such scenario, that of common health practices and cures. A web-based framework, called MineAr, is built to ask simple questions to the crowd. The questions ask whether a common product helps in a disease (such as ginger for cold). The crowd worker can choose an answer from different grades varying from "always" to "never", or can skip if she is not sure. Association rules are then mined from these answers using different aggregation techniques. Since not all the crowd workers can be relied upon, the system takes into account the confidence of the workers and, consequently, the rules are ordered according to importance. We also enhance the framework to enable prediction of answers of a new question for a crowd worker using her history. Finally, we construct a knowledge graph for searching and visualization.

References

[1]
R. Agrawal, T. Imielinski, and A. Swami. 1993. Mining Association Rules between Sets of Items in Large Databases. In SIGMOD. 207--216.
[2]
Almende B.V. 2010. vis.js. (2010). Vis.js is dual licensed under both Apache 2.0 and MIT.
[3]
Yael Amsterdamer, Yael Grossman, Tova Milo, and Pierre Senellart. 2013. Crowd-Miner: Mining Association Rules from the Crowd. PVLDB 6, 12 (2013), 1250--1253.
[4]
Rubi Boim, Ohad Greenshpan, Tova Milo, Slava Novgorodov, Neoklis Polyzotis, and Wang Chiew Tan. 2012. Asking the Right Questions in Crowd Data Sourcing. In ICDE. 1261--1264.
[5]
Sergey Brin, Rajeev Motwani, Jeffrey D. Ullman, and Shalom Tsur. 1997. Dynamic Itemset Counting and Implication Rules for Market Basket Data. In SIGMOD. 255--264.
[6]
Simon Funk. 2006. Netflix Update: Try This at Home. (Dec. 2006).
[7]
Dirk Hovy, Taylor Berg-Kirkpatrick, Ashish Vaswani, and Eduard Hovy. 2013. Learning Whom to Trust with MACE. In Proc. 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 1120--1130.
[8]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix Factorization Techniques for Recommender Systems. (2009).
[9]
Qi Li, Yaliang Li, Jing Gao, Bo Zhao, Wei Fan, and Jiawei Han. 2014. Resolving Conflicts in Heterogeneous Data by Truth Discovery and Source Reliability Estimation. In SIGMOD. 1187--1198.
[10]
Carlos Ordonez, Cesar Santana, and Levien de Braal. 2000. Discovering Interesting Association Rules in Medical Data. In Proc. ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery. 78--85.
[11]
Xiaoyuan Su and Taghi M. Khoshgoftaar. 2009. A Survey of Collaborative Filtering Techniques. Adv. in Artif. Intell. 2009, Article 4 (Jan. 2009), 1 pages.
[12]
Jacob Whitehill, Ting fan Wu, Jacob Bergsma, Javier R. Movellan, and Paul L. Ruvolo. 2009. Whose Vote Should Count More: Optimal Integration of Labels from Labelers of Unknown Expertise. In Advances in Neural Information Processing Systems 22. 2035--2043.
[13]
Adam Wright, Elizabeth S. Chen, and Francine L. Maloney. 2010. An automated technique for identifying associations between medications, laboratory results and problems. J. Biomedical Informatics 43, 6 (2010), 891--901.
[14]
Guo Xintong, Wang Hongzhi, Yangqiu Song, and Gao Hong. 2014. Review: Brief Survey of Crowdsourcing for Data Mining. Expert Syst. Appl. 41, 17 (Dec. 2014), 7987--7994.

Cited By

View all

Index Terms

  1. MineAr: using crowd knowledge for mining association rules in the health domain

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data
      January 2018
      379 pages
      ISBN:9781450363419
      DOI:10.1145/3152494
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 11 January 2018

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. crowd mining
      2. health mining

      Qualifiers

      • Research-article

      Conference

      CoDS-COMAD '18

      Acceptance Rates

      CODS-COMAD '18 Paper Acceptance Rate 50 of 150 submissions, 33%;
      Overall Acceptance Rate 197 of 680 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 05 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media