skip to main content
10.1145/3132218.3132222acmotherconferencesArticle/Chapter ViewAbstractPublication PagessemanticsConference Proceedingsconference-collections
research-article

DemandMiner: Extraction and Visualization of Product Demands from Open Web

Published: 11 September 2017 Publication History

Abstract

In this paper, we present the DemandMiner, an online tool that leverages Natural Language Processing and Machine Learning techniques to extract demand related to different industries from large volume of News articles. We also propose techniques to enrich the information components extracted from a document by associating them with their sense oriented entities. The enriched components are then analyzed for generating informed insights to generate contextual reports that can convey greater sense to the decision-makers, analysts and knowledge workers than simple display of events. An interactive visualization system is also provided for searching and querying the underlying collection of data and the derived insights.

References

[1]
Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary Ives. 2007. Dbpedia: A nucleus for a web of open data. The semantic web (2007), 722--735.
[2]
Shenghua Bao, Rui Li, Yong Yu, and Yunbo Cao. 2008. Competitor mining with the web. IEEE Transactions on Knowledge and Data Engineering 20, 10 (2008), 1297--1310.
[3]
Abraham Bernstein, Scott Clearwater, and Foster Provost. 2003. The relational vector-space model and industry classification. In Proceedings of the IJCAI-2003 workshop on learning statistical models from relational data, Vol. 266. Citeseer.
[4]
Roger B Bradford. 2006. Relationship discovery in large text collections using latent semantic indexing. In Proceedings of the Fourth Workshop on Link Analysis, Counterterrorism, and Security. Citeseer.
[5]
Indra Budi and Stephane Bressan. 2007. Application of association rules mining to named entity recognition and co-reference resolution for the Indonesian language. International Journal of Business Intelligence and Data Mining 2, 4 (2007), 426--446.
[6]
Deepayan Chakrabarti and Christos Faloutsos. 2006. Graph mining: Laws, generators, and algorithms. ACM computing surveys (CSUR) 38, 1 (2006), 2.
[7]
Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2, 3 (2011), 27.
[8]
Hsinchun Chen, Roger HL Chiang, and Veda C Storey. 2012. Business intelligence and analytics: From big data to big impact. MIS quarterly 36, 4 (2012), 1165--1188.
[9]
Hsinchun Chen and David Zimbra. 2010. AI and opinion mining. IEEE Intelligent Systems 25, 3 (2010), 74--80.
[10]
Laura Chiticariu, Yunyao Li, and Frederick R Reiss. 2013. Rule-based information extraction is dead! long live rule-based information extraction systems!. In EMNLP. 827--832.
[11]
Jennifer Coates. 1987. Epistemic modality and spoken discourse. Transactions of the Philological society 85, 1 (1987), 110--131.
[12]
Marie-Catherine De Marneffe, Bill MacCartney, Christopher D Manning, et al. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of LREC, Vol. 6. Genoa, 449--454.
[13]
Norberto Fernández, Jesus A Fisteus, Luis Sánchez, and Eduardo Martin. 2010. Webtlab: A cooccurencebased approach to kbp 2010 entity-linking task. In Proc. TAC 2010 Workshop.
[14]
Zhou GuoDong and Su Jian. 2004. A high-performance coreference resolution system using a constraint-based multi-agent strategy. In Proceedings of the 20th international conference on Computational Linguistics. Association for Computational Linguistics, 522.
[15]
Lun-Wei Ku, Hsiu-Wei Ho, and Hsin-Hsi Chen. 2006. Novel relationship discovery using opinions mined from the web. In PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, Vol. 21. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, 1357.
[16]
Lun-Wei Ku, Hsiu-Wei Ho, and Hsin-Hsi Chen. 2009. Opinion mining and relationship discovery using CopeOpi opinion analysis system. Journal of the American Society for Information Science and Technology 60, 7 (2009), 1486--1503.
[17]
Alexandros Labrinidis and Hosagrahar V Jagadish. 2012. Challenges and opportunities with big data. Proceedings of the VLDB Endowment 5, 12 (2012), 2032--2033.
[18]
RY Lau and Wenping Zhang. Semi-supervised statistical inference for business entities extraction and business relations discovery. Balog et al.{3} (????), 41--46.
[19]
Raymond YK Lau, Dawei Song, Yuefeng Li, Terence CH Cheung, and Jin-Xing Hao. 2009. Toward a fuzzy domain ontology extraction method for adaptive e-learning. IEEE transactions on knowledge and data engineering 21, 6 (2009), 800--813.
[20]
Rui Li, Shenghua Bao, Jin Wang, Yong Yu, and Yunbo Cao. 2006. Cominer: An effective algorithm for mining competitors from the web. In Data Mining, 2006. ICDM'06. Sixth International Conference on. IEEE, 948--952.
[21]
Zhongming Ma, Gautam Pant, and Olivia RL Sheng. 2009. A network-based approach to mining competitor relationships from online news. ICIS 2009 Proceedings (2009), 59.
[22]
Zhongming Ma, Olivia RL Sheng, and Gautam Pant. 2009. Discovering company revenue relations from news: A network approach. Decision Support Systems 47, 4 (2009), 408--414.
[23]
James Manyika, Michael Chui, Brad Brown, Jacques Bughin, Richard Dobbs, Charles Roxburgh, and Angela H Byers. 2011. Big data: The next frontier for innovation, competition, and productivity. (2011).
[24]
Vincent Ng. 2007. Semantic class induction and coreference resolution. In AcL. 536--543.
[25]
Eric WT Ngai, Li Xiu, and Dorothy CK Chau. 2009. Application of data mining techniques in customer relationship management: A literature review and classification. Expert systems with applications 36, 2 (2009), 2592--2602.
[26]
Bo Pang, Lillian Lee, et al. 2008. Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval 2, 1--2 (2008), 1--135.
[27]
Gautam Pant and Olivia RL Sheng. 2009. Avoiding the blind spots: Competitor identification using web text and linkage structure. ICIS 2009 Proceedings (2009), 57.
[28]
Jon Patrick, Casey Whitelaw, and Robert Munro. 2002. Slinerc: The sydney language-independent named entity recogniser and classifier. In proceedings of the 6th conference on Natural language learning-Volume 20. Association for Computational Linguistics, 1--4.
[29]
Horacio Saggion, Adam Funk, Diana Maynard, and Kalina Bontcheva. 2007. Ontology-based information extraction for business intelligence. The Semantic Web (2007), 843--856.
  1. DemandMiner: Extraction and Visualization of Product Demands from Open Web

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    Semantics2017: Proceedings of the 13th International Conference on Semantic Systems
    September 2017
    202 pages
    ISBN:9781450352963
    DOI:10.1145/3132218
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • St. Pölten University: St. Pölten University of Applied Sciences, Austria
    • Wolters Kluwer: Wolters Kluwer, Germany
    • Vrije Universeit Amsterdam: Vrije Universeit Amsterdam
    • Semantic Web Company: Semantic Web Company
    • Uinv. Leipzig: Universität Leipzig

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 September 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Business Entity Extraction
    2. Demand Extraction
    3. Supervised Classification
    4. Text Analytics

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    Semantics2017

    Acceptance Rates

    Overall Acceptance Rate 40 of 182 submissions, 22%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 46
      Total Downloads
    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 06 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media