research-article

Public Access

Accelerating Innovation Through Analogy Mining

Authors:

Tom Hope,

Joel Chan,

Aniket Kittur,

Dafna ShahafAuthors Info & Claims

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 235 - 243

https://doi.org/10.1145/3097983.3098038

Published: 04 August 2017 Publication History

PDF eReader

Abstract

The availability of large idea repositories (e.g., the U.S. patent database) could significantly accelerate innovation and discovery by providing people with inspiration from solutions to analogous problems. However, finding useful analogies in these large, messy, real-world repositories remains a persistent challenge for either human or automated methods. Previous approaches include costly hand-created databases that have high relational structure (e.g., predicate calculus representations) but are very sparse. Simpler machine-learning/information-retrieval similarity metrics can scale to large, natural-language datasets, but struggle to account for structural similarity, which is central to analogy. In this paper we explore the viability and value of learning simpler structural representations, specifically, "problem schemas", which specify the purpose of a product and the mechanisms by which it achieves that purpose. Our approach combines crowdsourcing and recurrent neural networks to extract purpose and mechanism vector representations from product descriptions. We demonstrate that these learned vectors allow us to find analogies with higher precision and recall than traditional information-retrieval methods. In an ideation experiment, analogies retrieved by our models significantly increased people's likelihood of generating creative ideas compared to analogies retrieved by traditional methods. Our results suggest a promising approach to enabling computational analogy at scale is to learn and leverage weaker structural representations.

Supplementary Material

MP4 File (hope_accelerating_innovation.mp4)

Download
403.26 MB

References

[1]

Sanjeev Arora, Yuanzhi Li, Yingyu Liang, Tengyu Ma, and Andrej Risteski 2016. Linear algebraic structure of word senses, with applications to polysemy. arXiv preprint arXiv:1601.03764 (2016).

Google Scholar

[2]

Sanjeev Arora, Yingyu Liang, and Tengyu Ma 2016natexlabb. A simple but tough-to-beat baseline for sentence embeddings. (2016).

Google Scholar

[3]

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).

Google Scholar

[4]

David M. Blei, Andrew Y. Ng, Michael I. Jordan, and John Lafferty 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research (2003), 993--1022. r, and Robert E Kraut 2014. Searching for analogical ideas with crowds. In Proceedings of the 32nd annual ACM conference on Human factors in computing systems. ACM, 1225--1234.

Google Scholar

[5]

Lixiu Yu, Aniket Kittur, and Robert E Kraut 2016. Encouraging "Outside-the-box" Thinking in Crowd Innovation Through Identifying Domains of Expertise. In Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing. ACM, 1214--1222.

Digital Library

Google Scholar

[6]

L Yu, B Kraut, and A Kittur 2014. Distributed analogical idea generation: innovating with crowds CHI'14.

Google Scholar

[7]

Lixiu Yu, Robert E Kraut, and Aniket Kittur 2016. Distributed Analogical Idea Generation with Multiple Constraints Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing ACM, 1236--1245.

Google Scholar

Cited By

View all

Srinivasan AChan J(2024)Improving Selection of Analogical Inspirations through Chunking and RecombinationProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656207(374-397)Online publication date: 23-Jun-2024
https://dl.acm.org/doi/10.1145/3635636.3656207
Xu XPalani SMunkhbat ALee TDow S(2024)Idea-Centric Search: Four Patterns of Information Seeking During Creative IdeationProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656193(280-291)Online publication date: 23-Jun-2024
https://dl.acm.org/doi/10.1145/3635636.3656193
Kang HLin DMartelaro NKittur AChen YHong M(2024)BioSpark: An End-to-End Generative System for Biological-Analogical Inspirations and IdeationExtended Abstracts of the CHI Conference on Human Factors in Computing Systems10.1145/3613905.3651035(1-13)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613905.3651035
Show More Cited By

Index Terms

Accelerating Innovation Through Analogy Mining
1. Computing methodologies
  1. Artificial intelligence
  2. Machine learning
    1. Machine learning approaches
      1. Learning latent representations

Recommendations

Analogy Mining for Specific Design Needs
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems

Finding analogical inspirations in distant domains is a powerful way of solving problems. However, as the number of inspirations that could be matched and the dimensions on which that matching could occur grow, it becomes challenging for designers to ...
CAM: A Large Language Model-based Creative Analogy Mining Framework
WWW '23: Proceedings of the ACM Web Conference 2023

Analogies inspire creative solutions to problems, and facilitate the creative expression of ideas and the explanation of complex concepts. They have widespread applications in scientific innovation, creative writing, and education. The ability to ...
Accelerating innovation through analogy mining
IJCAI'18: Proceedings of the 27th International Joint Conference on Artificial Intelligence

The availability of large idea repositories (e.g., patents) could significantly accelerate innovation and discovery by providing people inspiration from solutions to analogous problems. However, finding useful analogies in these large, messy, real-world ...

Comments

Information & Contributors

Information

Published In

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 2017

2240 pages

ISBN:9781450348874

DOI:10.1145/3097983

General Chairs:
Stan Matwin
Dalhousie University
,
Shipeng Yu
LinkedIn
,
Faisal Farooq
IBM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 August 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Badges

Best Paper

Author Tags

Qualifiers

Research-article

Funding Sources

Conference

KDD '17

Sponsor:

KDD '17: The 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 13 - 17, 2017

NS, Halifax, Canada

Acceptance Rates

KDD '17 Paper Acceptance Rate 64 of 748 submissions, 9%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Sponsor:
sigkdd
sigkdd

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 3 - 7, 2025

Toronto , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

39
Total Citations
View Citations
6,288
Total Downloads

Downloads (Last 12 months)304
Downloads (Last 6 weeks)50

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Srinivasan AChan J(2024)Improving Selection of Analogical Inspirations through Chunking and RecombinationProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656207(374-397)Online publication date: 23-Jun-2024
https://dl.acm.org/doi/10.1145/3635636.3656207
Xu XPalani SMunkhbat ALee TDow S(2024)Idea-Centric Search: Four Patterns of Information Seeking During Creative IdeationProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656193(280-291)Online publication date: 23-Jun-2024
https://dl.acm.org/doi/10.1145/3635636.3656193
Kang HLin DMartelaro NKittur AChen YHong M(2024)BioSpark: An End-to-End Generative System for Biological-Analogical Inspirations and IdeationExtended Abstracts of the CHI Conference on Human Factors in Computing Systems10.1145/3613905.3651035(1-13)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613905.3651035
Lee YKang HLatzke MKim JBragg JChang JSiangliulue P(2024)PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected PapersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642196(1-19)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642196
Chib SKumar BAsha VSingh NLourens MBanerjee D(2024)Deep Learning Algorithms for Business Management: Ethical Considerations2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE)10.1109/IC3SE62002.2024.10593494(1490-1495)Online publication date: 9-May-2024
https://doi.org/10.1109/IC3SE62002.2024.10593494
Anwar ZAfzal H(2024)Mining crowd sourcing repositories for open innovation in software engineeringAutomated Software Engineering10.1007/s10515-023-00410-z31:1Online publication date: 8-Jan-2024
https://doi.org/10.1007/s10515-023-00410-z
Huang ZMacNeil S(2023)DesignNet: a knowledge graph representation of the conceptual design spaceProceedings of the 15th Conference on Creativity and Cognition10.1145/3591196.3596614(375-377)Online publication date: 19-Jun-2023
https://dl.acm.org/doi/10.1145/3591196.3596614
Ding ZSrinivasan AMacneil SChan J(2023)Fluid Transformers and Creative Analogies: Exploring Large Language Models’ Capacity for Augmenting Cross-Domain Analogical CreativityProceedings of the 15th Conference on Creativity and Cognition10.1145/3591196.3593516(489-505)Online publication date: 19-Jun-2023
https://dl.acm.org/doi/10.1145/3591196.3593516
MacNeil SHuang ZChen KDing ZYu ANakai KDow S(2023)Freeform Templates: Combining Freeform Curation with Structured TemplatesProceedings of the 15th Conference on Creativity and Cognition10.1145/3591196.3593337(478-488)Online publication date: 19-Jun-2023
https://dl.acm.org/doi/10.1145/3591196.3593337
Hope TDowney DWeld DEtzioni OHorvitz E(2023)A Computational Inflection for Scientific DiscoveryCommunications of the ACM10.1145/357689666:8(62-73)Online publication date: 25-Jul-2023
https://dl.acm.org/doi/10.1145/3576896
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Analogy Mining for Specific Design Needs

CAM: A Large Language Model-based Creative Analogy Mining Framework

Accelerating innovation through analogy mining