skip to main content
10.1145/3308557.3308683acmconferencesArticle/Chapter ViewAbstractPublication PagesiuiConference Proceedingsconference-collections
short-paper

Programming-by-example for data transformation to improve machine learning performance

Published: 16 March 2019 Publication History

Abstract

In this study, we propose a programming-by-example (PBE)-based data transformation method for feature engineering in machine learning. Data transformation by PBE is not new. However, we utilized the one proposed herein to improve the performance of machine learning in synthesizing a transformation rule from examples. Herein, the system first generates candidate rules, and then chooses the rule that achieves the highest performance in a target machine learning task. We tested this system with the Titanic dataset, and the result shows that the proposed method can avoid worst-case performance compared to the original PBE method.

References

[1]
Sumit Gulwani, William R. Harris, and Rishabh Singh. 2012. Spreadsheet data manipulation using examples. Commun. ACM 55, 8 (2012).
[2]
Zhongjun Jin, Michael R. Anderson, Michael Cafarella, and Hosagrahar V. Jagadish. 2017. Foofah: A programming-by-example system for synthesizing data transformation programs. In ACM SIGMOD. 1607--1610.
[3]
James Max Kanter and Kalyan Veeramachaneni. 2015. Deep feature synthesis: Towards automating data science endeavors. In DSAA.
[4]
David Kofoed Wind. 2014. Concepts in predictive machine learning. Master's thesis. Technical Univ. of Denmark, Denmark.

Cited By

View all

Index Terms

  1. Programming-by-example for data transformation to improve machine learning performance

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    IUI '19 Companion: Companion Proceedings of the 24th International Conference on Intelligent User Interfaces
    March 2019
    173 pages
    ISBN:9781450366731
    DOI:10.1145/3308557
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 March 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data transformation
    2. feature generation
    3. program synthesis
    4. programming-by-example

    Qualifiers

    • Short-paper

    Funding Sources

    • JST CREST

    Conference

    IUI '19
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 746 of 2,811 submissions, 27%

    Upcoming Conference

    IUI '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 05 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media