skip to main content
10.1145/2390848.2390852acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Building a personalized audio equalizer interface with transfer learning and active learning

Published: 02 November 2012 Publication History

Abstract

Potential users of audio production software, such as audio equalizers, may be discouraged by the complexity of the interface and a lack of clear affordances in typical interfaces. In this work, we create a personalized on-screen slider that lets the user manipulate the audio with an equalizer in terms of a descriptive term (e.g. "warm"). The system learns mappings by presenting a sequence of sounds to the user and correlating the gain in each frequency band with the user's preference rating. This method is extended and improved on by incorporating knowledge from a database of prior concepts taught to the system by prior users. This is done with a combination of active learning and simple transfer learning. Results on a study of 35 participants show personalized audio manipulation tool can be built with 10 times fewer interactions than is possible with the baseline approach.

References

[1]
Burton, J., "Ear Machine iQ: Intelligent Equaliser Plug-in for MacOS & Windows," Sound on Sound Magazine, 2011, Sound on Sound Ltd.: United Kingdom.
[2]
Sabin, A., Rafii, Z. and B. Pardo, "Weighting function-based rapid mapping of descriptors to audio processing parameters," Journal of the Audio Engineering Society, 2011, vol. 59(6): p. 419--430.
[3]
Ear Machine's iQ equalization plugin. Available from: http://www.ear-machine.com/iQ.html.
[4]
Neuman, A. C., et al., "An evaluation of three adaptive hearing aid selection strategies," Journal of the Acoustical Society of America, 1987. 82(6): p. 1967--1976.
[5]
Durant, E. A., et al., "Efficient Perceptual Tuning of Hearing Aids with Genetic Algorithms," IEEE Transactions on Speech and Audio Processing, 2004. 12(2): p. 144--155.
[6]
Wakefield, G. H., et al., "Genetic Algorithms for Adaptive Psychophysical Procedures: Recipient-Directed Design of Speech-Processor MAPs," Ear Hear, 2005. 26 (4 Suppl.): p. 57S--72S
[7]
Kuk, F. K. and N. M. Pape, "The reliability of a modified simplex procedure in hearing aid frequency-response selection," Journal of Speech and Hearing Research,1992. 35(2): p. 418--429.
[8]
Mecklenburg, S. and J. Loviscach, "subjEQt: Controlling an equalizer through subjective terms," Extended Abstracts of the Proc. of ACM CHI 2006. NY: ACM Press.: Montreal, Canada. p. 1109--1114.
[9]
Reed, D., "Capturing perceptual expertise: a sound equalization expert system," Knowledge-Based Systems, 2001. 14: p. 111--118.
[10]
Morris, D. and J. Secretan, "Computational creativity support: Using algorithms and machine learning to help people be more creative," Ext. Abs. of the Proc. of ACM CHI 2009, Boston, USA. p. 4733--4736.
[11]
Simon, I., Morris, D., and Basu, S., "MySong: Automatic accompaniment generation for vocal melodies," Proceedings of ACM CHI 2008, Florence, Italy. p. 725--734.
[12]
Fogarty, K., Tan, D., Kapoor, A., and Winder, S., "CueFlik: Interactive Concept Learning in Image Search," Proc. of ACM CHI 2008, Florence, Italy. p. 29--38.
[13]
Fiebrink, R., D. Morris, and M. R. Morris. "Dynamic mapping of physical controls for tabletop groupware," Proceedings of ACM CHI 2009. Boston, MA.
[14]
Fiebrink, R., D. Trueman, and P. R. Cook, "A meta-instrument for interactive, on-the-fly machine learning," Proceedings of New Interfaces for Musical Expression (NIME) 2009, Pittsburgh, PA.
[15]
Cohn, D., Atlas, L., and Ladner, R., "Improving Generalization with Active Learning," Machine Learning, 1994 15: p. 201--221.
[16]
Settles, B., Active Learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool, 2011. ISBN: 9781608457250.
[17]
Bloodgood, M. a. C.-B., C. Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation. in Proc. of the Association for Computational Linguistics (ACL 2010). Uppsala, Sweden.
[18]
Pan, S. a. Y., Q., "A Survey on Transfer Learning," IEEE Transactions on Knowledge and Data Engineering, 2010. 22(10): p. 1345--1359.
[19]
Ando, R. and T. Zhang, "A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data," Journal of Machine Learning Research, 2005. 6(Nov): p. 1817--1853.
[20]
Gajos, K. a. W., D. S., SUPPLE: Automatically Generating User Interfaces., Proc. of Intelligent User Interfaces (IUI 2004), 2004, Funchal, Portugal. p. 93--100.
[21]
Lin, J. and J. A. Landay, Damask, "A Tool for Early-stage Design and Prototyping of Multi-device User Interfaces," Proc. of the 8th International Workshop on Visual Computing, 2002. p. 573--580.
[22]
Nichols, J., Myers, B. A., Higgins, M., Hughes, J., Harris, T. K., Rosenfeld, R., and M. Pignol, "Generating remote control interfaces for complex appliances," UIST '02 - Proceedings of the 15th annual ACM Symposium on User Interface Software and Technology, 2002, Paris, France. p. 161--170.
[23]
Pardo, B., Little, D. and D. Gergle, "Towards Speeding Audio EQ Interface Building with Transfer Learning, "Proceedings of New Interfaces for Musical Expression (NIME) 2012, Ann Arbor, MI, USA, May 21-23, 2012
[24]
Su, X. and T. M. Khoshgoftaar, "A Survey of Collaborative Filtering Techniques," Advances in Artificial Intelligence, 2009. Volume 2009 (2009).

Cited By

View all

Index Terms

  1. Building a personalized audio equalizer interface with transfer learning and active learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MIRUM '12: Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies
      November 2012
      82 pages
      ISBN:9781450315913
      DOI:10.1145/2390848
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 02 November 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. active learning
      2. audio
      3. equalizer
      4. interface
      5. music
      6. transfer learning

      Qualifiers

      • Research-article

      Conference

      MM '12
      Sponsor:
      MM '12: ACM Multimedia Conference
      November 2, 2012
      Nara, Japan

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)28
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 28 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media