skip to main content
10.1145/1839294.1839365acmotherconferencesArticle/Chapter ViewAbstractPublication PagespetraConference Proceedingsconference-collections
research-article

Benchmarking dynamic time warping for music retrieval

Published: 23 June 2010 Publication History

Abstract

We study the performance of three dynamic programming methods on music retrieval. The methods are designed for time series matching but can be directly applied to retrieval of music. Dynamic Time Warping (DTW) identifies an optimal alignment between two time series, and computes the matching cost corresponding to that alignment. Significant speed-ups can be achieved by constrained Dynamic Time Warping (cDTW), which narrows down the set of positions in one time series that can be matched with specific positions in the other time series. Both methods are designed for full sequence matching but can also be applied for subsequence matching, by using a sliding window over each database sequence to compute a matching score for each database subsequence. In addition, SPRING is a dynamic programming approach designed for subsequence matching, where the query is matched with a database subsequence without requiring the match length to be equal to the query length. SPRING has a lower computational cost than DTW and cDTW. Our database consists of a set of MIDI files taken from the web. Each MIDI file has been converted to a 2-dimensional time series, taking into account both note pitches and durations. We have used synthetic queries of fixed size and different noise levels. Surprisingly, when looking for the top-K best matches, all three approaches show similar behavior in terms of retrieval accuracy for small values of K. This suggests that for the specific application area, a computationally cheaper method, such as SPRING, is sufficient to retrieve the best top-K matches.

References

[1]
T. Argyros and C. Ermopoulos. Efficient subsequence matching in time series databases under time and amplitude transformations. In International Conference on Data Mining, pages 481--484, 2003.
[2]
V. Athitsos, P. Papapetrou, M. Potamias, G. Kollios, and D. Gunopulos. Approximate embedding-based subsequence matching of time series. In ACM SIGMOD, 2008.
[3]
D. Bainbridge. MELDEX: A Web-based Melodic Locator Service. Computing in Musicology, 11:223--229, 1998.
[4]
R. Bellman. The theory of dynamic programming. Bull. Amer. Math. Soc., 60(6):503--515, 1954.
[5]
L. Bergroth, H. Hakonen, and T. Raita. A survey of longest common subsequence algorithms. In SPIRE'00: Proceedings of the Seventh International Symposium on String Processing Information Retrieval (SPIRE'00), pages 39--48.
[6]
B. Bollobás, G. Das, D. Gunopulos, and H. Mannila. Time-series similarity problems and well-separated geometric sets. In Proceedings of the thirteenth annual symposium on Computational geometry, pages 454--456. ACM New York, NY, USA, 1997.
[7]
K.-P. Chan and A. W.-C. Fu. Efficient time series matching by wavelets. In ICDE, pages 126--133, 1999.
[8]
L. Chen and R. T. Ng. On the marriage of lp-norms and edit distance. In VLDB, pages 792--803, 2004.
[9]
L. Chen, M. T. Özsu, and V. Oria. Robust and fast similarity search for moving object trajectories. In SIGMOD Conference, pages 491--502, 2005.
[10]
R. Dannenberg and N. Hu. Understanding search performance in query-by-humming systems. In Proc. ISMIR, pages 232--237, 2004.
[11]
C. Faloutsos, M. Ranganathan, and Y. Manolopoulos. Fast subsequence matching in time-series databases. In ACM International Conference on Management of Data (SIGMOD), pages 419--429, 1994.
[12]
W. Gao, G. Fang, D. Zhao, and Y. Chen. Transition movement models for large vocabulary continuous sign language recognition. In Automatic Face and Gesture Recognition, pages 553--558, 2004.
[13]
Z. Ghahramani and M. I. Jordan. Factorial hidden markov models. Machine Learning, 29:245--275, 1997.
[14]
E. Keogh. Exact indexing of dynamic time warping. In International Conference on Very Large Data Bases, pages 406--417, 2002.
[15]
E. Keogh and M. Pazzani. Scaling up dynamic time warping for data mining applications. In Proc. of SIGKDD, 2000.
[16]
J. B. Kruskall and M. Liberman. The symmetric time warping algorithm: From continuous to discrete. In Time Warps. Addison-Wesley, 1983.
[17]
L. Latecki, V. Megalooikonomou, Q. Wang, R. Lakämper, C. Ratanamahatana, and E. Keogh. Elastic partial matching of time series. In PKDD, pages 577--584, 2005.
[18]
K. Lemström and E. Ukkonen. Including interval encoding into edit distance based music comparison and retrieval. In Proc. AISB, pages 53--60, 2000.
[19]
V. I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics, 10(8):707--710, 1966.
[20]
D. Mazzoni and R. Dannenberg. Melody matching directly from audio. In 2nd Annual International Symposium on Music Information Retrieval, pages 17--18, 2001.
[21]
Y. Moon, K. Whang, and W. Han. General match: a subsequence matching method in time-series databases based on generalized windows. In ACM International Conference on Management of Data (SIGMOD), pages 382--393, 2002.
[22]
Y. Moon, K. Whang, and W. Loh. Duality-based subsequence matching in time-series databases. In IEEE International Conference on Data Engineering (ICDE), pages 263--272, 2001.
[23]
P. Morguet and M. Lang. Spotting dynamic hand gestures in video image sequences using hidden Markov models. In IEEE International Conference on Image Processing, pages 193--197, 1998.
[24]
M. Morse and J. Patel. An efficient and accurate method for evaluating time series similarity. In SIGMOD Conference, pages 569--580, 2007.
[25]
R. Oka. Spotting method for classification of real world data. The Computer Journal, 41(8):559--565, July 1998.
[26]
B. Pardo, J. Shifrin, and W. Birmingham. Name that tune: A pilot study in finding a melody from a sung query. Journal of the American Society for Information Science and Technology, 55(4):283--300, 2004.
[27]
S. Park, W. W. Chu, J. Yoon, and J. Won. Similarity search of time-warped subsequences via a suffix tree. Information Systems, 28(7), 2003.
[28]
S. Park, S. Kim, and W. W. Chu. Segment-based approach for subsequence searches in sequence databases. In Symposium on Applied Computing, pages 248--252, 2001.
[29]
S. Pauws. Cubyhum: A fully operational query by humming system. In Proceedings of ISMIR, pages 187--196, 2002.
[30]
D. Rafiei and A. O. Mendelzon. Similarity-based queries for time series data. In ACM International Conference on Management of Data (SIGMOD), pages 13--25, 1997.
[31]
C. Ratanamahatana and E. J. Keogh. Three myths about dynamic time warping data mining. In SIAM International Data Mining Conference, 2005.
[32]
H. Sakoe and S. Chiba. In Transactions on ASSP, volume 26, pages 43--49.
[33]
Y. Sakurai, C. Faloutsos, and M. Yamamuro. Stream monitoring under the time warping distance. In IEEE International Conference on Data Engineering (ICDE), 2007.
[34]
Y. Sakurai, M. Yoshikawa, and C. Faloutsos. FTW: fast similarity search under the time warping distance. In Principles of Database Systems (PODS), pages 326--337, 2005.
[35]
J. Shifrin, B. Pardo, C. Meek, and W. Birmingham. HMM-based musical query retrieval. In Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pages 295--300. ACM New York, NY, USA, 2002.
[36]
L. Smith, R. McNab, and I. Witten. Sequence-based melodic comparison: A dynamic programming approach. Computing in Musicology, 11:101--117, 1998.
[37]
A. Uitdenbogerd and J. Zobel. Melodic matching techniques for large music databases. In Proceedings of the seventh ACM international conference on Multimedia (Part 1), page 66. ACM, 1999.
[38]
E. Unal, E. Chew, P. Georgiou, and S. Narayanan. Challenging uncertainty in query by humming systems: a fingerprinting approach. IEEE Transactions on Audio Speech and Language Processing, 16(2):359, 2008.
[39]
M. Vlachos, D. Gunopulos, and G. Kollios. Discovering similar multidimensional trajectories. In ICDE, pages 673--684, 2002.
[40]
M. Vlachos, M. Hadjieleftheriou, D. Gunopulos, and E. Keogh. Indexing multi-dimensional time-series with support for multiple distance measures. In ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 216--225, 2003.
[41]
H. Wu, B. Salzberg, G. C. Sharp, S. B. Jiang, H. Shirato, and D. R. Kaeli. Subsequence matching on structured time series data. In ACM International Conference on Management of Data (SIGMOD), pages 682--693, 2005.
[42]
B.-K. Yi, H. V. Jagadish, and C. Faloutsos. Efficient retrieval of similar time sequences under time warping. In IEEE International Conference on Data Engineering, pages 201--208, 1998.
[43]
Y. Zhu and D. Shasha. Warping indexes with envelope transforms for query by humming. In ACM International Conference on Management of Data (SIGMOD), pages 181--192, 2003.

Cited By

View all

Index Terms

  1. Benchmarking dynamic time warping for music retrieval

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    PETRA '10: Proceedings of the 3rd International Conference on PErvasive Technologies Related to Assistive Environments
    June 2010
    452 pages
    ISBN:9781450300711
    DOI:10.1145/1839294
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 June 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dynamic time warping
    2. query-by-humming
    3. time series

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    PETRA '10

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 06 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media