skip to main content
10.1145/584792.584864acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

A singer identification technique for content-based classification of MP3 music objects

Published: 04 November 2002 Publication History

Abstract

As there is a growing amount of MP3 music data available on the Internet today, the problems related to music classification and content-based music retrieval are getting more attention recently. In this paper, we propose an approach to automatically classify MP3 music objects according to their singers. First, the coefficients extracted from the output of the polyphase filters are used to compute the MP3 features for segmentation. Based on these features, an MP3 music object can be decomposed into a sequence of notes (or phonemes). Then for each MP3 phoneme in the training set, its MP3 feature is extracted and used to train an MP3 classifier which can identify the singer of an unknown MP3 music object. Experiments are performed and analyzed to show the effectiveness of the proposed method.

References

[1]
Bakhmutova, V., V. D. Gusev, and T. N. Titkova, "The Search for Adaptations in Song Melodies," Computer Music Journal, Vol. 21, No. 1, pp. 58--67, Spring 1997.
[2]
Brandenburg, K., and G. Stoll, "ISO-MPEG-1 Audio: A Generic Standard for Coding of High Quality Digital Audio," Journal of the Audio Engineering Society, Vol. 42, No. 10, Oct 1994, pp. 780--792.
[3]
Campbell, J.P., Jr., "Speaker Recognition: a Tutorial," Proceedings of the IEEE, Vol. 85, No. 9, Sept. 1997 pp. 1437--1462.
[4]
Chen, J. C. C. and A. L. P. Chen, "Query by Rhythm: An Approach for Song Retrieval in Music Databases," In Proc. of 8th Intl. Workshop on Research Issues in Data Engineering, pp. 139--146, 1998.
[5]
Chibelushi, C.C., F. Deravi, and J. S. D. Mason, "A Review of Speech-Based Bimodal Recognition," IEEE Trans. On Multimedia, Vol. 4, No. 1, pp. 23--37, March 2002.
[6]
Chou, T. C., A. L. P. Chen, and C. C. Liu, "Music Databases: Indexing Techniques and Implementation," in Proc. IEEE Intl. Workshop on Multimedia Data Base Management Systems, 1996.
[7]
Chou, W., and L. Gu, "Robust Singing Detection in Speech/Music Discriminator Design," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 865--868, 2001.
[8]
Foote, J., "Content-Based Retrieval of Music and Audio", Multimedia Storage and Archiving systems II, Proc. SPIE, Vol.3229, pp. 138--147.
[9]
Fukunaga, K., An Introduction to Statistical Pattern Recognition, San Diego, CA, Academic Press, 2nd ed., 1990.
[10]
Ghias, A., Logan, H., Chamberlin, D., and Smith, B. C., "Query by Humming: Musical Information Retrieval in an Audio Database," in Proc. of Third ACM International Conference on Multimedia, pp. 231--236, 1995.
[11]
Hsu, J.L., C.C. Liu and A.L.P. Chen, "Discovering Non-Trivial Repeating Patterns in Music Data," IEEE Transactions on Multimedia, Vol. 3, No. 3, pp. 311--325, 2001.
[12]
ISO/IEC 11172-3:1993, "Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s - Part 3: Audio."
[13]
Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "Music Retrieval by Humming," in Proceedings of IEEE PACRIM'99, pp. 404--407, 1999.
[14]
Kosugi, N., Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, "A Practical Query-By-Humming System for a Large Music Database," In Proc. ACM Multimedia, 2000.
[15]
Lambrou, T. et al., "Classification of Audio Signals Using Statistical Features on Time and Wavelet Transform Domains," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3621--3624, 1998.
[16]
Li, S. Z., "Content-Based Audio Classification and Retrieval Using the Nearest Feature Line Method," IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 5, pp. 619--625, Sept. 2000.
[17]
Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "Efficient Theme and Non-Trivial Repeating Pattern Discovering in Music Databases," in Proc. of IEEE Intl. Conf. on Data Engineering, pp. 14--21, 1999.
[18]
Liu, C. C., A. J. L. Hsu, and A. L. P. Chen, "An Approximate String Matching Algorithm for Content-Based Music Data Retrieval," in Proc. of IEEE Intl. Conf. on Multimedia Computing and Systems, 1999.
[19]
Liu, C. C., and Wei-Yi Kuo, "Content-Based Segmentation of MP3 Music Objects," in Proc. of the Workshop on the 21st Century Digital Life and Internet Technologies, 2001.
[20]
Liu, C. C. and Po-Jun Tsai, "Content-Based Retrieval of MP3 Music Objects," in Proc. of the ACM Intl. Conf. on Information and Knowledge Management (CIKM 2001), 2001.
[21]
Liu, Z. et al., "Audio Feature Extraction and Analysis for Scene Classification," in Proc. IEEE First Workshop on Multimedia Signal Processing, pp. 343--348, 1997.
[22]
Liu, Z. and Q. Huang., "Classification of Audio Events in Broadcast News," in Proc. IEEE Workshop on Multimedia Signal Processing, pp. 364--369, 1998.
[23]
Lu, G.J. and T. Hankinson, "A Technique Towards Automatic Audio Classification and Retrieval," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 1142--1145, 1998.
[24]
Lu, G.J. and T. Hankinson, "An Investigation of Automatic Audio Classification and Segmentation," in Proc. IEEE Intl. Conf. on Signal Processing, Vol. 2, pp. 776--781, 2000.
[25]
Martin, K. D., and Y. E. Kim, "2pMU9. Musical instrument identification : A pattern-recognition approach," in the 136th meeting of the Acoustical Society of America, October 13, 1998.
[26]
Melih, K., and R. Gonzalez, "Audio Retrieval Using Perceptually Based Structures", in Proc. of IEEE International Conference on Multimedia Computing and system, pp 338--347, 1998.
[27]
Melih, K., and R. Gonzalez, "Audio Source Type Segmentation Using a Perceptually Based Representation," in ISSPA 99, Brisbane, Australia, 22--25 August, 1999.
[28]
Mo, J. S., C. H. Han, and Y. S. Kim, "A Melody-Based Similarity Computation Algorithm for Musical Information," in Proc. of Knowledge and Data Engineering Exchange Workshop ?KDEX '99?, pp. 114--121, 1999.
[29]
Moreno, P.J. and R. Rifkin, "Using The Fisher Kernel Method for Web Audio Classification," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 4, pp. 2417--2420, 2000.
[30]
Noll, P., "MPEG Digital Audio Coding," IEEE Signal Processing Magazine, Vol. 14, No. 5, pp. 59--81, Sept. 1997.
[31]
Painter, T. and A. Spanias, "Perceptual Coding of Digital Audio," Proceedings of the IEEE, Vol. 88, No. 4, pp. 451--515, April 2000.
[32]
Pan, D., "A Tutorial on MPEG/Audio Compression," IEEE Multimedia Magazine, Vol. 2, No. 2, pp. 60--74, Summer 1995.
[33]
Rolland, P. Y., G Raskinis, and J. G. Ganascia, "Musical Content-Based Retrieval: an Overview of the Melodiscov Approach and System," In Proc. ACM Multimedia 99, pp. 81--84, 1999.
[34]
Saunders, J., "Real-Time Discrimination of Broadcast Speech/Music," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 993--996, 1996.
[35]
Scheirer, E. and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 1331--1334, 1997.
[36]
Smith, G., H. Murase, H. Kashino, "Quick Audio Retrieval Using Active Search", in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3777--3780, 1998.
[37]
Tsai, Po-Jun and Chih-Chin Liu, "An MP3 Search Engine on the Internet", in Proc. of 2000 Workshop on Internet & Distributed Systems, Vol. 1, pp. 18--27, 2000.
[38]
Tzanetakis, G., G. Essl, and P. Cook, "Automatic Musical Genre Classification of Audio Signals," in Proc. Int. Symposium on Music Information Retrieval (ISMIR), Bloomington, Indiana, 2001.
[39]
Tzanetakis, G., and P. Cook, "A Framework for Audio Analysis Based on Classification and Temporal Segmentation," in Proc. EUROMICRO Conf., Vol. 2, pp. 61--67, 1999.
[40]
Wold, E., T. Blum, D. Keislar, and J. Wheaton, "Contented-Based Classification, Search, and Retrieval of Audio", IEEE Multimedia Vol. 3, No. 3, pp. 27--36, Fall 1996.
[41]
Zhang, T. and C.-C.J. Kuo, "Hierarchical Classification of Audio Data for Archiving and Retrieving," in Proc. IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Vol. 6, pp. 3001--3004, 1999.

Cited By

View all

Index Terms

  1. A singer identification technique for content-based classification of MP3 music objects

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
        November 2002
        704 pages
        ISBN:1581134924
        DOI:10.1145/584792
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 04 November 2002

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. MP3
        2. MP3 classification
        3. MP3 databases
        4. content-based music classification
        5. music classification
        6. music databases
        7. music feature extraction
        8. singer identification

        Qualifiers

        • Article

        Conference

        CIKM02

        Acceptance Rates

        Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

        Upcoming Conference

        CIKM '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)3
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 09 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media