skip to main content
10.1145/2814895.2814917acmotherconferencesArticle/Chapter ViewAbstractPublication PagesamConference Proceedingsconference-collections
research-article

Embedding sound localization and spatial audio interaction through coincident microphones arrays

Published: 07 October 2015 Publication History

Abstract

This paper discusses a methodology for embedding sound localization techniques for spatial audio interaction, aiming at matching the low computing capabilities of mobile and embedded systems. The main goal is to implement a sound localization system, using a microphone array that combines increased accuracy with compromised computational load and applicable layout size. In particular, four cardioid microphones are placed in a cross-shape arrangement, thus forming a planar coincident microphones array for horizontal direction of arrival estimation. The incorporation of two additional microphones at the perpendicular plane is also considered for 3D audio localization. The implemented system is evaluated through simulation experiments and real-world field measurements in comparison to B-Format based localization. Joint time frequency analysis is considered for improving the localization accuracy in pure SNR conditions. The utilization of multiple arrays is also discussed for 2D and 3D position estimations, as well as signal enhancement by means of time-delay compensation.

References

[1]
Bamford, J. S. 1995. An Analysis of Ambisonic Sound Systems of First and Second Order. Master of Science in Physics Thesis. Waterloo Ontario Canada.
[2]
Benjamin, E., Chen, T. 2005. The Native B-format Microphone: Part I. Audio Engineering Society. New York.
[3]
Benjamin, E., Chen, T. 2005. The Native B-format Microphone: Part II. Audio Engineering Society. New York.
[4]
Brandstein, Michael S. and Harvey F. Silverman. 1997 A practical methodology for speech source localization with microphone arrays. Computer Speech & Language.
[5]
Dimoulas, C. A., Kalliris, G. M., Avdelidis, K. A. and Papanikolaou, G. V. 2009. Spatial audio content management within the MPEG-7 standard of ambisonic localization and visualization descriptions. Audio Engineering Society. Munich.
[6]
Dimoulas, C. A., Kalliris. G. M., Avdelidis. K. A. and Papanikolaou, G. V. 2009. Improved localization of sound sources using multi-band processing of ambisonic components. Audio Engineering Society. Munich.
[7]
Dimoulas, C. A. 2006. Audio-visual processing and content management techniques, for the study of (human) bioacoustics' phenomena. PhD Dissertation (in Greek), Dept. of Electrical and Computer Engineering. Aristotle University of Thessaloniki.
[8]
Dimoulas, C. A., Avdelidis, K. A., Kalliris, G. M. and Papanikolaou, G. V. 2007. Sound source localization and B-format enhancement using soundfield microphone sets. Audio Engineering Society. Vienna.
[9]
Dimoulas C., Kalliris G. and Papanikolaou G. 2008. Soundfield microphone simulation and surround / 3D sound design techniques for 3D-graphics and digital-characters movie production. Proceedings of the 4th Greek National Conference of Hellenic Institute of Acoustics (HELINA) "Akoustiki 2008". pp 59--68. Xanthi, Greece.
[10]
Freiberger K., Sontacchi A. 2011. Similarity-based sound source localization with a coincident microphone array. Proc. of the 14th Int. Conference on Digital Audio Effects (DAFx-11), Paris, France
[11]
Galios K., Dimoulas C. and Kalliris G. 2014. Construction and evaluation of α soundfield microphone. proceedings of the 7th Greek National Conference "Acoustics 2014". pp. 356--362. Thessaloniki
[12]
Gerzon, A. M. 1975. The Design of Precisely Coincident Microphone Arrays for Stereo and Surround Sound. Mathematical Institute. University of Oxford. England.
[13]
Gerzon, M. 1985. Ambisonics in Multichannel Broadcasting and Video", J. Audio Eng. Soc., vol. 33, no.11, pp. 859--871.
[14]
Goussios, Christos A., George M. Kalliris, and Christos V. Sevastiadis. 2007. Outdoor and Indoor Recording for Motion Picture. A Comparative Approach on Microphone Techniques. Audio Engineering Society Convention 122. Audio Engineering Society.
[15]
Gustafsson T, Rao D. B., Trivedi M. 2003. Source Localization in Reverberant Enviroments: Modeling and Statistical Analysis. IEEE Transactions on Speech and Signal Processing.
[16]
Hamasaki K. et al. 2004. Advanced Multichannel Audio Systems with Superior Impression of Presence and Reality. 116th AES Convention. Preprint 6053.
[17]
Papanikolaou, G., Kalliris, G., Goussios, C., & Dimoulas, C. 2003. An Application of Ambiophony for the Enhancement of the Reverberant Environment inside the Walls of a Byzantine Castle. In Audio Engineering Society Convention 114. Audio Engineering Society. Chicago
[18]
Petridis, V., Doulgeri, Z., Petrou, L., Symeonidis, A. and Dimoulas, C. 2013. RoboCup Rescue 2013 - Robot League Team PANDORA (Greece). technical description paper (tdp) describing the PANDORA's RoboCup 2013 participation.
[19]
Pulkki, V. 1997. Virtual Sound Source Positioning Using Vector Base Amplitude Panning. Journal of the Audio Engineering Society. Vol.45, issue 6, pp.456--466.
[20]
Pulkki, V. 2007 Spatial Sound Reproduction with Directional Audio Coding. Journal of the Audio Engineering Society, Vol. 55, no. 6, pp. 503--516.
[21]
Sevastiadis, C., Chamzas, C., Kalliris, G. and Papanikolaou G. 1999. Software for estimation and processing simulation of microphone line arrays. J. Acoust. Soc. Am. 105, p.1253 (A).
[22]
Vegiris, C. E., Avdelidis, K. A., Dimoulas, C. A., and Papanikolaou, G. V. 2008. Live Broadcasting of High Definition Audiovisual Content Using HDTV over Broadband IP Networks, Hindawi Publishing Corporation. International Journal of Digital Multimedia Broadcasting. vol 2008. Article ID 250654. 18 pages.
[23]
Vilkamo, J. 2008. Spatial Sound Reproduction with Frequency Band Processing of B-format Audio Signals. Master Thesis. Helsinki University of Technology.
[24]
Vryzas N. 2014. Sound Source Localization Techniques. Diploma of Engineering Thesis. Aristotle University of Thessaloniki.
[25]
Xiaohong Sheng, Yu-Hen Hu. 2003. Energy Based Acoustic Source Localization. University of Wisconsin-Madison. Springer-Verlag Berlin Heidelberg.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
AM '15: Proceedings of the Audio Mostly 2015 on Interaction With Sound
October 2015
250 pages
ISBN:9781450338967
DOI:10.1145/2814895
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Ambisonics
  2. Audio Localization
  3. B-Format
  4. Coincident Microphone Array
  5. Embedded Systems
  6. Energy Based Localization
  7. Multi-Channel Audio Enhancement
  8. Spatial Audio Interaction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

AM '15
AM '15: Audio Mostly 2015
October 7 - 9, 2015
Thessaloniki, Greece

Acceptance Rates

Overall Acceptance Rate 177 of 275 submissions, 64%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)1
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media