skip to main content
10.1145/1178782.1178785acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Synchronizing multimodal data streams acquired using commodity hardware

Published: 27 October 2006 Publication History

Abstract

We have developed tools and techniques that allow video frame level synchronization of multiple free-running commodity video cameras,microphones,and computer nodes using non-realtime operating systems.The techniques rely on physical audiovisual synchronization pulses,statistical procedures to correlate and interpolate the multiple timestamp streams,and software tools for review to produce smoothed and drift-corrected timestamp streams in our multimodal corpora. In this article we present those techniques and tools. Our project is open source and we are seeking collaborative developers for future work.

References

[1]
National Institute of Standards and Technology. http://www.nist.gov/
[2]
Smart Space Project. http://www.nist.gov/smartspace/
[3]
Automatic Meeting Recognition Project. http://www.nist.gov/speech/test beds/mr proj/
[4]
M. Michel, V. Stanford and O. Galibert (2005). Network Transfer of Control Data: An Application of the NIST Smart Data Flow. Proceedings of CCCT 2003. Extended version published in 2005 in the Journal of Systemics, Cybernetics and Informatics (Volume 2, Number 6).
[5]
V. Stanford, J. Garofolo, O. Galibert, M. Michel and Christophe Laprun (2003). The NIST Smart Space and Meeting Room projects: Signals, Acquisition, Annotation, and Metrics. Proceedings of ICASSP 2003.
[6]
R. Xu, G. Mei, Z. Ren, C. Kwan, J. Aube and C. Rochet and V. Stanford (2006). Towards User Sensitive Interfaces: Autodirective Speech Acquisition for Speaker Identification and Speech Recognition Using Phased Arrays. Springer Lecture Notes On Computer Science AI Subseries, 2006 Yang Cai, Ph.D.Ed.
[7]
J. Garofolo, C. Laprun, M. Michel, V. Stanford and Elham Tabassi (2004).The NIST Meeting Room Pilot Corpus. International Conference on Language Resources and Evaluation (LREC '04)'s Speech Corpora and Annotation/Processing Tools.
[8]
J. Garofolo, M. Michel, V. Stanford, E. Tabassi, J. Fiscus,C. Laprun, N. Pratz and J. Lard (2004). NIST Meeting Pilot Corpus Speech (ISBN 1-58563-302-x). http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2004S09
[9]
J. Garofolo, M. Michel, V. Stanford, E. Tabassi, J. Fiscus, C. Laprun, N. Pratz, J. Lard and S. Strassel (2004). NIST Meeting Pilot Corpus Transcripts and Metadata (ISBN 1-58563-303-8). http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2004T13
[10]
Network Time Protocol.http://www.ntp.org/
[11]
B. Widrow and E. Walach (1996. Adaptive Inverse Control. Englewood Cliffs, NJ: Prentice-Hall.
[12]
Rich Transcription 2002 Meeting Recognition Evaluation,documentation. http://www.nist.gov/speech/tests/rt/rt2002/
[13]
Rich Transcription 2002 STT and Metadata Extraction results,presentations,RT-02 Workshop. http://www.nist.gov/speech/tests/rt/rt2002/presentations/index.htm
[14]
Rich Transcription 2004 Spring Meeting Recognition Evaluation, documentation. http://www.nist.gov/speech/tests/rt/rt2004/spring/
[15]
Systems Plus,Inc.http://www.sysplus.com/

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
VSSN '06: Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
October 2006
230 pages
ISBN:1595934960
DOI:10.1145/1178782
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. audio/video synchronization
  2. commodity hardware
  3. data streams
  4. timestamps

Qualifiers

  • Article

Conference

MM06
MM06: The 14th ACM International Conference on Multimedia 2006
October 27, 2006
California, Santa Barbara, USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media