skip to main content
10.1145/544220.544224acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

Supporting access to large digital oral history archives

Published: 14 July 2002 Publication History

Abstract

This paper describes our experience with the creation, indexing, and provision of access to a very large archive of videotaped oral histories - 116,000 hours of digitized interviews in 32 languages from 52,000 survivors, liberators, rescuers, and witnesses of the Nazi Holocaust. It goes on to identify a set of critical research issues that must be addressed if we are to provide full and detailed access to collections of this size: issues in user requirement studies, automatic speech recognition, automatic classification, segmentation, summarization, retrieval, and user interfaces. The paper ends by inviting others to discuss use of these materials in their own research.

References

[1]
http://www.vhf.org
[2]
http://www.clsp.jhu.edu/research/malach/
[3]
Bates, Marcia J., 1996. "The Getty End-User Online Searching Project in the Humanities: Report No. 6: Overview and Conclusions." College and Research Libraries 57 (Nov.): 514--523
[4]
Booch, Grady. 1994. Object-Oriented Analysis and Design with Applications. 2. ed. Addison-Wesley
[5]
Buckland, M., et al., 1999. Mapping Entry Vocabulary to Unfamiliar Metadata Vocabularies. D-Lib Magazine Vol.5 No.1 January
[6]
Dharanipragada, S., et al., 2001. Segmentation and detection at IBM: Hybrid statistical models and two-tiered clustering. Topic detection and Tracking: Event-Based Information Organization. Kluwer
[7]
Franz, M., McCarley, J. S., Roukos, S., 1999. Audio-Indexing for Broadcast News, Proceedings of the Seventh Text Retrieval Conference, pp. 115--119
[8]
Franz, M., McCarley, J. S., Ward, T., 2000. Ad Hoc, Cross-Language and Spoken Document Retrieval at IBM, Proceedings of the Eight Text Retrieval Conference, pp. 391--398
[9]
Goel, V. and Byrne, W., 1999. Task dependent loss functions in speech recognition: A-star search over recognition lattices. Proc. European Conf. On Speech and Communication and Technology. V. 3, p. 1243--1246
[10]
Jelinek, F., 1998. Statistical Methods for Speech Recognition. MIT Press: Cambridge
[11]
Johnson, S. E., Jourlin, P., Sparck Jones, K., Woodland, P. C., 1999. Spoken Document Retrieval. The Eighth Text Retrieval Conference TREC-8, Cambridge University, Nov. See Also http://trec.nist.gov
[12]
Kubala, F. and R. Schwartz and R. Stone and R. Weischedel, 1998. Named entity extraction from speech, in Proceedings of DARPA Broadcast News Transcription and Understanding Workshop, (Lansdowne, VA), February
[13]
Merlino, A., and Maybury, M., 1999. An Empirical Study of the Optimal Presentation of Multimedia Summaries of Broadcast News. Mani, I., and Maybury, M. (eds.), Automated Text Summarization. MIT Press. pp. 391--401
[14]
Oard, Doug, 2001. The CLEF 2001 Interactive Track. CLEF-2001 Workshop in Darmstadt, Germany. http://www.glue.umd.edu/~oard/research.html
[15]
Ramabhadran, B., Gao, Y., and Picheny, M., 2000. Dynamic selection of feature spaces for robust speech recognition. ICSPL
[16]
Ulargiu, Barbara. 2000 Accessibility of Oral History Collections: An investigation of current practices and future developments. Masters Thesis, University of Sheffield, September, 2000
[17]
Young, S., 1996. "A review of large-vocabulary continuous-speech recognition", IEEE Signal Processing Magazine, pp. 45--57, Sep
[18]
Zweig, G., et al., 2001. The IBM 2001 Conversational Speech Recognition System, The 2001 NIST Hub-5 Evaluation Workshop, May

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
July 2002
448 pages
ISBN:1581135130
DOI:10.1145/544220
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 July 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cataloging
  2. oral history
  3. research agenda

Qualifiers

  • Article

Conference

JCDL02
Sponsor:
JCDL02: Joint Conference on Digital Libraries 2002
July 14 - 18, 2002
Oregon, Portland, USA

Acceptance Rates

JCDL '02 Paper Acceptance Rate 69 of 240 submissions, 29%;
Overall Acceptance Rate 415 of 1,482 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)4
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media