skip to main content
10.1145/3055635.3056576acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlcConference Proceedingsconference-collections
research-article

Feature Extraction Method for Lip-reading under Variant Lighting Conditions

Published: 24 February 2017 Publication History

Abstract

Automatic lip-reading is a technique of understanding the uttered speech by visually interpreting the lip movement of the speaker. with the development of the lip-reading, more and more related technologies are proposed. However, the current research of lip-reading is mainly conducted under the ideal lighting conditions and there is few researchers focus on the lip-reading technique under variant lighting conditions. For this problem, this paper proposes a new method of lip feature extraction under variant lighting conditions. The method consists of a preprocessing chain of illumination normalization and improved LBP features, which can improve the recognition rate of lip-reading under variant lighting conditions from two aspects. Experiments show that the lip feature extraction method proposed by this paper is effective.

References

[1]
Zhao, G., Barnard, M., & Pietikainen, M. (2009). Lipreading with local spatiotemporal descriptors. IEEE Transactions on Multimedia, 11(7), 1254--1265.
[2]
Aleksic, P. S., & Katsaggelos, A. K. (2006). Audio-visual biometrics. Proceedings of the IEEE, 94(11), 2025--2044.
[3]
Almajai, I., Cox, S., Harvey, R., & Lan, Y. (2016). Improved Speaker Independent Lipreading using Speaker Adaptive Training and Deep Neural Networks. ICASSP.
[4]
Bear, H. L., Harvey, R. W., & Lan, Y. (2015). Finding phonemes: improving machine lip-reading. Joint Conference on Facial Analysis, Animation and Auditory-Visual Speech Processing.
[5]
ALan, Y., Harvey, R., Theobald, B. J., Bowden, R., & Ong, E. J. (2010). Improving visual features for lip-reading. International Conference on Auditory-Visual Speech Processing (pp. 142--147).
[6]
Bakry, A., & Elgammal, A. (2013). Mkpls: manifold kernel partial least squares for lipreading and speaker identification. 684--691.
[7]
Zhao, G., & Pietikainen, M. (2013). Visual Speaker Identification with Spatiotemporal Directional Features. Image Analysis and Recognition. Springer Berlin Heidelberg.
[8]
M.Z. Ibrahim, & D.J. Mulvaney. (2015). Geometrical-based lip-reading using template probabilistic multi-dimension dynamic time warping ☆.Journal of Visual Communication & Image Representation, 30(C), 219--233.
[9]
Lucey, P., & Sridharan, S. (2008). A visual front-end for a continuous pose-invariant lipreading system. International Conference on Signal Processing and Communication Systems (pp. 1--6). IEEE.
[10]
Shin, J., Jin, L., & Kim, D. (2011). Real-time lip reading system for isolated korean word recognition. Pattern Recognition, 44(3), 559--571.
[11]
Ma X, Zhang H. Lip segmentation algorithm based on bi-color space{C}// Control Conference. IEEE, 2015.
[12]
Morade, S. S., & Patnaik, S. (2015). Comparison of classifiers for lip reading with cuave and tulips database. Optik - International Journal for Light and Electron Optics, 126(24), 5753--5761.
[13]
Shi, Xiaoxing. "A New Time Wrapping Algorithm and Its Application on Neural Network Based Speech Recognition." Journal of Southeast Univwrsity (1999).

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMLC '17: Proceedings of the 9th International Conference on Machine Learning and Computing
February 2017
545 pages
ISBN:9781450348171
DOI:10.1145/3055635
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Southwest Jiaotong University

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 February 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Illumination Normalization
  2. Improved LBP
  3. Lip-reading

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICMLC 2017

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media