skip to main content
10.1145/2390256.2390281acmotherconferencesArticle/Chapter ViewAbstractPublication PagesautomotiveuiConference Proceedingsconference-collections
research-article

On the design and evaluation of robust head pose for visual user interfaces: algorithms, databases, and comparisons

Published: 17 October 2012 Publication History

Abstract

An important goal in automotive user interface research is to predict a user's reactions and behaviors in a driving environment. The behavior of both drivers and passengers can be studied by analyzing eye gaze, head, hand, and foot movement, upper body posture, etc. In this paper, we focus on estimating head pose, which has been shown to be a good predictor of driver intent and a good proxy for gaze estimation, and provide a valuable head pose database for future comparative studies. Most existing head pose estimation algorithms are still struggling under large spatial head turns. Our method, however, relies on using facial features that are visible even during large spatial head turns to estimate head pose. The method is evaluated on the LISA-P Head Pose database, which has head pose data from on-road daytime and nighttime drivers of varying age, race, and gender; ground truth for head pose is provided using a motion capture system. In special regards to eye gaze estimation for automotive user interface study, the automatic head pose estimation technique presented in this paper can replace previous eye gaze estimation methods that rely on manual data annotation or be used in conjunction with them when necessary.

References

[1]
S. Ba and J.-M. Odobez. Evaluation of multiple cue head pose estimation algorithms in natural environements. In Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on, pages 1330--1333, july 2005.
[2]
J. Busby. 3d head scan, August 2012.
[3]
J. Chen and Q. Ji. 3d gaze estimation with a single camera without ir illumination. In Pattern Recognition, 2008. ICPR 2008. 19th International Conference on, pages 1--4, dec. 2008.
[4]
S. Cheng and M. Trivedi. Turn-intent analysis using body pose for intelligent driver assistance. Pervasive Computing, IEEE, 5(4):28--37, oct.-dec. 2006.
[5]
L. H. Christiansen, N. Y. Frederiksen, A. Ranch, and M. B. Skov. Investigating the effects of an advance warning in-vehicle system on behavior and attention in controlled driving. In Proceedings of the 3rd Internation Conference on Automotive User Interface and Interactive Vehicular Applications, pages 121--128, 2011.
[6]
J. Curin, M. Labsky, T. Macek, and J. Kleindienst. Dictating and editing short texts while driving: Distraction and task completion. In Proceedings of the 3rd Internation Conference on Automotive User Interface and Interactive Vehicular Applications, AutomotiveUI '11, pages 13--20, 2011.
[7]
D. F. Dementhon and L. S. Davis. Model-based object pose in 25 lines of code. International Journal of Computer Vision, 15:123--141, 1995.
[8]
A. Doshi and M. Trivedi. On the roles of eye gaze and head dynamics in predicting driver's intent to change lanes. Intelligent Transportation Systems, IEEE Transactions on, 10(3):453--462, sept. 2009.
[9]
L. Fletcher, L. Petersson, N. Barnes, D. Austin, and A. Zelinsky. A sign reading driver assistance system using eye gaze. In Robotics and Automation, 2005. ICRA 2005. Proceedings of the 2005 IEEE International Conference on, pages 4655--4660, april 2005.
[10]
L. Fletcher, L. Petersson, and A. Zelinsky. Road scene monotony detection in a fatigue management driver assistance system. In Intelligent Vehicles Symposium, 2005. Proceedings. IEEE, pages 484--489, june 2005.
[11]
A. Gee and R. Cipolla. Determining the gaze of faces in images. Image and Vision Computing, 12(10):639--647, 1994.
[12]
D. Hansen and Q. Ji. In the eye of the beholder: A survey of models for eyes and gaze. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 32(3):478--500, march 2010.
[13]
T. Horprasert, Y. Yacoob, and L. Davis. Computing 3-d head orientation from a monocular image sequence. In Automatic Face and Gesture Recognition, 1996., Proceedings of the Second International Conference on, pages 242--247, oct 1996.
[14]
T. Ishikawa, S. Baker, I. Matthews, and T. Kanade. Passive driver gaze tracking with active appearance models. In Proceedings of the 11th World Congress on Intelligent Transportation Systems, October 2004.
[15]
J. Jain and A. Jain. Displacement measurement and its application in interframe image coding. Communications, IEEE Transactions on, 29(12):1799--1808, dec 1981.
[16]
M. La Cascia and S. Sclaroff. Fast, reliable head tracking under varying illumination. In Computer Vision and Pattern Recognition, 1999. IEEE Computer Society Conference on., volume 1, pages 2 vol. (xxiii+637+663), 1999.
[17]
S. Martin, C. Tran, A. Tawari, J. Kwan, and M. M. Trivedi. Optical flow based head movement and gesture analyzer (ohmega). In Pattern Recognition (ICPR), 21st International Conference on, Nov. 2012.
[18]
S. Martin, C. Tran, and M. M. Trivedi. Optical flow based head movement and gesture analysis in automotive environment. In IEEE International Conference on Intelligent Transportation Systems-ITSC, Sept. 2012.
[19]
J. McCall, D. Wipf, M. Trivedi, and B. Rao. Lane change intent analysis using robust operators and sparse bayesian learning. Intelligent Transportation Systems, IEEE Transactions on, 8(3):431--440, sept. 2007.
[20]
E. Murphy-Chutorian and M. Trivedi. Hyhope: Hybrid head orientation and position estimation for vision-based driver head tracking. In Intelligent Vehicles Symposium, 2008 IEEE, pages 512--517, june 2008.
[21]
E. Murphy-Chutorian and M. Trivedi. Head pose estimation in computer vision: A survey. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 31(4):607--626, april 2009.
[22]
E. Murphy-Chutorian and M. Trivedi. Head pose estimation and augmented reality tracking: An integrated system and evaluation for monitoring driver awareness. Intelligent Transportation Systems, IEEE Transactions on, 11(2):300--311, june 2010.
[23]
J. Saragih, S. Lucey, and J. Cohn. Face alignment through subspace constrained mean-shifts. In Computer Vision, 2009 IEEE 12th International Conference on, pages 1034--1041, 29 2009-oct. 2 2009.
[24]
T. Sim, S. Baker, and M. Bsat. The cmu pose, illumination, and expression (pie) database. In Automatic Face and Gesture Recognition, 2002. Proceedings. Fifth IEEE International Conference on, pages 46--51, may 2002.
[25]
G. Slabaugh. Computing euler angles from a rotation matrix.
[26]
R. Valenti, N. Sebe, and T. Gevers. Combining head pose and eye location information for gaze estimation. Image Processing, IEEE Transactions on, 21(2):802--815, feb. 2012.
[27]
J.-G. Wang and E. Sung. Em enhancement of 3d head pose estimated by point at infinity. Image and Vision Computing, 25(12):1864--1874, 2007. The age of human computer interaction.
[28]
J. Wu and M. M. Trivedi. A two-stage head pose estimation framework and evaluation. Pattern Recognition, 41(3):1138--1158, 2008. Part Special issue: Feature Generation and Machine Learning for Robust Multimodal Biometrics.
[29]
H. Zhang, M. Smith, and R. Dufour. A final report of safety vehicles using adaptive interface technology: Visual distraction.

Cited By

View all

Index Terms

  1. On the design and evaluation of robust head pose for visual user interfaces: algorithms, databases, and comparisons

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AutomotiveUI '12: Proceedings of the 4th International Conference on Automotive User Interfaces and Interactive Vehicular Applications
    October 2012
    280 pages
    ISBN:9781450317511
    DOI:10.1145/2390256
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. database
    2. facial features
    3. head pose

    Qualifiers

    • Research-article

    Conference

    AutomotiveUI '12

    Acceptance Rates

    Overall Acceptance Rate 248 of 566 submissions, 44%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 29 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media