research-article

Sasayaki: augmented voice web browsing experience

Authors:

Masatomo Kobayashi,

Hironobu Takagi,

Chieko AsakawaAuthors Info & Claims

CHI '11: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Pages 2769 - 2778

https://doi.org/10.1145/1978942.1979353

Published: 07 May 2011 Publication History

Abstract

Auditory user interfaces have great Web-access potential for billions of people with visual impairments, with limited literacy, who are driving, or who are otherwise unable to use a visual interface. However a sequential speech-based representation can only convey a limited amount of information. In addition, typical auditory user interfaces lose the visual cues such as text styles and page structures, and lack effective feedback about the current focus. To address these limitations, we created Sasayaki (from whisper in Japanese), which augments the primary voice output with a secondary whisper of contextually relevant information, automatically or in response to user requests. It also offers new ways to jump to semantically meaningful locations. A prototype was implemented as a plug-in for an auditory Web browser. Our experimental results show that the Sasayaki can reduce the task completion times for finding elements in webpages and increase satisfaction and confidence.

Supplementary Material

index.html (index.html)

Slides from the presentation

Download
.93 KB

Audio only (1979353.mp3)

Download
7.84 MB

Video (1979353.mp4)

Download
108.55 MB

References

[1]

Internet World Stats, World Internet Users and Population Stats, http://www.internetworldstats.com/stats.htm.

[2]

UNESCO, International Literacy Statistics: A Review of Concepts, Methodology and Current Data, http://www.uis.unesco.org/template/pdf/Literacy/LiteracyReport2008.pdf.

[3]

WHO, Fact sheet of visual impairment and blindness, http://www.who.int/mediacentre/factsheets/fs282/en/.

[4]

ITU. Measuring the Information Society 2010. http://www.itu.int/ITU-D/ict/publications/idi/2010/.

[5]

Takagi, H., Saito, S., Fukuda, K. and Asakawa, C. Analysis of navigability of Web applications for improving blind usability. ACM Trans. Comp.-Hum. Interact 14:3 (2007), 13.

Digital Library

[6]

Barnicle, K. Usability testing with screen reading technology in a Windows environment. In Proc. CUU 2000, ACM Press (2000), 102--109.

Digital Library

[7]

Lazar, J., Allen, A.and Kleinman, J. and Malarkey, C. What Frustrates Screen Reader Users on the Web: A Study of 100 Blind Users. International Journal of Human-Computer Interaction 22:3 (2007), 247--269.

[8]

Maes, P. 1994. Agents that reduce work and information overload. Communications of. ACM 37, 7 (1994), 30--40.

Digital Library

[9]

Bederson, B. B. Audio augmented reality: a prototype automated tour guide.,In Proc. CHI 1995, ACM Press, (1995), 210--211.

Digital Library

[10]

Sawhney, N. and Schmandt, C. Nomadic radio: speech and audio interaction for contextual messaging in nomadic environments. ACM Trans. Comput.-Hum. Interact. (7:3), (2000), 353--383.

Digital Library

[11]

Eckel, G. Immersive Audio-Augmented Environments: The LISTEN Project, Fifth Framework Programme, Creating a user-friendly information society (IST), (2001), 571.

Digital Library

[12]

Kalantari, L., Hatala, M. and Willms, J. Using semantic web approach in augmented audio reality system for museum visitors. In Proc. WWW 2004, ACM Press, (2004), 386--387.

Digital Library

[13]

Miyashita, T., Meier, P., Tachikawa, T., Orlic, S., Eble, T., Scholz, V., Gapel, A., Gerl, O., Arnaudov, S. and Lieberknecht, S. An Augmented Reality museum guide. In Proc. ISMAR 2008, IEEE Computer Society (2008),103--106.

Digital Library

[14]

Shoval, S., Borenstein, J. and Koren, Y. The Navbelt - A Computerized Travel Aid for the Blind Based on Mobile Robotics Technology. IEEE Trans. on Biomedical Engineering (45:11) (1998), 1376--1386.

[15]

Jones, M., Jones, S., Bradley, G., Warren, N., Bainbridge, D. and Holmes, G. ONTRACK: Dynamically adapting music playback to support navigation. Personal and Ubiquitous Computing (12:7), (2008), 513--525.

Digital Library

[16]

Stylos, J., Myers, B. A. and Faulring, A. Citrine: providing intelligent copy-and-paste. In Proc. UIST 2004, ACM Press (2004), 185--188.

Digital Library

[17]

Wagner, E. J. and Lieberman, H. Supporting user hypotheses in problem diagnosis. In Proc. IUI 2004, ACM Press (2004), 30--37.

Digital Library

[18]

Roth, P., Petrucci, L., Pun, T., and Assimacopouls, A. Auditory browser for blind and visually impaired users, In Proc. CHI 1999. ACM Press (1999), 218--219.

Digital Library

[19]

Yu, W., McAllister, G., Strain, P., Kuber, R. and Murphy, E. Improving web accessibility using content-aware plug-ins. In Proc. CHI 2005, ACM Press (2005), 1893--1896.

Digital Library

[20]

Dontcheva, M., Drucker, S. M., Wade, G., Salesin, D., and Cohen, M. F. Summarizing personal web browsing sessions. In Proc. UIST 2006. ACM Press (2006), 115--124.

Digital Library

[21]

Hartmann, M., Schreiber, D. and Muhlhauser, M. AUGUR: providing context-aware interaction support. In Proc. of symposium on Engineering interactive computing systems, ACM Press (2009), 123--132.

Digital Library

[22]

Parente, P. Clique: a conversant, task-based audio display for GUI applications. SIGACCESS Access. Comput. 84 (2006), 34--37.

Digital Library

[23]

Mahmud, J. U., Borodin, Y., and Ramakrishnan, I. V. Csurf: a context-driven non-visual web-browser. In Proc. WWW 2007. ACM Press (2007), 31--40.

Digital Library

[24]

Borodin, Y., Bigham, J. P., Raman, R. and Ramakrishnan, I. V. What's new? Making web page updates accessible. In Proc. ASSETS 2008, ACM Press (2008), 145--152.

Digital Library

[25]

Yesilada, Y., Stevens, R., Harper, S. and Goble, C. Evaluating DANTE: Semantic transcoding for visually disabled users. ACM Trans. Comput.-Hum. Interact. (14:3), (2007), 14.

Digital Library

[26]

Harper, S. and Patel, N. Gist summaries for visually impaired surfers. In Proc. ASSETS 2005, ACM Press (2005), 90--97.

Digital Library

[27]

Miyashita, H., Sato, D., Takagi, H. and Asakawa, C. aiBrowser for multimedia: introducing multimedia content accessibility for visually impaired users. In Proc.ASSETS 2007, ACM Press (2007), 91--98.

Digital Library

[28]

Lunn, D., Bechhofer, S. and Harper, S. The SADIe transcoding platform. In Proceedings of the 2008 international cross-disciplinary conference on Web accessibility (W4A), ACM Press (2008), 128--129.

Digital Library

[29]

Takagi, H., Kawanaka, S., Kobayashi, M., Itoh, T., and Asakawa, C. Social accessibility: achieving accessibility through collaborative metadata authoring. In Proc. ASSETS 2008. ACM Press (2008), 193--200.

Digital Library

[30]

Chen, C. L. and Raman, T. V. AxsJAX: a talking translation bot using google IM: bringing web-2.0 applications to life. In Proc. the 2008 international cross-disciplinary conference on Web accessibility (W4A), ACM Press (2008), 54--56.

Digital Library

[31]

Goble, C., Harper, S., and Stevens, R. The travails of visually impaired web travellers. In Proceedings of the Eleventh ACM on Hypertext and Hypermedia. ACM Press (2000), 1--10.

Digital Library

[32]

Eclipse ACTF Accessibility Internet Browser, http://www.eclipse.org/actf/downloads/tools/aiBrowser/.

[33]

Kawanaka, S., Borodin, Y., Bigham, J. P., Lunn, D., Takagi, H. and Asakawa, C. Accessibility commons: a metadata infrastructure for web accessibility. In Proc. ASSETS 2008, ACM Press (2008), 153--160.

Digital Library

[34]

Kanayama, H., Nasukawa, T. and Watanabe, H. Deeper sentiment analysis using machine translation technology. In Proc. of the 20th international conference on Computational Linguistic', Association for Computational Linguistics (2004), 494.

Digital Library

[35]

Shaojian Zhu, Daisuke Sato, Hironobu Takagi, and Chieko Asakawa. Sasayaki: an augmented voice-based web browsing experience. In Proc. ASSETS 2010. ACM Press (2008), 279--280.

Digital Library

[36]

Wilson, J., Walker, B. N., Lindsay, J., Cambias, C., and Dellaert, F. SWAN: System for Wearable Audio Navigation. In Proc. ISWC 2007. IEEE Computer Society (2007), 1--8.

Digital Library

Cited By

Al-Thani DAqle A(2023)Evaluating Search Results Overviews and Previews with Visually Impaired UsersProceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments10.1145/3594806.3596523(680-685)Online publication date: 5-Jul-2023
https://dl.acm.org/doi/10.1145/3594806.3596523
Das MMcHugh TPiper AGergle D(2022)Co11ab: Augmenting Accessibility in Synchronous Collaborative Writing for People with Vision ImpairmentsProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3501918(1-18)Online publication date: 29-Apr-2022
https://dl.acm.org/doi/10.1145/3491102.3501918
Das MPiper AGergle D(2022)Design and Evaluation of Accessible Collaborative Writing Techniques for People with Vision ImpairmentsACM Transactions on Computer-Human Interaction10.1145/348016929:2(1-42)Online publication date: 16-Jan-2022
https://dl.acm.org/doi/10.1145/3480169
Show More Cited By

Index Terms

Sasayaki: augmented voice web browsing experience

Index terms have been assigned to the content through auto-classification.

Recommendations

Sasayaki: an augmented voice-based web browsing experience
ASSETS '10: Proceedings of the 12th international ACM SIGACCESS conference on Computers and accessibility

While the usability of voice-based Web navigation has been steadily improving, it is still not as easy for users with visual impairments as it is for sighted users. One reason is that sequential voice representation can only convey a limited amount of ...
Essential components of mobile web accessibility
W4A '13: Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility

The Web Accessibility Initiative (WAI) of the World Wide Web Consortium (W3C) develops strategies, guidelines, and resources to make the Web accessible to people with disabilities. This includes ensuring that core web technologies such as HTML and CSS ...
Handsfree for Web: A Google Chrome extension to browse the web via voice commands
W4A '19: Proceedings of the 16th International Web for All Conference

The predominance of exclusively manual interaction required when interacting with a browser and websites is presented as a restrictive condition. This limitation was the starting point and the main motivation for the development of a web navigation ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CHI '11: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

May 2011

3530 pages

ISBN:9781450302289

DOI:10.1145/1978942

General Chair:
Desney Tan
Microsoft Research
,
Program Chairs:
Geraldine Fitzpatrick
Vienna University of Technology
,
Carl Gutwin
University of Saskatchewan
,
Bo Begole
PARC
,
Wendy A. Kellogg
IBM Research

Copyright © 2011 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGCHI: ACM Special Interest Group on Computer-Human Interaction

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 May 2011

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

CHI '11

Sponsor:

SIGCHI

CHI '11: CHI Conference on Human Factors in Computing Systems

May 7 - 12, 2011

BC, Vancouver, Canada

Acceptance Rates

CHI '11 Paper Acceptance Rate 410 of 1,532 submissions, 27%;

Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

Upcoming Conference

CHI 2025

Sponsor:
sigchi

ACM CHI Conference on Human Factors in Computing Systems

April 26 - May 1, 2025

Yokohama , Japan

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

17
Total Citations
View Citations
1,283
Total Downloads

Downloads (Last 12 months)14
Downloads (Last 6 weeks)2

Reflects downloads up to 22 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Al-Thani DAqle A(2023)Evaluating Search Results Overviews and Previews with Visually Impaired UsersProceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments10.1145/3594806.3596523(680-685)Online publication date: 5-Jul-2023
https://dl.acm.org/doi/10.1145/3594806.3596523
Das MMcHugh TPiper AGergle D(2022)Co11ab: Augmenting Accessibility in Synchronous Collaborative Writing for People with Vision ImpairmentsProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3501918(1-18)Online publication date: 29-Apr-2022
https://dl.acm.org/doi/10.1145/3491102.3501918
Das MPiper AGergle D(2022)Design and Evaluation of Accessible Collaborative Writing Techniques for People with Vision ImpairmentsACM Transactions on Computer-Human Interaction10.1145/348016929:2(1-42)Online publication date: 16-Jan-2022
https://dl.acm.org/doi/10.1145/3480169
Aylett MClark LCowan BTorre I(2021)Building and Designing Expressive Speech SynthesisThe Handbook on Socially Interactive Agents10.1145/3477322.3477329(173-212)Online publication date: 10-Sep-2021
https://dl.acm.org/doi/10.1145/3477322.3477329
Wang RChen ZZhang MLi ZLiu ZDang ZYu CChen XKitamura YQuigley AIsbister KIgarashi TBjørn PDrucker S(2021)Revamp: Enhancing Accessible Information Seeking Experience of Online Shopping for Blind or Low Vision UsersProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445547(1-14)Online publication date: 6-May-2021
https://dl.acm.org/doi/10.1145/3411764.3445547
Fazal MFerguson SJohnston A(2020)Evaluation of Information Comprehension in Concurrent Speech-based DesignsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/340946316:4(1-19)Online publication date: 17-Dec-2020
https://dl.acm.org/doi/10.1145/3409463
Das MGergle DPiper A(2019)"It doesn't win you friends"Proceedings of the ACM on Human-Computer Interaction10.1145/33592933:CSCW(1-26)Online publication date: 7-Nov-2019
https://dl.acm.org/doi/10.1145/3359293
Clark LDoyle PGaraialde DGilmartin ESchlögl SEdlund JAylett MCabral JMunteanu CEdwards JR Cowan B(2019)The State of Speech in HCI: Trends, Themes and ChallengesInteracting with Computers10.1093/iwc/iwz01631:4(349-371)Online publication date: 11-Sep-2019
https://doi.org/10.1093/iwc/iwz016
Fazal MFerguson SJohnston ACunningham SPicking R(2018)Investigating Concurrent Speech-based Designs for Information CommunicationProceedings of the Audio Mostly 2018 on Sound in Immersion and Emotion10.1145/3243274.3243284(1-8)Online publication date: 12-Sep-2018
https://dl.acm.org/doi/10.1145/3243274.3243284
Aqle AAl-Thani DJaoua A(2018)Conceptual Interactive Search Engine Interface for Visually Impaired Web Users2018 IEEE/ACS 15th International Conference on Computer Systems and Applications (AICCSA)10.1109/AICCSA.2018.8612874(1-6)Online publication date: Oct-2018
https://doi.org/10.1109/AICCSA.2018.8612874
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents