skip to main content
10.1145/3041021.3054168acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

HiText: Text Reading with Dynamic Salience Marking

Published: 03 April 2017 Publication History

Abstract

The staggering amounts of content readily available to us via digital channels can often appear overwhelming. While much research has focused on aiding people at selecting relevant articles to read, only few approaches have been developed to assist readers in more efficiently reading an individual text. In this paper, we present HiText, a simple yet effective way of dynamically marking parts of a document in accordance with their salience. Rather than skimming a text by focusing on randomly chosen sentences, students and other readers can direct their attention to sentences determined to be important by our system. For this, we rely on a deep learning-based sentence ranking method. Our experiments show that this results in marked increases in user satisfaction and reading efficiency, as assessed using TOEFL-style reading comprehension tests.

References

[1]
F. Ahmed, Y. Borodin, Y. Puzis, and I. V. Ramakrishnan. Why read if you can skim: Towards enabling faster screen reading. In Proc. W4A, pages 39:1--39:10, 2012.
[2]
P. L. Carrell. Facilitating esl reading by teaching text structure. TESOL Quarterly, 19(4):727--752, 1985.
[3]
J. Chen, N. Tandon, C. D. Hariman, and G. de Melo. WebBrain: Joint neural learning of large-scale commonsense knowledge. In Proc. ISWC, 2016.
[4]
E. H. Chi, L. Hong, M. Gumbrecht, and S. K. Card. Scent Highlights: Highlighting conceptually-related sentences during reading. In Proc. IUI, 2005.
[5]
J. Chung, Ç. Gülçehre, K. Cho, and Y. Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling. CoRR, abs/1412.3555, 2014.
[6]
D. Das and A. F. T. Martins. A survey on automatic text summarization. Technical report, Carnegie Mellon University, 2007.
[7]
G. de Melo. Inducing conceptual embedding spaces from Wikipedia. In Proc. WWW 2017 (Cognitive Computing Track). ACM, 2017.
[8]
G. B. Duggan and S. J. Payne. Skim reading by satisficing: Evidence from eye tracking. In Proc. CHI 2011, pages 1141--1150. ACM, 2011.
[9]
G. Erkan and D. R. Radev. Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Int. Res., 22(1):457--479, Dec. 2004.
[10]
D. J. Gillick. The Elements of Automatic Summarization. PhD thesis, EECS Department, University of California, Berkeley, May 2011.
[11]
S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Comput., 9(9):1735--1780, Nov. 1997.
[12]
N. Kalchbrenner and P. Blunsom. Recurrent continuous translation models. In Proc. EMNLP, pages 1700--1709. ACL, 2013.
[13]
D. Kingery and R. Furuta. Skimming electronic newspaper headlines: a study of typeface, point size, screen resolution, and monitor size. Inf. Process. Manage., 33(5):685--696, 1997.
[14]
R. Kiros, Y. Zhu, R. Salakhutdinov, R. S. Zemel, A. Torralba, R. Urtasun, and S. Fidler. Skip-thought vectors. CoRR, abs/1506.06726, 2015.
[15]
X. Long, C. Gan, and G. de Melo. Video captioning with multi-faceted attention. CoRR, abs/1612.00234, 2016.
[16]
E. Loza Mencía, G. de Melo, and J. Nam. Medical concept embeddings via labeled background corpora. In Proc. LREC, Paris, France, 2016.
[17]
H. P. Luhn. The automatic creation of literature abstracts. IBM Journal of Research and Development, 2(2):159--165, 1958.
[18]
M.-T. Luong, H. Pham, and C. D. Manning. Effective approaches to attention-based neural machine translation. In Proc. EMNLP, pages 1412--1421, 2015.
[19]
A. Nenkova and K. McKeown. A survey of text summarization techniques. In Mining Text Data. 2012.
[20]
A. Nenkova and L. Vanderwende. The impact of frequency on summarization. Microsoft Research, Redmond, Washington, Tech. Rep. MSR-TR-2005-101, 2005.
[21]
V. d. Paiva, D. Oliveira, S. Higuchi, A. Rademaker, and G. de Melo. Exploratory information extraction from a historical dictionary. In Proc. Workshop on Digital Humanities and e-Science at the 10th IEEE International Conference on e-Science, 2014.
[22]
E. Pitler and A. Nenkova. Revisiting readability: A unified framework for predicting text quality. In Proc. EMNLP, pages 186--195, 2008.
[23]
D. N. Rapp and P. Broek. Dynamic text comprehension: An integrative view of reading. Current Directions in Psychological Science, 14(5):276--279, 2005.
[24]
I. Sutskever, O. Vinyals, and Q. V. V. Le. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems 27, pages 3104--3112. 2014.
[25]
M. Theobald, J. Siddharth, and A. Paepcke. Spotsigs: Robust and efficient near duplicate detection in large web collections. In Proc. SIGIR 2008, 2008.
[26]
W. M. Vagias. Likert-type scale response anchors. Clemson International Institute for Tourism & Research Development, 2006.
[27]
O. Čulo and G. de Melo. Source-Path-Goal: Investigating the cross-linguistic potential of frame-semantic text analysis. it - Information Technology, 54, 2012.
[28]
M. Walsh. The 'textual shift': examining the reading process with print, visual and multimodal texts. Australian Journal of Language and Literacy, 29(1):24--37, 2006.
[29]
Y. Wang, Z. Ren, M. Theobald, M. Dylla, and G. de Melo. Summary generation for temporal extractions. In Proc. DEXA, 2016.
[30]
A. J. Wecker, J. Lanir, O. Mokryn, E. Minkov, and T. Kuflik. Semantize: Visualizing the sentiment of individual document. In Proc. AVI 2014, pages 385--386. ACM, 2014.
[31]
Q. Yang, R. J. Passonneau, and G. de Melo. PEAK: Pyramid evaluation via automated knowledge extraction. In Proc. AAAI. AAAI Press, 2016.
[32]
J. S. Yi. QnDReview: Read 100 CHI papers in 7 hours. In CHI '14 Extended Abstracts, pages 805--814. ACM, 2014.
[33]
K. X. Yong, K. Rajdev, T. J. Shakespeare, A. P. Leff, and S. J. Crutch. Facilitating text reading in posterior cortical atrophy. Neurology, 85(4):339--348, 2015.
[34]
Y. Zhu, R. Kiros, R. S. Zemel, R. Salakhutdinov, R. Urtasun, A. Torralba, and S. Fidler. Aligning books and movies: Towards story-like visual explanations by watching movies and reading books. CoRR, abs/1506.06724, 2015.

Cited By

View all

Index Terms

  1. HiText: Text Reading with Dynamic Salience Marking

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion
      April 2017
      1738 pages
      ISBN:9781450349147

      Sponsors

      • IW3C2: International World Wide Web Conference Committee

      In-Cooperation

      Publisher

      International World Wide Web Conferences Steering Committee

      Republic and Canton of Geneva, Switzerland

      Publication History

      Published: 03 April 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. natural language semantics
      2. text skimming
      3. text visualization

      Qualifiers

      • Research-article

      Funding Sources

      • National Natural Science Foundation of China
      • National Basic Research Program of China

      Conference

      WWW '17
      Sponsor:
      • IW3C2

      Acceptance Rates

      WWW '17 Companion Paper Acceptance Rate 164 of 966 submissions, 17%;
      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)39
      • Downloads (Last 6 weeks)5
      Reflects downloads up to 09 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media