skip to main content
research-article

Foresight: recommending visual insights

Published: 01 August 2017 Publication History

Abstract

Current tools for exploratory data analysis (EDA) require users to manually select data attributes, statistical computations and visual encodings. This can be daunting for large-scale, complex data. We introduce Foresight, a system that helps the user rapidly discover visual insights from large high-dimensional datasets. Formally, an "insight" is a strong manifestation of a statistical property of the data, e.g., high correlation between two attributes, high skewness or concentration about the mean of a single attribute, a strong clustering of values, and so on. For each insight type, Foresight initially presents visualizations of the top k instances in the data, based on an appropriate ranking metric. The user can then look at "nearby" insights by issuing "insight queries" containing constraints on insight strengths and data attributes. Thus the user can directly explore the space of insights, rather than the space of data dimensions and visual encodings as in other visual recommender systems. Foresight also provides "global" views of insight space to help orient the user and ensure a thorough exploration process. Furthermore, Foresight facilitates interactive exploration of large datasets through fast, approximate sketching.

References

[1]
IBM Watson Analytics. https://www.ibm.com/analytics/watson-analytics/. Accessed: 2017-02-15.
[2]
MS PowerBI. https://powerbi.microsoft.com. Accessed: 2017-02-15.
[3]
C. C. Aggarwal. Outlier Analysis. Springer, 2013.
[4]
M. Charikar. Similarity estimation techniques from rounding algorithms. In ACM STOC, pages 380--388, 2002.
[5]
G. Cormode, M. N. Garofalakis, P. J. Haas, and C. Jermaine. Synopses for massive data: Samples, histograms, wavelets, sketches. Foundations and Trends in Databases Trends in Databases, 4(1--3):1--294, 2012.
[6]
C. G. Goetz, B. C. Tilley, et al. MDS-UPDRS: Scale presentation and clinimetric testing results. Movement Disorders, 23(15):2129--2170, 2008.
[7]
R. Nickerson. Confirmation bias: A ubiquitous phenomenon in many guises. Review of General Psychology, 2(2), 1998.
[8]
P. Pirolli and S. Card. The sensemaking process and leverage points for analyst technology as identified through cognitive task analysis. In Procs. International Conference on Intelligence Analysis, volume 5, pages 2--4, 2005.
[9]
T. Siddiqui, A. Kim, J. Lee, K. Karahalios, and A. Parameswaran. Effortless data exploration with zenvisage. Procs. VLDB, 10(4):457--468, 2016.
[10]
A. Tversky and D. Kahneman. Judgment under uncertainty: Heuristics and biases. In Utility, probability, and human decision making, pages 141--162. Springer, 1975.
[11]
G. Wills and L. Wilkinson. AutoVis: Automatic visualization. Information Visualization, 9(1):47--69, 2008.
[12]
K. Wongsuphasawat, D. Moritz, A. Anand, J. Mackinlay, B. Howe, and J. Heer. Voyager: Exploratory analysis via faceted browsing of visualization recommendations. IEEE TVCG (Proc. InfoVis), 2016.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 10, Issue 12
August 2017
427 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 August 2017
Published in PVLDB Volume 10, Issue 12

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)26
  • Downloads (Last 6 weeks)2
Reflects downloads up to 24 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media