skip to main content
10.1145/2501511acmconferencesBook PagePublication PageskddConference Proceedingsconference-collections
IDEA '13: Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics
ACM2013 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
KDD' 13: The 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Chicago Illinois 11 August 2013
ISBN:
978-1-4503-2329-1
Published:
11 August 2013
Sponsors:
Recommend ACM DL
ALREADY A SUBSCRIBER?SIGN IN
Next Conference
Reflects downloads up to 09 Jan 2025Bibliometrics
Skip Abstract Section
Abstract

We have entered the era of big data. Massive datasets, surpassing terabytes and petabytes in size are now commonplace. They arise in numerous settings in science, government, and enterprises, and technology exists by which we can collect and store such massive amounts of information. Yet, making sense of these data remains a fundamental challenge. We lack the means to exploratively analyze databases of this scale. Currently, few technologies allow us to freely "wander" around the data, and make discoveries by following our intuition, or serendipity. While standard data mining aims at finding highly interesting results, it is typically computationally demanding and time consuming, thus may not be well-suited for interactive exploration of large datasets.

Interactive data mining techniques that aptly integrate human intuition, by means of visualization and intuitive human-computer interaction techniques, and machine computation support have been shown to help people gain significant insights into a wide range of problems. However, as datasets are being generated in larger volumes, higher velocity, and greater variety, creating effective interactive data mining techniques becomes a much harder task.

Skip Table Of Content Section
SESSION: Invited talks
research-article
Interactive visual analytics for high dimensional data

Many modern data sets can be represented in high dimensional vector spaces and have benefited from computational methods that utilize advanced techniques from numerical linear algebra and optimization. Visual analytics approaches have contributed ...

SESSION: Research papers
research-article
Building blocks for exploratory data analysis tools

Data exploration is largely manual and labor intensive. Although there are various tools and statistical techniques that can be applied to data sets, there is little help to identify what questions to ask of a data set, let alone what domain knowledge ...

research-article
Open Access
Methods for exploring and mining tables on Wikipedia

Knowledge bases extracted automatically from the Web present new opportunities for data mining and exploration. Given a large, heterogeneous set of extracted relations, new tools are needed for searching the knowledge and uncovering relationships of ...

research-article
One click mining: interactive local pattern discovery through implicit preference and performance learning

It is known that productive pattern discovery from data has to interactively involve the user as directly as possible. State-of-the-art toolboxes require the specification of sophisticated workflows with an explicit selection of a data mining method, ...

research-article
Lytic: synthesizing high-dimensional algorithmic analysis with domain-agnostic, faceted visual analytics

We present Lytic, a domain-independent, faceted visual analytic (VA) system for interactive exploration of large datasets. It combines a flexible UI that adapts to arbitrary character-separated value (CSV) datasets with algorithmic preprocessing to ...

research-article
A process-centric data mining and visual analytic tool for exploring complex social networks

Social scientists and observational scientists have a need to analyze complex network data sets. Examples of such exploratory tasks include: finding communities that exist in the data, comparing results from different graph mining algorithms, ...

research-article
Zips: mining compressing sequential patterns in streams

We propose a streaming algorithm, based on the minimal description length (MDL) principle, for extracting non-redundant sequential patterns. For static databases, the MDL-based approach that selects patterns based on their capacity to compress data ...

research-article
Augmenting MATLAB with semantic objects for an interactive visual environment

Analysis tools such as Matlab, R, and SAS support a myriad of built-in computational functions and various standard visualization techniques. However, most of them provide little interaction from visualizations mainly due to the fact that the tools ...

research-article
Online spatial data analysis and visualization system

With the exponential growth of the usage of web map services, the geo data analysis has become more and more popular. This paper develops an online spatial data analysis and visualization system, TerraFly GeoCloud, which facilitates end users to ...

research-article
Randomly sampling maximal itemsets

Pattern mining techniques generally enumerate lots of uninteresting and redundant patterns. To obtain less redundant collections, techniques exist that give condensed representations of these collections. However, the proposed techniques often rely on ...

research-article
Towards anytime active learning: interrupting experts to reduce annotation costs

Many active learning methods use annotation cost or expert quality as part of their framework to select the best data for annotation. While these methods model expert quality, availability, or expertise, they have no direct influence on any of these ...

research-article
Storygraph: extracting patterns from spatio-temporal data

Analysis of spatio-temporal data often involves correlating different events in time and location to uncover relationships between them. It is also desirable to identify different patterns in the data. Visualizing time and space in the same chart is not ...

Contributors
  • Georgia Institute of Technology
  • CISPA - Helmholtz Center for Information Security
  • Carnegie Mellon University

Index Terms

  1. Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Acceptance Rates

      IDEA '13 Paper Acceptance Rate 11 of 25 submissions, 44%;
      Overall Acceptance Rate 11 of 25 submissions, 44%
      YearSubmittedAcceptedRate
      IDEA '13251144%
      Overall251144%