Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleFebruary 2016
An Interactive Data Repository with Visual Analytics
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 17, Issue 2Pages 37–41https://doi.org/10.1145/2897350.2897355Scientific data repositories have historically made data widely accessible to the scientific community, and have led to better research through comparisons, reproducibility, as well as further discoveries and insights. Despite the growing importance and ...
- columnSeptember 2015
A Framework for Collocation Error Correction in Web Pages and Text Documents
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 17, Issue 1Pages 14–23https://doi.org/10.1145/2830544.2830548Much of the English in text documents today comes from nonnative speakers. Web searches are also conducted very often by non-native speakers. Though highly qualified in their respective fields, these speakers could potentially make errors in collocation,...
- columnSeptember 2015
Question Quality in Community Question Answering Forums: a survey
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 17, Issue 1Pages 8–13https://doi.org/10.1145/2830544.2830547Community Question Answering websites (CQA) offer a new opportunity for users to provide, search and share knowledge. Although the idea of receiving a direct, targeted response to a question sounds very attractive, the quality of the question itself can ...
- columnMay 2015
A Social Formalism and Survey for Recommender Systems
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 16, Issue 2Pages 20–37https://doi.org/10.1145/2783702.2783705This paper presents a general formalism for Recommender Systems based on Social Network Analysis. After introducing the classical categories of recommender systems, we present our Social Filtering formalism and show that it extends association rules, ...
- columnMay 2015
Patent Mining: A Survey
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 16, Issue 2Pages 1–19https://doi.org/10.1145/2783702.2783704Patent documents are important intellectual resources of protecting interests of individuals, organizations and companies. Different from general web documents, patent documents have a well-defined format including frontpage, description, nclaims, and ...
-
- research-articleSeptember 2014
Contextual crowd intelligence
- Beng Chin Ooi,
- Kian Lee Tan,
- Quoc Trung Tran,
- James W.L. Yip,
- Gang Chen,
- Zheng Jye Ling,
- Thi Nguyen,
- Anthony K.H. Tung,
- Meihui Zhang
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 16, Issue 1Pages 39–46https://doi.org/10.1145/2674026.2674032Most data analytics applications are industry/domain specific, e.g., predicting patients at high risk of being admitted to intensive care unit in the healthcare sector or predicting malicious SMSs in the telecommunication sector. Existing solutions are ...
- research-articleSeptember 2014
Change detection in streaming data in the era of big data: models and issues
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 16, Issue 1Pages 30–38https://doi.org/10.1145/2674026.2674031Big Data is identified by its three Vs, namely velocity, volume, and variety. The area of data stream processing has long dealt with the former two Vs velocity and volume. Over a decade of intensive research, the community has provided many important ...
- research-articleSeptember 2014
What is Tumblr: a statistical overview and comparison
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 16, Issue 1Pages 21–29https://doi.org/10.1145/2674026.2674030Tumblr, as one of the most popular microblogging platforms, has gained momentum recently. It is reported to have 166.4 millions of users and 73.4 billions of posts by January 2014. While many articles about Tumblr have been published in major press, ...
- research-articleSeptember 2014
Twitter analytics: a big data management perspective
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 16, Issue 1Pages 11–20https://doi.org/10.1145/2674026.2674029With the inception of the Twitter microblogging platform in 2006, a myriad of research efforts have emerged studying different aspects of the Twittersphere. Each study exploits its own tools and mechanisms to capture, store, query and analyze Twitter ...
- columnJune 2014
Mining social media with social theories: a survey
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 15, Issue 2Pages 20–29https://doi.org/10.1145/2641190.2641195The increasing popularity of social media encourages more and more users to participate in various online activities and produces data in an unprecedented rate. Social media data is big, linked, noisy, highly unstructured and in- complete, and differs ...
- columnJune 2014
Clustering high dimensional data: examining differences and commonalities between subspace clustering and text clustering - a position paper
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 15, Issue 2Pages 1–8https://doi.org/10.1145/2641190.2641192The goal of this position paper is to contribute to a clear understanding of the commonalities and differences between subspace clustering and text clustering. Often text data is foisted as an ideal fit for subspace clustering due to its high ...
- columnMarch 2014
Research issues in outlier detection for data streams
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 15, Issue 1Pages 33–40https://doi.org/10.1145/2594473.2594479In applications, such as sensor networks and power usage monitoring, data are in the form of streams, each of which is an infinite sequence of data points with explicit or implicit timestamps and has special characteristics, such as transiency, ...
- columnMarch 2014
On the Management and Analysis of Our LifeSteps
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 15, Issue 1Pages 23–32https://doi.org/10.1145/2594473.2594478Huge volumes of location information are available nowadays due to the rapid growth of positioning devices (GPS-enabled smartphones and tablets, on-board navigation systems in vehicles, vessels and planes, smart chips for animals, etc.). In the near ...
- columnMarch 2014
Ensembles for unsupervised outlier detection: challenges and research questions a position paper
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 15, Issue 1Pages 11–22https://doi.org/10.1145/2594473.2594476Ensembles for unsupervised outlier detection is an emerging topic that has been neglected for a surprisingly long time (although there are reasons why this is more difficult than supervised ensembles or even clustering ensembles). Aggarwal recently ...
- columnMarch 2014
Comprehensible classification models: a position paper
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 15, Issue 1Pages 1–10https://doi.org/10.1145/2594473.2594475The vast majority of the literature evaluates the performance of classification models using only the criterion of predictive accuracy. This paper reviews the case for considering also the comprehensibility (interpretability) of classification models, ...
- research-articleApril 2013
Discovering interesting information with advances in web technology
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 14, Issue 2Pages 63–81https://doi.org/10.1145/2481244.2481255The Web is a steadily evolving resource comprising much more than mere HTML pages. With its ever-growing data sources in a variety of formats, it provides great potential for knowledge discovery. In this article, we shed light on some interesting ...
- research-articleApril 2013
Studying the source code of scientific research
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 14, Issue 2Pages 59–62https://doi.org/10.1145/2481244.2481254Just as inspecting the source code of programs tells us a lot about the process of programming, inspecting the "source code" of scientific papers informs on the process of scientific writing. We report on our study of the source of tens of thousands of ...
- research-articleApril 2013
Outlier ensembles: position paper
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 14, Issue 2Pages 49–58https://doi.org/10.1145/2481244.2481252Ensemble analysis is a widely used meta-algorithm for many data mining problems such as classification and clustering. Numerous ensemble-based algorithms have been proposed in the literature for these problems. Compared to the clustering and ...
- research-articleApril 2013
Mining large streams of user data for personalized recommendations
ACM SIGKDD Explorations Newsletter (SIGKDD), Volume 14, Issue 2Pages 37–48https://doi.org/10.1145/2481244.2481250The Netflix Prize put the spotlight on the use of data mining and machine learning methods for predicting user preferences. Many lessons came out of the competition. But since then, Recommender Systems have evolved. This evolution has been driven by the ...