Information systems

23 Results for: Book/Issue: WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining

Showing 1 - 20 of 23 Results
research-article
February 2008
Advertising keyword suggestion based on concept hierarchy
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 251–260, https://doi.org/10.1145/1341531.1341564

The increasing growth of the World Wide Web constantly enlarges the revenue generated by search engine advertising. Advertisers bid on keywords associated with their products to display their ads on the search result pages. Keyword suggestion methods ...
Metrics: Total Citations 70; Total Downloads 1,555; Last 12 Months 12; Last 6 weeks 0
research-article
February 2008
A holistic lexicon-based approach to opinion mining
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 231–240, https://doi.org/10.1145/1341531.1341561

One of the important types of information on the Web is the opinions expressed in the user generated content, e.g., customer reviews of products, forum posts, and blogs. In this paper, we focus on customer reviews of products. In particular, we study ...
Metrics: Total Citations 797; Total Downloads 5,916; Last 12 Months 126; Last 6 weeks 14
research-article
February 2008
Opinion spam and analysis
- Nitin Jindal,
- Bing Liu
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 219–230, https://doi.org/10.1145/1341531.1341560

Evaluative texts on the Web have become a valuable source of opinions on products, services, events, individuals, etc. Recently, many researchers have studied such opinion sources as product reviews, forum posts, and blogs. However, existing research ...
Metrics: Total Citations 885; Total Downloads 5,599; Last 12 Months 247; Last 6 weeks 24
research-article
February 2008
Can social bookmarking improve web search?
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 195–206, https://doi.org/10.1145/1341531.1341558

Social bookmarking is a recent phenomenon which has the potential to give us a great deal of data about pages on the web. One major question is whether that data can be used to augment systems like web search. To answer this question, over the past year ...
Metrics: Total Citations 308; Total Downloads 2,997; Last 12 Months 14; Last 6 weeks 1
research-article
February 2008
Finding high-quality content in social media
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 183–194, https://doi.org/10.1145/1341531.1341557

The quality of user-generated content varies drastically from excellent to abuse and spam. As the availability of such content increases, the task of identifying high-quality content sites based on user contributions -- social media sites -- becomes ...
Metrics: Total Citations 788; Total Downloads 22,337; Last 12 Months 674; Last 6 weeks 72
research-article
February 2008
On ranking controversies in wikipedia: models and evaluation
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 171–182, https://doi.org/10.1145/1341531.1341556

Wikipedia is a very large and successful Web 2.0 example. As the number of Wikipedia articles and contributors grows at a very fast pace, there are also increasing disputes occurring among the contributors. Disputes often happen in articles with ...
Metrics: Total Citations 61; Total Downloads 1,133; Last 12 Months 26; Last 6 weeks 2
research-article
February 2008
Understanding temporal aspects in document classification
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 159–170, https://doi.org/10.1145/1341531.1341554

Due to the increasing amount of information present on the Web, Automatic Document Classification (ADC) has become an important research topic. ADC usually follows a standard supervised learning strategy, where we first build a model using preclassified ...
Metrics: Total Citations 21; Total Downloads 696; Last 12 Months 7; Last 6 weeks 0
research-article
February 2008
Personal name classification in web queries
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 149–158, https://doi.org/10.1145/1341531.1341553

Personal names are an important kind of Web queries in Web search, and yet they are special in many ways. Strategies for retrieving information on personal names should therefore be different from the strategies for other types of queries. To improve ...
Metrics: Total Citations 9; Total Downloads 694; Last 12 Months 3; Last 6 weeks 0
research-article
February 2008
Deep classifier: automatically categorizing search results into large-scale hierarchies
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 139–148, https://doi.org/10.1145/1341531.1341552

Organizing Web search results into hierarchical categories facilitates users' browsing through Web search results, especially for ambiguous queries where the potential results are mixed together. Previous methods on search result classification are ...
Metrics: Total Citations 20; Total Downloads 666; Last 12 Months 3; Last 6 weeks 0
research-article
February 2008
Connectivity structure of bipartite graphs via the KNC-plot
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 129–138, https://doi.org/10.1145/1341531.1341550

In this paper we introduce the k-neighbor connectivity plot, or KNC-plot, as a tool to study the macroscopic connectivity structure of sparse bipartite graphs. Given a bipartite graph G = (U, V, E), we say that two nodes in U are k-neighbors if there ...
Metrics: Total Citations 18; Total Downloads 523; Last 12 Months 3; Last 6 weeks 0
research-article
February 2008
Preferential behavior in online groups
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 117–128, https://doi.org/10.1145/1341531.1341549

Online communities in the form of message boards, listservs, and newsgroups continue to represent a considerable amount of the social activity on the Internet. Every year thousands of groups flourish while others decline into relative obscurity; likewise, ...
Metrics: Total Citations 63; Total Downloads 1,237; Last 12 Months 15; Last 6 weeks 2
research-article
February 2008
Collaboration over time: characterizing and modeling network evolution
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 107–116, https://doi.org/10.1145/1341531.1341548

A formal type of scientific and academic collaboration is coauthorship which can be represented by a coauthorship network. Coauthorship networks are among some of the largest social networks and offer us the opportunity to study the mechanisms ...
Metrics: Total Citations 77; Total Downloads 1,212; Last 12 Months 49; Last 6 weeks 9
research-article
February 2008
A scalable pattern mining approach to web graph compression with communities
- Gregory Buehrer,
- Kumar Chellapilla
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 95–106, https://doi.org/10.1145/1341531.1341547

A link server is a system designed to support efficient implementations of graph computations on the web graph. In this work, we present a compression scheme for the web graph specifically designed to accommodate community queries and other random ...
Metrics: Total Citations 167; Total Downloads 1,583; Last 12 Months 47; Last 6 weeks 2
research-article
February 2008
An experimental comparison of click position-bias models
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 87–94, https://doi.org/10.1145/1341531.1341545

Search engine click logs provide an invaluable source of relevance information, but this information is biased. A key source of bias is presentation order: the probability of click is influenced by a document's position in the results page. This paper ...
Metrics: Total Citations 605; Total Downloads 3,668; Last 12 Months 243; Last 6 weeks 38
research-article
February 2008
SoftRank: optimizing non-smooth rank metrics
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 77–86, https://doi.org/10.1145/1341531.1341544

We address the problem of learning large complex ranking functions. Most IR applications use evaluation metrics that depend only upon the ranks of documents. However, most ranking functions generate document scores, which are sorted to produce a ...
Metrics: Total Citations 234; Total Downloads 1,356; Last 12 Months 64; Last 6 weeks 9
research-article
February 2008
Ranking web sites with real user traffic
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 65–76, https://doi.org/10.1145/1341531.1341543

We analyze the traffic-weighted Web host graph obtained from a large sample of real Web users over about seven months. A number of interesting structural properties are revealed by this complex dynamic network, some in line with the well-studied boolean ...
Metrics: Total Citations 56; Total Downloads 1,136; Last 12 Months 13; Last 6 weeks 5
research-article
February 2008
Fast learning of document ranking functions with the committee perceptron
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 55–64, https://doi.org/10.1145/1341531.1341542

This paper presents a new variant of the perceptron algorithm using selective committee averaging (or voting). We apply this algorithm to the problem of learning ranking functions for document retrieval, known as the "Learning to Rank" problem. Most ...
Metrics: Total Citations 17; Total Downloads 611; Last 12 Months 2; Last 6 weeks 0
research-article
February 2008
Entropy of search logs: how hard is search? with personalization? with backoff?
- Qiaozhu Mei,
- Kenneth Church
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 45–54, https://doi.org/10.1145/1341531.1341540

How many pages are there on the Web? 5B? 20B? More? Less? Big bets on clusters in the clouds could be wiped out if a small cache of a few million urls could capture much of the value. Language modeling techniques are applied to MSN's search logs to ...
Metrics: Total Citations 44; Total Downloads 960; Last 12 Months 7; Last 6 weeks 0
research-article
February 2008
Beyond basic faceted search
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 33–44, https://doi.org/10.1145/1341531.1341539

This paper extends traditional faceted search to support richer information discovery tasks over more complex data models. Our first extension adds flexible, dynamic business intelligence aggregations to the faceted application, enabling users to gain ...
Metrics: Total Citations 87; Total Downloads 2,101; Last 12 Months 32; Last 6 weeks 1
research-article
February 2008
Disorder inequality: a combinatorial approach to nearest neighbor search
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data Mining, Pages 25–32, https://doi.org/10.1145/1341531.1341538

We say that an algorithm for nearest neighbor search is combinatorial if only direct comparisons between two pairwise similarity values are allowed. Combinatorial algorithms for nearest neighbor search have two important advantages: (1) they do not map ...
Metrics: Total Citations 24; Total Downloads 451; Last 12 Months 4; Last 6 weeks 0
