Information retrieval

Applied Filters

People

Publications

Conferences

Publication Date

18 Results for: Book/Issue: WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,833,026 records)|Limit your search to The ACM Full-Text Collection (773,090 records)

Showing 1 - 18of18 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
February 2008
Advertising keyword suggestion based on concept hierarchy
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 251–260https://doi.org/10.1145/1341531.1341564

The increasing growth of the World Wide Web constantly enlarges the revenue generated by search engine advertising. Advertisers bid on keywords associated with their products to display their ads on the search result pages. Keyword suggestion methods ...
70
1,555
Metrics
Total Citations70
Total Downloads1,555
Last 12 Months12
Last 6 weeks0
Get Access
research-article
February 2008
A holistic lexicon-based approach to opinion mining
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 231–240https://doi.org/10.1145/1341531.1341561

One of the important types of information on the Web is the opinions expressed in the user generated content, e.g., customer reviews of products, forum posts, and blogs. In this paper, we focus on customer reviews of products. In particular, we study ...
797
5,916
Metrics
Total Citations797
Total Downloads5,916
Last 12 Months126
Last 6 weeks14
Get Access
research-article
February 2008
Opinion spam and analysis
- Nitin Jindal,
- Bing Liu
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 219–230https://doi.org/10.1145/1341531.1341560

Evaluative texts on the Web have become a valuable source of opinions on products, services, events, individuals, etc. Recently, many researchers have studied such opinion sources as product reviews, forum posts, and blogs. However, existing research ...
885
5,599
Metrics
Total Citations885
Total Downloads5,599
Last 12 Months247
Last 6 weeks24
Get Access
research-article
February 2008
Can social bookmarking improve web search?
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 195–206https://doi.org/10.1145/1341531.1341558

Social bookmarking is a recent phenomenon which has the potential to give us a great deal of data about pages on the web. One major question is whether that data can be used to augment systems like web search. To answer this question, over the past year ...
308
2,997
Metrics
Total Citations308
Total Downloads2,997
Last 12 Months14
Last 6 weeks1
Get Access
research-article
February 2008
Finding high-quality content in social media
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 183–194https://doi.org/10.1145/1341531.1341557

The quality of user-generated content varies drastically from excellent to abuse and spam. As the availability of such content increases, the task of identifying high-quality content sites based on user contributions --social media sites -- becomes ...
788
22,337
Metrics
Total Citations788
Total Downloads22,337
Last 12 Months674
Last 6 weeks72
Get Access
research-article
February 2008
On ranking controversies in wikipedia: models and evaluation
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 171–182https://doi.org/10.1145/1341531.1341556

Wikipedia 1 is a very large and successful Web 2.0 example. As the number of Wikipedia articles and contributors grows at a very fast pace, there are also increasing disputes occurring among the contributors. Disputes often happen in articles with ...
61
1,133
Metrics
Total Citations61
Total Downloads1,133
Last 12 Months26
Last 6 weeks2
Get Access
research-article
February 2008
Understanding temporal aspects in document classification
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 159–170https://doi.org/10.1145/1341531.1341554

Due to the increasing amount of information present on the Web, Automatic Document Classification (ADC) has become an important research topic. ADC usually follows a standard supervised learning strategy, where we first build a model using preclassified ...
21
696
Metrics
Total Citations21
Total Downloads696
Last 12 Months7
Last 6 weeks0
Get Access
research-article
February 2008
Connectivity structure of bipartite graphs via the KNC-plot
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 129–138https://doi.org/10.1145/1341531.1341550

In this paper we introduce the k-neighbor connectivity plot, or KNC-plot, as a tool to study the macroscopic connectiv-ity structure of sparse bipartite graphs. Given a bipartite graph G = (U, V, E), we say that two nodes in U are k-neighbors if there ...
18
523
Metrics
Total Citations18
Total Downloads523
Last 12 Months3
Last 6 weeks0
Get Access
research-article
February 2008
A scalable pattern mining approach to web graph compression with communities
- Gregory Buehrer,
- Kumar Chellapilla
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 95–106https://doi.org/10.1145/1341531.1341547

A link server is a system designed to support efficient implementations of graph computations on the web graph. In this work, we present a compression scheme for the web graph specifically designed to accommodate community queries and other random ...
167
1,583
Metrics
Total Citations167
Total Downloads1,583
Last 12 Months47
Last 6 weeks2
Get Access
research-article
February 2008
An experimental comparison of click position-bias models
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 87–94https://doi.org/10.1145/1341531.1341545

Search engine click logs provide an invaluable source of relevance information, but this information is biased. A key source of bias is presentation order: the probability of click is influenced by a document's position in the results page. This paper ...
605
3,668
Metrics
Total Citations605
Total Downloads3,668
Last 12 Months243
Last 6 weeks38
Get Access
research-article
February 2008
SoftRank: optimizing non-smooth rank metrics
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 77–86https://doi.org/10.1145/1341531.1341544

We address the problem of learning large complex ranking functions. Most IR applications use evaluation metrics that depend only upon the ranks of documents. However, most ranking functions generate document scores, which are sorted to produce a ...
234
1,356
Metrics
Total Citations234
Total Downloads1,356
Last 12 Months64
Last 6 weeks9
Get Access
research-article
February 2008
Fast learning of document ranking functions with the committee perceptron
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 55–64https://doi.org/10.1145/1341531.1341542

This paper presents a new variant of the perceptron algorithm using selective committee averaging (or voting). We apply this agorithm to the problem of learning ranking functions for document retrieval, known as the "Learning to Rank" problem. Most ...
17
611
Metrics
Total Citations17
Total Downloads611
Last 12 Months2
Last 6 weeks0
Get Access
research-article
February 2008
Entropy of search logs: how hard is search? with personalization? with backoff?
- Qiaozhu Mei,
- Kenneth Church
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 45–54https://doi.org/10.1145/1341531.1341540

How many pages are there on the Web? 5B? 20B? More? Less? Big bets on clusters in the clouds could be wiped out if a small cache of a few million urls could capture much of the value. Language modeling techniques are applied to MSN's search logs to ...
44
960
Metrics
Total Citations44
Total Downloads960
Last 12 Months7
Last 6 weeks0
Get Access
research-article
February 2008
Beyond basic faceted search
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 33–44https://doi.org/10.1145/1341531.1341539

This paper extends traditional faceted search to support richer information discovery tasks over more complex data models. Our first extension adds exible, dynamic business intelligence aggregations to the faceted application, enabling users to gain ...
87
2,101
Metrics
Total Citations87
Total Downloads2,101
Last 12 Months32
Last 6 weeks1
Get Access
research-article
February 2008
Disorder inequality: a combinatorial approach to nearest neighbor search
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 25–32https://doi.org/10.1145/1341531.1341538

We say that an algorithm for nearest neighbor search is combinatorial if only direct comparisons between two pairwise similarity values are allowed. Combinatorial algorithms for nearest neighbor search have two important advantages: (1) they do not map ...
24
451
Metrics
Total Citations24
Total Downloads451
Last 12 Months4
Last 6 weeks0
Get Access
research-article
February 2008
On placing skips optimally in expectation
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 15–24https://doi.org/10.1145/1341531.1341537

We study the problem of optimal skip placement in an inverted list. Assuming the query distribution to be known in advance, we formally prove that an optimal skip placement can be computed quite efficiently. Our best algorithm runs in time O (n log n), ...
15
414
Metrics
Total Citations15
Total Downloads414
Last 12 Months3
Last 6 weeks0
Get Access
research-article
February 2008
Crawl ordering by search impact
- Sandeep Pandey,
- Christopher Olston
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPages 3–14https://doi.org/10.1145/1341531.1341535

We study how to prioritize the fetching of new pages under the objective of maximizing the quality of search results. In particular, our objective is to fetch new pages that have the most impact, where the impact of a page is equal to the number of ...
20
870
Metrics
Total Citations20
Total Downloads870
Last 12 Months7
Last 6 weeks0
Get Access
invited-talk
February 2008
Web information management: past, present and future
- Hector Garcia-Molina
WSDM '08: Proceedings of the 2008 International Conference on Web Search and Data MiningPage 1https://doi.org/10.1145/1341531.1341532

In this talk I will give a brief retrospective on Web Information Management, and will discuss some of the key challenges for the future. I will not give a survey of all work in the area; instead I will give my personal perspective based on work in the ...
2
552
Metrics
Total Citations2
Total Downloads552
Last 12 Months1
Last 6 weeks0
Get Access

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Proceedings Series

Publication Date

Advertising keyword suggestion based on concept hierarchy

A holistic lexicon-based approach to opinion mining

Opinion spam and analysis

Can social bookmarking improve web search?

Finding high-quality content in social media

On ranking controversies in wikipedia: models and evaluation

Understanding temporal aspects in document classification

Connectivity structure of bipartite graphs via the KNC-plot

A scalable pattern mining approach to web graph compression with communities

An experimental comparison of click position-bias models

SoftRank: optimizing non-smooth rank metrics

Fast learning of document ranking functions with the committee perceptron

Entropy of search logs: how hard is search? with personalization? with backoff?

Beyond basic faceted search

Disorder inequality: a combinatorial approach to nearest neighbor search

On placing skips optimally in expectation

Crawl ordering by search impact

Web information management: past, present and future