Computing methodologies

20 Results for: Book/Issue: SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval

Article
August 2002
Translingual vocabulary mappings for multilingual information access
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 455–456, https://doi.org/10.1145/564376.564496
Metrics: Total Citations 6, Total Downloads 317, Last 12 Months 0, Last 6 weeks 0

Article
August 2002
Adaptive information extraction for document annotation in Amilcare
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Page 451, https://doi.org/10.1145/564376.564492

Amilcare is a tool for Adaptive Information Extraction (IE) designed for supporting active annotation of documents for the Semantic Web (SW). It can be used either for unsupervised document annotation or as a support for human annotation. Amilcare is ...
Metrics: Total Citations 5, Total Downloads 493, Last 12 Months 1, Last 6 weeks 0

Article
August 2002
Correlating multilingual documents via bipartite graph modeling
- Hongyuan Zha
- Xiang Ji
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 443–444, https://doi.org/10.1145/564376.564485

There is an enormous amount of multilingual documents from various sources and possibly from different countries describing a single event or a set of related events. It is desirable to construct text mining methods that can compare and highlight ...
Metrics: Total Citations 8, Total Downloads 537, Last 12 Months 2, Last 6 weeks 1

Article
August 2002
Topic structure modeling
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 417–418, https://doi.org/10.1145/564376.564472

In this paper, we present a method based on document probes to quantify and diagnose topic structure, distinguishing topics as monolithic, structured, or diffuse. The method also yields a structure analysis that can be used directly to optimize filter (...
Metrics: Total Citations 3, Total Downloads 383, Last 12 Months 0, Last 6 weeks 0

Article
August 2002
Building thematic lexical resources by term categorization
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 415–416, https://doi.org/10.1145/564376.564471

We discuss the automatic generation of thematic lexicons by means of term categorization, a novel task employing techniques from information retrieval (IR) and machine learning (ML). Specifically, we view the generation of such lexicons as an iterative ...
Metrics: Total Citations 0, Total Downloads 348, Last 12 Months 0, Last 6 weeks 0

Article
August 2002
Modeling (in)variability of human judgments for text summarization
- Tadashi Nomoto
- Yuji Matsumoto
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 407–408, https://doi.org/10.1145/564376.564467

The paper proposes and empirically motivates an integration of supervised learning with unsupervised learning to deal with human biases in summarization. In particular, we explore the use of a probabilistic decision tree within the clustering framework to ...
Metrics: Total Citations 0, Total Downloads 318, Last 12 Months 0, Last 6 weeks 0

Article
August 2002
Automatic metadata generation & evaluation
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 401–402, https://doi.org/10.1145/564376.564464

The poster reports on a project in which we are investigating methods for breaking the human metadata-generation bottleneck that plagues Digital Libraries. The research question is whether metadata elements and values can be automatically generated from ...
Metrics: Total Citations 33, Total Downloads 1,492, Last 12 Months 25, Last 6 weeks 2

Article
August 2002
User-centered interface design for cross-language information retrieval
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 383–384, https://doi.org/10.1145/564376.564455

This paper reports on the user-centered design methodology and techniques used for the elicitation of user requirements and how these requirements informed the first phase of the user interface design for a Cross-Language Information Retrieval System. ...
Metrics: Total Citations 4, Total Downloads 712, Last 12 Months 4, Last 6 weeks 0

Article
August 2002
Amilcare: adaptive information extraction for document annotation
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 367–368, https://doi.org/10.1145/564376.564447
Metrics: Total Citations 4, Total Downloads 399, Last 12 Months 0, Last 6 weeks 0

Article
August 2002
ICA and SOM in text document analysis
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 361–362, https://doi.org/10.1145/564376.564444

In this study we show experimental results on using Independent Component Analysis (ICA) and the Self-Organizing Map (SOM) in document analysis. Our documents are segments of spoken dialogues carried out over the telephone in a customer service, ...
Metrics: Total Citations 10, Total Downloads 836, Last 12 Months 1, Last 6 weeks 0

Article
August 2002
Using self-supervised word segmentation in Chinese information retrieval
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 349–350, https://doi.org/10.1145/564376.564438

We propose a self-supervised word-segmentation technique for Chinese information retrieval. This method combines the advantages of traditional dictionary based approaches with character based approaches, while overcoming many of their shortcomings. ...
Metrics: Total Citations 6, Total Downloads 516, Last 12 Months 1, Last 6 weeks 0

Article
August 2002
Using part-of-speech patterns to reduce query ambiguity
- James Allan
- Hema Raghavan
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 307–314, https://doi.org/10.1145/564376.564430

Query ambiguity is a generally recognized problem, particularly in Web environments where queries are commonly only one or two words in length. In this study, we explore one technique that finds commonly occurring patterns of parts of speech near a one-...
Metrics: Total Citations 62, Total Downloads 947, Last 12 Months 3, Last 6 weeks 0

Article
August 2002
Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 275–282, https://doi.org/10.1145/564376.564425

Arabic, a highly inflected language, requires good stemming for effective information retrieval, yet no standard approach to stemming has emerged. We developed several light stemmers based on heuristics and a statistical stemmer based on co-occurrence ...
Metrics: Total Citations 191, Total Downloads 1,826, Last 12 Months 8, Last 6 weeks 0

Article
August 2002
Empirical studies in strategies for Arabic retrieval
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 269–274, https://doi.org/10.1145/564376.564424

This work evaluates a few search strategies for Arabic monolingual and cross-lingual retrieval, using the TREC Arabic corpus as the test-bed. The release by NIST in 2001 of an Arabic corpus of nearly 400k documents with both monolingual and cross-...
Metrics: Total Citations 42, Total Downloads 590, Last 12 Months 2, Last 6 weeks 0

Article
August 2002
Methods and metrics for cold-start recommendations
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 253–260, https://doi.org/10.1145/564376.564421

We have developed a method for recommending items that combines content and collaborative data under a single probabilistic framework. We benchmark our algorithm against a naïve Bayes classifier on the cold-start problem, where we wish to recommend ...
Metrics: Total Citations 1,082, Total Downloads 7,298, Last 12 Months 315, Last 6 weeks 36

Article
August 2002
Probabilistic combination of text classifiers using reliability indicators: models and results
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 207–214, https://doi.org/10.1145/564376.564413

The intuition that different text classifiers behave in qualitatively different ways has long motivated attempts to build a better metaclassifier via some combination of classifiers. We introduce a probabilistic method for combining classifiers that ...
Metrics: Total Citations 37, Total Downloads 857, Last 12 Months 4, Last 6 weeks 0

Article
August 2002
A new family of online algorithms for category ranking
- Koby Crammer
- Yoram Singer
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 151–158, https://doi.org/10.1145/564376.564404

We describe a new family of topic-ranking algorithms for multi-labeled documents. The motivation for the algorithms stems from recent advances in online learning algorithms. The algorithms we present are simple to implement and are time and memory ...
Metrics: Total Citations 28, Total Downloads 1,027, Last 12 Months 7, Last 6 weeks 0

Article
August 2002
Unsupervised document classification using sequential information maximization
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 129–136, https://doi.org/10.1145/564376.564401

We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential (sIB) approach is guaranteed to converge to a local maximum of the ...
Metrics: Total Citations 151, Total Downloads 2,033, Last 12 Months 30, Last 6 weeks 4

Article
August 2002
Cross-document summarization by concept classification
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 121–128, https://doi.org/10.1145/564376.564399

In this paper we describe a Cross Document Summarizer XDoX designed specifically to summarize large document sets (50-500 documents and more). Such sets of documents are typically obtained from routing or filtering systems run against a continuous ...
Metrics: Total Citations 43, Total Downloads 1,306, Last 12 Months 5, Last 6 weeks 0

Article
August 2002
Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering
- Hongyuan Zha
SIGIR '02: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, Pages 113–120, https://doi.org/10.1145/564376.564398

A novel method for simultaneous keyphrase extraction and generic text summarization is proposed by modeling text documents as weighted undirected and weighted bipartite graphs. Spectral graph clustering algorithms are used for partitioning sentences of ...
Metrics: Total Citations 113, Total Downloads 2,314, Last 12 Months 10, Last 6 weeks 0