- Research article, August 2024
Breaking Through the Noisy Correspondence: A Robust Model for Image-Text Matching
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 6, Article No. 149, Pages 1–26. https://doi.org/10.1145/3662732
Unleashing the power of image-text matching in real-world applications is hampered by noisy correspondence. Manually curating high-quality datasets is expensive and time-consuming, and datasets generated using diffusion models are not adequately well-...
- Research article, August 2024 (Just Accepted)
Federated Recommender System Based on Diffusion Augmentation and Guided Denoising
ACM Transactions on Information Systems (TOIS), Just Accepted. https://doi.org/10.1145/3688570
Sequential recommender systems often struggle to make accurate personalized recommendations due to data sparsity. Existing works use variational autoencoders and generative adversarial networks to enrich sparse data. However, they often ...
- Research article, July 2024 (Just Accepted)
Online and Offline Evaluation in Search Clarification
ACM Transactions on Information Systems (TOIS), Just Accepted. https://doi.org/10.1145/3681786
The effectiveness of clarification question models in engaging users within search systems is currently constrained, casting doubt on their overall usefulness. To improve the performance of these models, it is crucial to employ assessment approaches that ...
- Research article, July 2024 (Just Accepted)
A Self-Distilled Learning to Rank Model for Ad-hoc Retrieval
ACM Transactions on Information Systems (TOIS), Just Accepted. https://doi.org/10.1145/3681784
Learning-to-rank models are broadly applied in ad-hoc retrieval for scoring and sorting documents based on their relevance to textual queries. The generalizability of the trained model in the learning-to-rank approach, however, can have an impact on the ...
- Research article, July 2024 (Just Accepted)
On Elastic Language Models
ACM Transactions on Information Systems (TOIS), Just Accepted. https://doi.org/10.1145/3677375
Large-scale pretrained language models have achieved compelling performance in a wide range of language understanding and information retrieval tasks. While their large scale ensures capacity, it also hinders deployment. Knowledge distillation offers an ...
- Research article, July 2024 (Just Accepted)
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
ACM Transactions on Information Systems (TOIS), Just Accepted. https://doi.org/10.1145/3676957
Due to their extraordinarily large number of parameters, fine-tuning Large Language Models (LLMs) to update long-tail or out-of-date knowledge is impractical in many applications. To avoid fine-tuning, we can instead treat an LLM as a black box (...
- Research article, June 2024 (Just Accepted)
ROGER: Ranking-oriented Generative Retrieval
ACM Transactions on Information Systems (TOIS), Just Accepted. https://doi.org/10.1145/3603167
In recent years, various dense retrieval methods have been developed to improve the performance of search engines with a vectorized index. However, these approaches require a large pre-computed index and have limited capacity to memorize all semantics in ...
- Research article, May 2024
Passage-aware Search Result Diversification
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 136, Pages 1–29. https://doi.org/10.1145/3653672
Research on search result diversification strives to enhance the variety of subtopics within the list of search results. Existing studies usually treat a document as a whole and represent it with one fixed-length vector. However, considering that a long ...
- Research article, April 2024
Listwise Generative Retrieval Models via a Sequential Learning Process
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 133, Pages 1–31. https://doi.org/10.1145/3653712
Recently, a novel generative retrieval (GR) paradigm has been proposed, where a single sequence-to-sequence model is learned to directly generate a list of relevant document identifiers (docids) given a query. Existing GR models commonly employ maximum ...
- Research article, April 2024
Revisiting Bag of Words Document Representations for Efficient Ranking with Transformers
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 114, Pages 1–27. https://doi.org/10.1145/3640460
Modern transformer-based information retrieval models achieve state-of-the-art performance across various benchmarks. The self-attention of transformer models is a powerful mechanism for contextualizing terms over the whole input, but it quickly becomes ...
- Research article, April 2024
An Analysis on Matching Mechanisms and Token Pruning for Late-interaction Models
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 118, Pages 1–28. https://doi.org/10.1145/3639818
With the development of pre-trained language models, dense retrieval models have become promising alternatives to traditional retrieval models that rely on exact matching and sparse bag-of-words representations. Different from most dense retrieval ...
- Research article, April 2024
Towards Effective and Efficient Sparse Neural Information Retrieval
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 116, Pages 1–46. https://doi.org/10.1145/3634912
Sparse representation learning based on Pre-trained Language Models has seen growing interest in Information Retrieval. Such approaches can take advantage of the proven efficiency of inverted indexes and inherit desirable IR priors such as explicit ...
- Research article, April 2024
Data Augmentation for Sample Efficient and Robust Document Ranking
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 119, Pages 1–29. https://doi.org/10.1145/3634911
Contextual ranking models have delivered impressive performance improvements over classical models in the document ranking task. However, these highly over-parameterized models tend to be data-hungry and require large amounts of data even for fine-tuning. ...
- Research article, April 2024
Efficient Neural Ranking Using Forward Indexes and Lightweight Encoders
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 117, Pages 1–34. https://doi.org/10.1145/3631939
Dual-encoder-based dense retrieval models have become the standard in IR. They employ large Transformer-based language models, which are notoriously inefficient in terms of resources and latency. We propose Fast-Forward indexes, vector forward indexes ...
- Research article, April 2024
Retrieval for Extremely Long Queries and Documents with RPRS: A Highly Efficient and Effective Transformer-based Re-Ranker
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 115, Pages 1–32. https://doi.org/10.1145/3631938
Retrieval with extremely long queries and documents is a well-known and challenging task in information retrieval, commonly known as Query-by-Document (QBD) retrieval. Specifically designed Transformer models that can handle long input sequences ...
- Research article, April 2024
Multi-grained Document Modeling for Search Result Diversification
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 126, Pages 1–22. https://doi.org/10.1145/3652852
Search result diversification plays a crucial role in improving users' search experience by providing documents that cover more subtopics. Previous studies have made great progress in leveraging inter-document interactions to measure the ...
- Research article, April 2024
Cross-Model Comparative Loss for Enhancing Neuronal Utility in Language Understanding
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 123, Pages 1–29. https://doi.org/10.1145/3652599
Current natural language understanding (NLU) models have been continuously scaling up, both in model size and input context, introducing more hidden and input neurons. While this generally improves performance on average, the extra neurons do not ...
- Research article, April 2024
Generalized Weak Supervision for Neural Information Retrieval
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 5, Article No. 121, Pages 1–26. https://doi.org/10.1145/3647639
Neural ranking models (NRMs) have demonstrated effective performance in several information retrieval (IR) tasks. However, training NRMs often requires large-scale training data, which is difficult and expensive to obtain. To address this issue, one can ...