Search & Ranking

Help users find exactly what they're looking for • 53 papers

10 subtopics

Learning to Rank

Train models that order search results optimally

2010 1028 cited

From RankNet to LambdaRank to LambdaMART: An Overview

Chris Burges

Definitive reference unifying the RankNet family; LambdaMART remains the industry workhorse for gradient-boosted ranking.

2005

Learning to Rank using Gradient Descent (RankNet)

Chris Burges, et al.

Foundational pairwise neural ranking using cross-entropy loss; won ICML Test of Time Award 2015.

2007 1912 cited

Learning to Rank: From Pairwise to Listwise (ListNet)

Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, Hang Li

First listwise method using probability distributions over permutations; influenced all subsequent listwise methods.

2008 686 cited

Listwise Approach to Learning to Rank (ListMLE)

Fen Xia, Tie-Yan Liu, Jue Wang, Wensheng Zhang, Hang Li

Foundational listwise LTR with theoretical analysis of listwise loss functions.

2002 8000 cited

Optimizing Search Engines using Clickthrough Data

Thorsten Joachims

Seminal SVM-based pairwise LTR using click data for preference constraints; established click-based learning to rank paradigm.

Query Understanding

Figure out what users actually want from their searches

2004 933 cited

Understanding User Goals in Web Search

Daniel Rose, Danny Levinson

Seminal taxonomy of query intent (navigational, informational, transactional); foundational framework still used today.

2006 276 cited

Building Bridges for Web Query Classification

Dou Shen, Rong Pan, Jian-Tao Sun, Jeffrey Junfeng Pan, Kangheng Wu, Jie Yin, Qiang Yang

Foundational work on mapping queries to topic categories using intermediary data sources.

2020 3 cited

Deep Search Query Intent Understanding

Bo Xiang, et al. (Facebook)

Industrial-scale BERT-based intent classification for typeahead and search blending.

2001 3000 cited

Relevance-Based Language Models

Victor Lavrenko, W. Bruce Croft

Foundational pseudo-relevance feedback method introducing RM3; widely used for query expansion in both traditional and neural retrieval.

2005 500 cited

Context-Sensitive Information Retrieval Using Implicit Feedback

Xuehua Shen, Bin Tan, ChengXiang Zhai

Pioneered use of session context for query interpretation; demonstrated significant gains from implicit user feedback.

2020 150 cited

Few-Shot Generative Conversational Query Rewriting

Shi Yu, Jiahua Liu, Jie Yang, Chenyan Xiong, Paul Bennett, Jianfeng Gao, Zhiyuan Liu

Modern neural query reformulation using GPT-2 for conversational settings; addresses context carryover in multi-turn search.

Relevance vs. Engagement

Short-term clicks vs. satisfaction

2016 3182 cited

Deep Neural Networks for YouTube Recommendations

Paul Covington, Jay Adams, Emre Sargin

Landmark paper on watch-time optimization vs. clicks; explains freshness handling at scale.

2019 287 cited

Recommending What Video to Watch Next

Zhe Zhao, et al. (Google)

Multi-objective ranking balancing engagement with satisfaction using multi-gate mixture-of-experts.

2019 90 cited

150 Successful ML Models: 6 Lessons at Booking.com

Lucas Bernardi, et al.

Influential industry paper on balancing business metrics vs. user value in production systems.

2025 26 cited

Engagement, User Satisfaction, and Divisive Content Amplification

Smitha Milli, et al.

Demonstrates empirically that engagement-based ranking underperforms for user satisfaction.

2014 500 cited

Modeling Dwell Time to Predict Click-level Satisfaction

Youngho Kim, Ahmed Hassan, Ryen W. White, Imed Zitouni

Established context-dependent satisfaction thresholds beyond fixed dwell time cutoffs; showed satisfaction prediction requires query-document context.

2013 400 cited

Beyond Clicks: Query Reformulation as a Predictor of Search Satisfaction

Ahmed Hassan, Xiaolin Shi, Nick Craswell, Bill Ramsey

Demonstrated satisfaction signals exist beyond clicks; query reformulation patterns predict user satisfaction better than clicks alone.

Position Bias & Debiasing

Account for the fact that top results get more clicks

2008 935 cited

An Experimental Comparison of Click Position-Bias Models

Nick Craswell, Onno Zoeter, Michael Taylor, Bill Ramsey

Seminal empirical study establishing cascade model as best explanation for user examination behavior.

2017 495 cited

Unbiased Learning-to-Rank with Biased Feedback

Thorsten Joachims, Adith Swaminathan, Tobias Schnabel

Foundational counterfactual/IPS framework for unbiased LTR; propensity-weighted ranking SVM.

2015 260 cited

Click Models for Web Search

Aleksandr Chuklin, Ilya Markov, Maarten de Rijke

Comprehensive synthesis covering all major click models (PBM, DCM, DBN, UBM); essential reference.

2018 179 cited

Unbiased LTR with Unbiased Propensity Estimation (DLA)

Qingyao Ai, Keping Bi, Cheng Luo, Jiafeng Guo, W. Bruce Croft

Dual learning algorithm jointly training ranking and propensity models; widely used for production debiasing.

2020 36 cited

Improving Deep Learning for Airbnb Search

Malay Haldar, et al.

Practical industry case on position bias correction via dropout at inference; shows major production gains.

2019 150 cited

Addressing Trust Bias for Unbiased Learning-to-Rank

Aman Agarwal, Xuanhui Wang, Cheng Li, Michael Bendersky, Marc Najork

First rigorous treatment of trust bias in ULTR framework; extends counterfactual work to account for users trusting higher-ranked results more.

Neural Ranking Models

Use deep learning for search relevance

2019 400 cited

Multi-Stage Document Ranking with BERT

Rodrigo Nogueira, Wei Yang, Kyunghyun Cho, Jimmy Lin

Established BERT-based cross-encoder reranking paradigm with MonoBERT and duoBERT; 'Expando-Mono-Duo' design pattern for neural IR pipelines.

2019 300 cited

From doc2query to docTTTTTquery

Rodrigo Nogueira, Jimmy Lin

Neural document expansion via T5-generated queries; improves BM25 without runtime overhead by enriching documents at indexing time.

2016 800 cited

A Deep Relevance Matching Model for Ad-hoc Retrieval

Jiafeng Guo, Yixing Fan, Qingyao Ai, W. Bruce Croft

Distinguished 'relevance matching' from 'semantic matching' in neural IR; histogram-based matching with term gating mechanism.

2017 560 cited

End-to-End Neural Ad-hoc Ranking with Kernel Pooling

Chenyan Xiong, Zhuyun Dai, Jamie Callan, Zhiyuan Liu, Russell Power

Pioneered end-to-end neural ranking with interpretable soft-match kernels (K-NRM); spawned KNRM variants used in production.

2022 250 cited

Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval

Luyu Gao, Jamie Callan

Pre-training specifically for dense retrieval via Condenser architecture; coCondenser adds corpus-aware contrastive objective for further gains.

Retrieval-Augmented Generation

Combine search with generative AI

2020 1500 cited

REALM: Retrieval-Augmented Language Model Pre-Training

Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, Ming-Wei Chang

First end-to-end pre-training of retrieval + LM with backpropagation through retrieval; foundational architecture for knowledge-intensive tasks.

2020 5000 cited

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

THE paper that coined 'RAG'; combines pre-trained retriever with seq2seq generator for open-domain QA. Foundation for modern RAG systems.

2021 1200 cited

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

Gautier Izacard, Edouard Grave

Fusion-in-Decoder (FiD) architecture enabling efficient scaling to 100+ passages; backbone of Atlas and subsequent RAG systems.

2022 400 cited

Transformer Memory as a Differentiable Search Index

Yi Tay, Vinh Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Gupta, Tal Schuster, William W. Cohen, Donald Metzler

Generative retrieval paradigm (DSI): model memorizes corpus and generates document IDs directly; alternative to dense/sparse retrieve-then-rank.

Evaluation Methods for IR

Measure search quality effectively

2002 10000 cited

Cumulated Gain-Based Evaluation of IR Techniques

Kalervo Järvelin, Jaana Kekäläinen

Introduced DCG and NDCG; most widely used ranking metric enabling graded relevance evaluation.

2009 800 cited

Expected Reciprocal Rank for Graded Relevance

Olivier Chapelle, Donald Metzler, Ya Zhang, Pierre Grinspan

Cascade-based metric modeling user stopping behavior; primary TREC Web Track metric accounting for diminishing returns.

2016 2000 cited

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen, Mir Rosenberg, Xia Song, Alina Stoica, Saurabh Tiwary, Tong Wang

Large-scale passage ranking benchmark with 1M queries; enabled neural retrieval research and remains primary leaderboard.

2021 800 cited

BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models

Nandan Thakur, Nils Reimers, Andreas Rücklé, Abhishek Srivastava, Iryna Gurevych

18-dataset benchmark testing out-of-distribution generalization; key finding that BM25 remains robust while dense models struggle on domain shift.

2012 400 cited

Large-Scale Validation and Analysis of Interleaved Search Evaluation

Olivier Chapelle, Thorsten Joachims, Filip Radlinski, Yisong Yue

Definitive validation of interleaving as online A/B testing gold standard; demonstrated high agreement with editorial judgments at scale.

Result Diversification

Show varied results that cover different intents

1998 2500 cited

The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries

Jaime Carbonell, Jade Goldstein

THE seminal diversification paper introducing Maximal Marginal Relevance (MMR) formula; balances relevance with novelty.

2008 1000 cited

Novelty and Diversity in Information Retrieval Evaluation

Charles L.A. Clarke, Maheedhar Kolla, Gordon V. Cormack, Olga Vechtomova, Azin Ashkan, Stefan Büttcher, Ian MacKinnon

Standard evaluation framework for diversity introducing α-nDCG; enabled TREC Web diversity track and systematic diversity research.

2009 600 cited

Diversifying Search Results

Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, Samuel Ieong

Intent-aware diversification with formal coverage guarantees; models query as distribution over intents and optimizes expected coverage.

Must-read papers for tech economists and applied researchers