Credit & Lending

Assess risk and make fair lending decisions • 52 papers

10 subtopics

Credit Scoring & Risk Models

Predict loan defaults and assess creditworthiness

1968 13152 cited

Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy

Edward I. Altman

The Z-score model using MDA; 10,000+ citations and still the benchmark for bankruptcy prediction.

1980 5845 cited

Financial Ratios and the Probabilistic Prediction of Bankruptcy

James A. Ohlson

Introduced logistic regression to default prediction; established logit as industry workhorse.

1974 10960 cited

On the Pricing of Corporate Debt: The Risk Structure of Interest Rates

Robert C. Merton

Structural model treating equity as call option; foundation for KMV/Moody's EDF.

1981 12857 cited

Credit Rationing in Markets with Imperfect Information

Joseph E. Stiglitz, Andrew Weiss

Explains equilibrium credit rationing from adverse selection; essential for lending market theory.

2016

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

Dominant ML algorithm for credit scoring; consistently outperforms logistic regression.

2015 1400 cited

Benchmarking State-of-the-Art Classification Algorithms for Credit Scoring: An Update of Research

Stefan Lessmann, Bart Baesens, Hsin-Vonn Seow, Lyn C. Thomas

Definitive benchmark comparing 41 classifiers across 8 datasets; establishes that ensemble methods outperform logistic regression.

2010 700 cited

Consumer Credit-Risk Models Via Machine-Learning Algorithms

Amir E. Khandani, Adlar J. Kim, Andrew W. Lo

First major paper showing transaction data + ML dramatically improves default prediction; estimates 6-25% cost savings.

2016 245 cited

Risk and Risk Management in the Credit Card Industry

Florentin Butaru, Qingqing Chen, Brian Clark, Sanmay Das, Andrew W. Lo, Akhtar Siddique

Uses account-level data from 6 major US banks; finds substantial heterogeneity—no single model works for all.

Fair Lending & Disparate Impact

Ensure lending decisions are fair across groups

1996 1026 cited

Mortgage Lending in Boston: Interpreting HMDA Data

Alicia H. Munnell, Geoffrey M.B. Tootell, Lynn E. Browne, James McEneaney

Seminal Boston Fed study documenting lending discrimination; foundational for fair lending enforcement.

2022 458 cited

Consumer-Lending Discrimination in the FinTech Era

Robert Bartlett, Adair Morse, Richard Stanton, Nancy Wallace

Latinx/African-American borrowers pay 7.9 bps more; FinTech reduces but doesn't eliminate discrimination.

2022 349 cited

Predictably Unequal? The Effects of Machine Learning on Credit Markets

Andreas Fuster, Paul Goldsmith-Pinkham, Tarun Ramadorai, Ansgar Walther

ML models create distributional impacts favoring advantaged groups even without using race.

2021 13 cited

How Costly is Noise? Data and Disparities in Consumer Credit

Laura Blattner, Scott Nelson

Credit scores are noisier for minority borrowers; quantifies how data disparities translate to lending disparities.

2017 3000 cited

Inherent Trade-Offs in the Fair Determination of Risk Scores

Jon Kleinberg, Sendhil Mullainathan, Manish Raghavan

Proves impossibility of satisfying calibration and error rate parity simultaneously—THE foundational fairness theorem.

2017 2500 cited

Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments

Alexandra Chouldechova

Independently derives impossibility result with explicit disparate impact focus; addressed ProPublica/COMPAS controversy.

Pricing Credit Products

Set interest rates based on risk

1991 789 cited

The Failure of Competition in the Credit Card Market

Lawrence M. Ausubel

Documented sticky credit card rates; introduced adverse selection explanations establishing the field.

1999 225 cited

Adverse Selection in the Credit Card Market

Lawrence M. Ausubel

First direct evidence of adverse selection using randomized solicitations.

2010 90 cited

Estimating Welfare in Insurance Markets Using Variation in Prices

Liran Einav, Amy Finkelstein, Mark R. Cullen

Demand-and-cost curve framework for analyzing selection; widely applied to credit markets.

2011 333 cited

Selection in Insurance Markets: Theory and Empirics in Pictures

Liran Einav, Amy Finkelstein

Intuitive graphical framework for understanding selection and welfare in credit/insurance.

2018 156 cited

Time to Default in Credit Scoring Using Survival Analysis: A Benchmark Study

Dieter Djeundje, Jonathan Crook

Benchmark comparing survival methods for WHEN default occurs—critical for IFRS 9 and lifetime expected credit loss.

Collections & Recovery

Optimize strategies for collecting overdue payments

2002 563 cited

An Empirical Analysis of Personal Bankruptcy and Delinquency

David B. Gross, Nicholas S. Souleles

Landmark study finding increased default propensity independent of risk composition; suggests declining stigma.

2004 277 cited

What Do We Know About Loss Given Default?

Til Schuermann

Comprehensive LGD estimation review; go-to reference for Basel II/III recovery models.

2002 20 cited

LossCalc: Model for Predicting Loss Given Default

Greg M. Gupton, Roger M. Stein (Moody's)

Industry-standard LGD model using debt type, seniority, and macro factors.

2004 117 cited

Measuring LGD on Commercial Loans: An 18-Year Internal Study

Michel Araten, Michael Jacobs, Peeyush Varshney

JPMorgan Chase's 18-year study of 3,761 defaults establishing key LGD drivers.

Alternative Data

Use non-traditional data for credit decisions

2020 69 cited

Behavior Revealed in Mobile Phone Usage Predicts Credit Repayment

Daniel Björkegren, Darrell Grissen

Mobile behavioral data outperforms credit bureaus for thin-file borrowers; foundational fintech paper.

2022 17 cited

Invisible Primes: Fintech Lending with Alternative Data

Marco Di Maggio, Dimuthu Ratnadiwakara, Don Carmichael

Alternative data identifies 'invisible primes' overlooked by traditional scores.

2015 674 cited

Predicting Poverty and Wealth from Mobile Phone Metadata

Joshua Blumenstock, Gabriel Cadamuro, Robert On

Mobile metadata predicts socioeconomic status; opened mobile-based credit scoring in developing countries.

2017

Use of Alternative Data in Credit Process

CFPB

Documents 45 million 'credit invisible' Americans; foundational regulatory framework.

2020 456 cited

On the Rise of FinTechs: Credit Scoring Using Digital Footprints

Tobias Berg, Valentin Burg, Ana Gombović, Manju Puri

Digital footprints (device, email domain, typing) match credit bureau accuracy; foundational fintech credit study.

Fraud, Credit and Claim Risk & Anomaly Detection

Identify fraudulent transactions and anomalous patterns in credit applications

2002 24500 cited

SMOTE: Synthetic Minority Over-sampling Technique

Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, W. Philip Kegelmeyer

The foundational imbalanced-learning method used in virtually every fraud detection system.

2002 1856 cited

Statistical Fraud, Credit and Claim Risk: A Review

Richard J. Bolton, David J. Hand

Authoritative taxonomy of fraud detection methods; still cited for conceptual foundations.

2009 10750 cited

Anomaly Detection: A Survey

Varun Chandola, Arindam Banerjee, Vipin Kumar

Definitive survey covering statistical, ML, and proximity-based anomaly methods.

2018 789 cited

Credit Card Fraud, Credit and Claim Risk: A Realistic Modeling and a Novel Learning Strategy

Andrea Dal Pozzolo, Giacomo Boracchi, Olivier Caelen, Cesare Alippi, Gianluca Bontempi

Addresses realistic fraud detection challenges: class imbalance, concept drift, and delayed feedback.

2016 456 cited

Feature Engineering Strategies for Credit Card Fraud, Credit and Claim Risk

Alejandro Correa Bahnsen, Djamila Aouada, Aleksandar Stojanovic, Björn Ottersten

Transaction aggregation features improve fraud detection by 40%; widely adopted in industry.

2015 1523 cited

Graph-Based Anomaly Detection and Description: A Survey

Leman Akoglu, Hanghang Tong, Danai Koutra

Survey on using graph structure to detect fraud rings and coordinated attacks.

2015 267 cited

APATE: A Novel Approach for Automated Credit Card Transaction Fraud, Credit and Claim Risk Using Network-Based Extensions

Véronique Van Vlasselaer, Tina Eliassi-Rad, Leman Akoglu, Monique Snoeck, Bart Baesens

Network propagation improves fraud detection by leveraging transaction graph structure.

Insurance Claims & Actuarial ML

Apply ML to insurance pricing, claims prediction, and reserving

2019 234 cited

Nesting Classical Actuarial Models into Neural Networks

Jürg Schelldorfer, Mario V. Wüthrich

Embedding GLMs into neural networks improves insurance pricing while maintaining interpretability.

2018 89 cited

Data Driven Binning for Insurance Tariffs

Roel Henckaerts, Marie-Pier Côté, Antonio Bella, Katrien Antonio

Evolutionary trees optimize premium segmentation while respecting regulatory constraints.

2023 45 cited

Detecting Insurance Fraud Using Supervised and Unsupervised Machine Learning

Jonas Debener, Johannes Kriebel, Oskar Kożlowski

Comprehensive comparison of fraud detection methods for insurance claims.

2019 156 cited

Claims Frequency Modeling Using Telematics Car Driving Data

Guangyuan Gao, Shengwang Meng, Mario V. Wüthrich

Telematics data (speed, braking) improves claims prediction; foundational usage-based insurance paper.

2018 178 cited

Neural Networks Applied to Chain-Ladder Reserving

Mario V. Wüthrich

Neural networks improve reserve estimation over traditional chain-ladder methods.

Safety & Trust Scoring on Platforms

Build and analyze reputation and trust systems for marketplace participants

2010 756 cited

The Dynamics of Seller Reputation: Evidence from eBay

Luís Cabral, Ali Hortaçsu

Foundational empirical study of reputation dynamics and their impact on seller behavior.

2015 423 cited

The Limits of Reputation in Platform Markets: An Empirical Analysis and Field Experiment

Chris Nosko, Steven Tadelis

Field experiment showing reputation inflation and limits of feedback systems.

2016 387 cited

Reputation and Feedback Systems in Online Platform Markets

Steven Tadelis

Comprehensive survey of reputation system design and effectiveness in platforms.

2017 234 cited

The Value of Reputation Information: Evidence from a Natural Experiment

Seth Freedman, Ginger Zhe Jin

Natural experiment measuring the causal impact of reputation visibility on market outcomes.

Explainability & Regulatory ML

Build interpretable models that satisfy regulatory requirements

2017 18500 cited

A Unified Approach to Interpreting Model Predictions (SHAP)

Scott M. Lundberg, Su-In Lee

SHAP values unify feature attribution methods; now standard for credit model explanations.

2016 15200 cited

'Why Should I Trust You?': Explaining the Predictions of Any Classifier (LIME)

Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

Local interpretable explanations for any black-box model; widely used for adverse action notices.

2019 3450 cited

Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead

Cynthia Rudin

Argues interpretable models match black-box accuracy for credit; influential regulatory perspective.

2015 2340 cited

Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission (GA²M)

Rich Caruana, Yin Lou, Johannes Gehrke, Paul Koch, Marc Sturm, Noemie Elhadad

Generalized additive models with interactions; template for interpretable credit scoring.

Real-time Decisioning & Deployment

Deploy and maintain credit models in production with concept drift handling

2014 3890 cited

A Survey on Concept Drift Adaptation

João Gama, Indrė Žliobaitė, Albert Bifet, Mykola Pechenizkiy, Abdelhamid Bouchachia

Comprehensive survey on detecting and adapting to changing data distributions in credit.

2014 89 cited

Selection Bias in Credit Scorecard Evaluation

David J. Hand, Niall M. Adams

Identifies sample selection bias in scorecard validation; essential for production monitoring.

2022 67 cited

Reject Inference Methods in Credit Scoring: A Systematic Review and New Approaches

Adrien Ehrhardt, Christophe Biernacki, Vincent Vandewalle, Philippe Heinrich

Reviews and advances reject inference methods for handling missing data from declined applications.

2018 234 cited

Streaming Active Learning Strategies for Real-Life Credit Card Fraud, Credit and Claim Risk

Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Gianluca Bontempi

Active learning reduces labeling costs for streaming fraud detection systems.

Must-read papers for tech economists and applied researchers