Spatial & Geo
Analyze location data and optimize geographic coverage • 41 papers
Real Estate & Proptech
Build automated valuation models and analyze housing market dynamics
Hedonic Prices and Implicit Markets: Product Differentiation in Pure Competition
Theoretical foundation for all hedonic pricing models; every AVM derives from this framework.
The Efficiency of the Market for Single-Family Homes
Original Case-Shiller methodology establishing weighted repeat-sales indices used in S&P/Case-Shiller and FHFA.
Housing Dynamics
Models serial correlation in house prices (momentum at 1-year, mean reversion at 5+ years); explains forecasting difficulty.
How Much is the View from the Window Worth? Machine Learning-Driven Hedonic Pricing Model
25% accuracy improvement by integrating image and text data with interpretable SHAP-based ML for AVMs.
Why is Intermediating Houses so Difficult? Evidence from iBuyers
Definitive analysis of Opendoor and Zillow Offers; quantifies adverse selection vs. liquidity tradeoffs in iBuying.
The Effect of Home-Sharing on House Prices and Rents: Evidence from Airbnb
IV strategy on nationwide Airbnb data; 1% listing increase → 0.018% rent increase via supply reallocation.
Do Short-Term Rental Platforms Affect Housing Markets? Evidence from Airbnb in Barcelona
Rigorous event-study finding Airbnb increased Barcelona rents 1.9% and prices 4.6%.
Housing Market Expectations
Comprehensive survey on household price expectation formation; documents extrapolation and social network effects.
Geospatial Demand Modeling
Model demand across geographic areas
Kernel Density Estimation
Foundational non-parametric density estimation underpinning demand heatmaps.
Geographically Weighted Regression
Local regression allowing coefficients to vary spatially—key for heterogeneous demand.
Deep Gravity: Large-Scale Origin-Destination Flows
Neural network predicting mobility flows, outperforming classic gravity models.
Understanding Uber Demand
Uber's real-time geospatial demand prediction powering surge and dispatch.
ETA Prediction with Graph Neural Networks in Google Maps
Production GNN that reduced negative ETA outcomes by 40%+ in cities like Sydney; deployed at scale.
Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting
Foundational spatiotemporal GNN with 5,000+ citations; models traffic as diffusion on road graphs.
A Review of Self-Exciting Spatio-Temporal Point Processes and Their Applications
Definitive Hawkes process review for modeling clustered demand in surge pricing and fraud detection.
Location Choice & Site Selection
Choose optimal locations for stores or services
Maximum Coverage Location Problem
Seminal optimization formulation for facility siting under coverage constraints.
Spatial Interaction Models
Entropy-maximization derivation of gravity and retail models.
Starbucks Store Locator with Machine Learning
ML-driven site selection combining foot traffic, demographics, and competition.
Competitive Location Models: A Review
Authoritative review covering Hotelling spatial competition, gravity-based capture models, and location games.
Geo-Spotting: Mining Online Location-Based Services for Optimal Retail Store Placement
Pioneering ML site selection using Foursquare data with gradient boosted trees; foundational for location analytics.
Facility Location Under Uncertainty: A Review
Canonical review of stochastic and robust facility location for warehouse network design.
Spatial Econometrics
Account for geographic dependencies in analysis
Spatial Econometrics: Methods and Models
Definitive textbook: spatial lag/error models, Moran's I, and inference.
A Spatial Cliff-Ord-Type Model with Heteroskedastic Innovations
Robust GMM estimation for spatial autoregressions—standard in applied work.
PySAL: A Python Library for Spatial Analysis
Open-source library implementing spatial weights, autocorrelation, regression.
Spatial Spillovers in Airbnb Pricing
Demonstrates spatial lag effects in short-term rental markets using hedonic models.
GMM Estimation with Cross Sectional Dependence
Foundational paper on spatial HAC standard errors ('Conley SEs'); industry-standard correction.
Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
Causal forests for heterogeneous treatment effects with valid confidence intervals; implemented in grf and EconML.
Geographic Boundaries as Regression Discontinuities
Foundational geographic RDD methodology addressing boundary selection and spatial inference.
Mapping & Geocoding
Convert addresses to coordinates and match locations
libpostal: International Address Parsing
NLP library parsing addresses in 60+ languages—widely used in ride-sharing and logistics.
H3: Uber's Hexagonal Hierarchical Spatial Index
Hexagonal grid system enabling efficient geospatial joins and aggregations.
Placekey: A Universal Place Identifier
Standardized POI identifier joining disparate datasets without address matching.
Bing Geocoding
Production geocoding system at Bing scale; covers parsing, candidate generation, and ranking pipeline.
Probabilistic Record Linkage and a Method to Calculate the Positive Predictive Value
HMM-based address parsing and record linkage; foundational for modern address normalization.
Mobility & Movement Patterns
Analyze how people move through space
Understanding Individual Human Mobility Patterns
Large-scale mobile-phone analysis revealing power-law travel distributions.
The Universal Visitation Law of Human Mobility
Unifying framework explaining visitation frequency to locations across cities.
DeepMove: Predicting Human Mobility
Attention-based RNN capturing periodicity and transition patterns in check-ins.
Google Mobility Reports
Aggregated mobility trends by place category—used globally for pandemic response.
SafeGraph Foot Traffic Data
Panel-based POI visit data enabling offline-to-online attribution and demand studies.
Fear, Lockdown, and Diversion: Comparing Drivers of Pandemic Economic Decline
Uses smartphone mobility to show voluntary behavioral changes dwarfed lockdown effects on foot traffic.
Creating Synthetic Baseline Populations
Iterative Proportional Fitting (IPF) to generate representative synthetic populations for microsimulation.
Comprehensive Econometric Microsimulator for Daily Activity-Travel Patterns
CEMDAP: full-day activity-travel simulator; basis for large-scale urban demand models.