tech-econ

ContextualBandits

Implements a wide range of contextual bandit algorithms (linear, tree-based, neural) and off-policy evaluation methods.

A/B testing experimentation machine learning

Adaptive Experimentation & Bandits

MABWiser

Production-ready, scikit-learn style library for contextual & stochastic bandits with parallelism and simulation tools.

A/B testing experimentation

Adaptive Experimentation & Bandits

Open Bandit Pipeline (OBP)

Framework for **offline evaluation (OPE)** of bandit policies using logged data. Implements IPS, DR, DM estimators.

A/B testing experimentation

Adaptive Experimentation & Bandits

PyXAB

Library for advanced bandit problems: X-armed bandits (continuous/structured action spaces) and online optimization.

A/B testing experimentation

Adaptive Experimentation & Bandits

SMPyBandits

Comprehensive research framework for single/multi-player MAB algorithms (stochastic, adversarial, contextual).

A/B testing experimentation

Adaptive Experimentation & Bandits

Bayesian Econometrics

Bambi

High-level interface for building Bayesian GLMMs, built on top of PyMC. Uses formula syntax similar to R's `lme4`.

Bayesian inference

Bayesian Econometrics

LightweightMMM

Bayesian Marketing Mix Modeling (see Marketing Mix Models section).

Bayesian inference

Bayesian Econometrics

NumPyro

Probabilistic programming library built on JAX for scalable Bayesian inference, often faster than PyMC.

Bayesian inference

Bayesian Econometrics

PyMC

Flexible probabilistic programming library for Bayesian modeling and inference using MCMC algorithms (NUTS).

Bayesian inference

Bayesian Econometrics

Causal Discovery & Graphical Models

Ananke

Causal inference using graphical models (DAGs), including identification theory and effect estimation.

causal inference graphs

Causal Discovery & Graphical Models

Causal Discovery Toolbox (CDT)

Implements algorithms for causal discovery (recovering causal graph structure) from observational data.

causal inference graphs

Causal Discovery & Graphical Models

CausalNex

Uses Bayesian Networks for causal reasoning, combining ML with expert knowledge to model relationships.

causal inference graphs Bayesian

Causal Discovery & Graphical Models

LiNGAM

Specialized package for learning non-Gaussian linear causal models, implementing various versions of the LiNGAM algorithm including ICA-based methods.

causal inference graphs

Causal Discovery & Graphical Models

Tigramite

Specialized package for causal inference in time series data implementing PCMCI, PCMCIplus, LPCMCI algorithms with conditional independence tests.

causal inference graphs

Causal Discovery & Graphical Models

causal-learn

Comprehensive Python package serving as Python translation and extension of Java-based Tetrad toolkit for causal discovery algorithms.

causal inference graphs

Causal Discovery & Graphical Models

gCastle

Huawei Noah's Ark Lab end-to-end causal structure learning toolchain emphasizing gradient-based methods with GPU acceleration (NOTEARS, GOLEM).

causal inference graphs

Causal Discovery & Graphical Models

py-tetrad

Python interface to Tetrad Java library using JPype, providing direct access to Tetrad's causal discovery algorithms with efficient data translation.

causal inference graphs

Causal Discovery & Graphical Models

Causal Inference & Matching

CausalInference

Implements classical causal inference methods like propensity score matching, inverse probability weighting, stratification.

causal inference matching

Causal Inference & Matching

CausalLib

IBM-developed package that provides a scikit-learn-inspired API for causal inference with meta-algorithms supporting arbitrary machine learning models.

causal inference matching

Causal Inference & Matching

CausalML

Focuses on uplift modeling and heterogeneous treatment effect estimation using machine learning techniques.

causal inference matching

Causal Inference & Matching

CausalMatch

Implements Propensity Score Matching (PSM) and Coarsened Exact Matching (CEM) with ML flexibility for propensity score estimation.

causal inference matching

Causal Inference & Matching

CausalPlayground

Python library for causal research that addresses the scarcity of real-world datasets with known causal relations. Provides fine-grained control over structural causal models.

causal inference matching

Causal Inference & Matching

CausalPy

Developed by PyMC Labs, focuses specifically on causal inference in quasi-experimental settings. Specializes in scenarios where randomization is impossible or expensive.

causal inference matching

Causal Inference & Matching

DoWhy

End-to-end framework for causal inference based on causal graphs (DAGs) and potential outcomes. Covers identification, estimation, refutation.

causal inference matching

Causal Inference & Matching

fastmatch

Fast k-nearest-neighbor matching for large datasets using Facebook's FAISS library.

causal inference matching

Causal Inference & Matching

scikit-uplift

Focuses on uplift modeling and estimating heterogeneous treatment effects using various ML-based methods.

causal inference matching

Causal Inference & Matching

Core Libraries & Linear Models

Scikit-learn

Foundational ML library with regression models (incl. regularized), model selection, cross-validation, evaluation metrics.

regression linear models

Core Libraries & Linear Models

Statsmodels

Comprehensive library for estimating statistical models (OLS, GLM, etc.), conducting tests, and data exploration. Core tool.

regression linear models

Core Libraries & Linear Models

Dimensionality Reduction

FactorAnalyzer

Specialized library for Exploratory (EFA) and Confirmatory (CFA) Factor Analysis with rotation options for interpretability.

machine learning dimensionality

Dimensionality Reduction

openTSNE

Optimized, parallel implementation of t-distributed Stochastic Neighbor Embedding (t-SNE) for large datasets.

machine learning dimensionality

Dimensionality Reduction

umap-learn

Fast and scalable implementation of Uniform Manifold Approximation and Projection (UMAP) for non-linear reduction.

machine learning dimensionality

Dimensionality Reduction

Discrete Choice Models

Biogeme

Maximum likelihood estimation of parametric models, with strong support for complex discrete choice models.

discrete choice logit

Discrete Choice Models

PyBLP

Tools for estimating demand for differentiated products using the Berry-Levinsohn-Pakes (BLP) method.

discrete choice logit

Discrete Choice Models

PyLogit

Flexible implementation of conditional/multinomial logit models with utilities for data preparation.

discrete choice logit

Discrete Choice Models

XLogit

Fast estimation of Multinomial Logit and Mixed Logit models, optimized for performance.

discrete choice logit

Discrete Choice Models

torch-choice

PyTorch framework for flexible estimation of complex discrete choice models, leveraging GPU acceleration.

discrete choice logit

Discrete Choice Models

Double/Debiased Machine Learning (DML)

DoubleML

Implements the double/debiased ML framework (Chernozhukov et al.) for estimating causal parameters (ATE, LATE, POM) with ML nuisances.

machine learning causal inference

Double/Debiased Machine Learning (DML)

EconML

Microsoft toolkit for estimating heterogeneous treatment effects using DML, causal forests, meta-learners, and orthogonal ML methods.

machine learning causal inference

Double/Debiased Machine Learning (DML)

pydoublelasso

Double‑post Lasso estimator for high‑dimensional treatment effects (Belloni‑Chernozhukov‑Hansen 2014).

machine learning causal inference

Double/Debiased Machine Learning (DML)

pyhtelasso

Debiased‑Lasso detector of heterogeneous treatment effects in randomized experiments.

machine learning causal inference

Double/Debiased Machine Learning (DML)

Instrumental Variables (IV) & GMM

py-econometrics `gmm`

Lightweight package for setting up and estimating custom GMM models based on user-defined moment conditions.

IV GMM

Instrumental Variables (IV) & GMM

Marketing Mix Models (MMM) & Business Analytics

Lifetimes

Analyze customer lifetime value (CLV) using probabilistic models (BG/NBD, Pareto/NBD) to predict purchases.

marketing analytics

Marketing Mix Models (MMM) & Business Analytics

MaMiMo

Lightweight Python library focused specifically on Marketing Mix Modeling implementation.

marketing analytics

Marketing Mix Models (MMM) & Business Analytics

PyMC Marketing

Collection of Bayesian marketing models built with PyMC, including MMM, CLV, and attribution.

marketing analytics Bayesian

Marketing Mix Models (MMM) & Business Analytics

mmm_stan

Python/STAN implementation of Bayesian Marketing Mix Models.

marketing analytics Bayesian

Marketing Mix Models (MMM) & Business Analytics

Natural Language Processing for Economics

Gensim

Library focused on topic modeling (LDA, LSI) and document similarity analysis.

NLP text analysis

Natural Language Processing for Economics

Transformers

Access to thousands of pre-trained models for NLP tasks like text classification, summarization, embeddings, etc.

NLP text analysis

Natural Language Processing for Economics

spaCy

Industrial-strength NLP library for efficient text processing pipelines (NER, POS tagging, etc.).

NLP text analysis

Natural Language Processing for Economics

Numerical Optimization & Computational Tools

JAX

High-performance numerical computing with autograd and XLA compilation on CPU/GPU/TPU.

optimization computation

Numerical Optimization & Computational Tools

PyTorch

Popular deep learning framework with flexible automatic differentiation.

optimization computation machine learning

Numerical Optimization & Computational Tools

Panel Data & Fixed Effects

FixedEffectModelPyHDFE

Solves linear models with high-dimensional fixed effects, supporting robust variance calculation and IV.

panel data fixed effects

Panel Data & Fixed Effects

Linearmodels

Estimation of fixed, random, pooled OLS models for panel data. Also Fama-MacBeth and between/first-difference estimators.

panel data fixed effects

Panel Data & Fixed Effects

PyFixest

Fast estimation of linear models with multiple high-dimensional fixed effects (like R's `fixest`). Supports OLS, IV, Poisson, robust/cluster SEs.

panel data fixed effects

Panel Data & Fixed Effects

duckreg

Out-of-core regression (OLS/IV) for very large datasets using DuckDB aggregation. Handles data that doesn't fit in memory.

panel data fixed effects

Panel Data & Fixed Effects

pydynpd

Estimation of dynamic panel data models using Arellano-Bond (Difference GMM) and Blundell-Bond (System GMM). Includes Windmeijer correction & tests.

panel data fixed effects

Panel Data & Fixed Effects

Power Simulation & Design of Experiments

ADOpy

Bayesian Adaptive Design Optimization (ADO) for tuning experiments in real-time, with models for psychometric tasks.

power analysis experiments Bayesian

Power Simulation & Design of Experiments

Adaptive

Parallel active learning library for adaptive function sampling/evaluation, with live plotting for monitoring.

power analysis experiments

Power Simulation & Design of Experiments

DoEgen

Automates generation and optimization of designs, especially for mixed factor-level experiments; computes efficiency metrics.

power analysis experiments

Power Simulation & Design of Experiments

pyDOE2

Implements classical Design of Experiments: factorial (full/fractional), response surface (Box-Behnken, CCD), Latin Hypercube.

power analysis experiments

Power Simulation & Design of Experiments

Program Evaluation Methods (DiD, SC, RDD)

CausalImpact

Python port of Google's R package for estimating causal effects of interventions on time series using Bayesian structural time-series models.

DiD synthetic control RDD Bayesian

Program Evaluation Methods (DiD, SC, RDD)

Differences

Implements modern difference-in-differences methods for staggered adoption designs (e.g., Callaway & Sant'Anna).

DiD synthetic control RDD

Program Evaluation Methods (DiD, SC, RDD)

SyntheticControlMethods

Implementation of synthetic control methods for comparative case studies when panel data is available.

DiD synthetic control RDD

Program Evaluation Methods (DiD, SC, RDD)

csdid

Python adaptation of the R `did` package. Implements multi-period DiD with staggered treatment timing (Callaway & Sant’Anna).

DiD synthetic control RDD

Program Evaluation Methods (DiD, SC, RDD)

mlsynth

Implements advanced synthetic control methods: forward DiD, cluster SC, factor models, and proximal SC. Designed for single-treated-unit settings.

DiD synthetic control RDD

Program Evaluation Methods (DiD, SC, RDD)

pycinc

Changes‑in‑Changes (CiC) estimator for distributional treatment effects (Athey & Imbens 2006).

DiD synthetic control RDD causal inference

Program Evaluation Methods (DiD, SC, RDD)

pyleebounds

Lee (2009) sample‑selection bounds for treatment effects; trims treated distribution to match selection rates.

DiD synthetic control RDD causal inference

Program Evaluation Methods (DiD, SC, RDD)

rdd

Toolkit for sharp RDD analysis, including bandwidth calculation and estimation, integrating with pandas.

DiD synthetic control RDD

Program Evaluation Methods (DiD, SC, RDD)

rdrobust

Comprehensive tools for Regression Discontinuity Designs (RDD), including optimal bandwidth selection, estimation, inference.

DiD synthetic control RDD

Program Evaluation Methods (DiD, SC, RDD)

Quantile Regression & Distributional Methods

pyqreg

Fast quantile regression solver using interior point methods, supporting robust and clustered standard errors.

quantile regression

Quantile Regression & Distributional Methods

pyrifreg

Recentered Influence‑Function (RIF) regression for unconditional quantile & distributional effects (Firpo et al., 2008).

quantile regression

Quantile Regression & Distributional Methods

quantile-forest

Scikit-learn compatible implementation of Quantile Regression Forests for non-parametric estimation.

quantile regression

Quantile Regression & Distributional Methods

Spatial Econometrics

(PySAL Core)

The broader PySAL ecosystem contains many tools for spatial data handling, weights, visualization, and analysis.

spatial geography

Spatial Econometrics

PySAL (spreg)

The spatial regression `spreg` module of PySAL. Implements spatial lag, error, IV models, and diagnostics.

spatial geography

Spatial Econometrics

Standard Errors, Bootstrapping & Reporting

Awesome Quant

Curated list of quantitative finance libraries and resources (many statistical/TS tools overlap with econometrics).

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

Beyond Jupyter (TransferLab)

Teaches software design principles for ML—modularity, abstraction, and reproducibility—going beyond ad hoc Jupyter workflows. Focus on maintainable, production-quality ML code.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

Causal Inference for the Brave and True

Modern introduction to causal inference methods (DiD, IV, RDD, Synth, ML-based) with Python code examples.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

Coding for Economists

Practical guide by A. Turrell on using Python for modern econometric research, data analysis, and workflows.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

Deep Learning Specialization (Coursera)

Intermediate 5-course series by Andrew Ng covering deep neural networks, CNNs, RNNs, transformers, and real-world DL applications using TensorFlow.

bootstrap standard errors machine learning

Standard Errors, Bootstrapping & Reporting

Machine Learning Specialization (Coursera)

Beginner-friendly 3-course series by Andrew Ng covering core ML methods (regression, classification, clustering, trees, NN) with hands-on projects.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

Python for Econometrics

Comprehensive intro notes by Kevin Sheppard covering Python basics, core libraries, and econometrics applications.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

QuantEcon Lectures

High-quality lecture series on quantitative economic modeling, computational tools, and economics using Python/Julia.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

SciPy Bootstrap

(`scipy.stats.bootstrap`) Computes bootstrap confidence intervals for various statistics using percentile, BCa methods.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

Stargazer

Python port of R's stargazer for creating publication-quality regression tables (HTML, LaTeX) from `statsmodels` & `linearmodels` results.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

The Missing Semester of Your CS Education (MIT)

Teaches essential developer tools often skipped in formal education—command line, Git, Vim, scripting, debugging, etc.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

wildboottest

Fast implementation of various wild cluster bootstrap algorithms (WCR, WCU) for robust inference, especially with few clusters.

bootstrap standard errors

Standard Errors, Bootstrapping & Reporting

State Space & Volatility Models

FilterPy

Focuses on Kalman filters (standard, EKF, UKF) and smoothers with a clear, pedagogical implementation style.

volatility state space

State Space & Volatility Models

Metran

Specialized package for estimating Dynamic Factor Models (DFM) using state-space methods and Kalman filtering.

volatility state space

State Space & Volatility Models

PyKalman

Implements Kalman filter, smoother, and EM algorithm for parameter estimation, including support for missing values and UKF.

volatility state space

State Space & Volatility Models

PyMC Statespace

(See Bayesian) Bayesian state-space modeling using PyMC, integrating Kalman filtering within MCMC for parameter estimation.

volatility state space Bayesian

State Space & Volatility Models

stochvol

Efficient Bayesian estimation of stochastic volatility (SV) models using MCMC.

volatility state space Bayesian

State Space & Volatility Models

Statistical Inference & Hypothesis Testing

Pingouin

User-friendly interface for common statistical tests (ANOVA, ANCOVA, t-tests, correlations, chi², reliability) built on pandas & scipy.

inference hypothesis testing

Statistical Inference & Hypothesis Testing

PyWhy-Stats

Part of the PyWhy ecosystem providing statistical methods specifically for causal applications, including various independence tests and power-divergence methods.

inference hypothesis testing

Statistical Inference & Hypothesis Testing

Scipy.stats

Foundational module within SciPy for a wide range of statistical functions, distributions, and hypothesis tests (t-tests, ANOVA, chi², KS, etc.).

inference hypothesis testing

Statistical Inference & Hypothesis Testing

hypothetical

Library focused on hypothesis testing: ANOVA/MANOVA, t-tests, chi-square, Fisher's exact, nonparametric tests (Mann-Whitney, Kruskal-Wallis, etc.).

inference hypothesis testing

Statistical Inference & Hypothesis Testing

lifelines

Comprehensive library for survival analysis: Kaplan-Meier, Nelson-Aalen, Cox regression, AFT models, handling censored data.

inference hypothesis testing

Statistical Inference & Hypothesis Testing

Structural Econometrics & Estimation

Dolo

Framework for describing and solving economic models (DSGE, OLG, etc.) using a declarative YAML-based format.

structural estimation

Structural Econometrics & Estimation

HARK

Toolkit for solving, simulating, and estimating models with heterogeneous agents (e.g., consumption-saving).

structural estimation

Structural Econometrics & Estimation

QuantEcon.py

Core library for quantitative economics: dynamic programming, Markov chains, game theory, numerical methods.

structural estimation

Structural Econometrics & Estimation

respy

Simulation and estimation of finite-horizon dynamic discrete choice (DDC) models (e.g., labor/education choice).

structural estimation

Structural Econometrics & Estimation

Synthetic Data Generation

SDV (Synthetic Data Vault)

Comprehensive library for generating synthetic tabular, relational, and time series data using various models.

synthetic data simulation

Synthetic Data Generation

Synthpop

Port of the R package for generating synthetic populations based on sample survey data.

synthetic data simulation

Synthetic Data Generation

Time Series Econometrics

ARCH

Specialized library for modeling and forecasting conditional volatility using ARCH, GARCH, EGARCH, and related models.

time series econometrics

Time Series Econometrics

Kats

Broad toolkit for time series analysis, including multivariate analysis, detection (outliers, change points, trends), feature extraction.

time series econometrics

Time Series Econometrics

LocalProjections

Community implementations of Jordà (2005) Local Projections for estimating impulse responses without VAR assumptions.

time series econometrics

Time Series Econometrics

Time Series Forecasting

MLForecast

Scalable time series forecasting using machine learning models (e.g., LightGBM, XGBoost) as regressors.

forecasting time series machine learning

Time Series Forecasting

NeuralForecast

Deep learning models (N-BEATS, N-HiTS, Transformers, RNNs) for time series forecasting, built on PyTorch Lightning.

forecasting time series machine learning

Time Series Forecasting

Prophet

Forecasting procedure for time series with strong seasonality and trend components, developed by Facebook.

forecasting time series

Time Series Forecasting

StatsForecast

Fast, scalable implementations of popular statistical forecasting models (ETS, ARIMA, Theta, etc.) optimized for performance.

forecasting time series

Time Series Forecasting

pmdarima

ARIMA modeling with automatic parameter selection (auto-ARIMA), similar to R's `forecast::auto.arima`.

forecasting time series

Time Series Forecasting

sktime

Unified framework for various time series tasks, including forecasting with classical, ML, and deep learning models.

forecasting time series machine learning

Time Series Forecasting

Tree & Ensemble Methods for Prediction

CatBoost

Gradient boosting library excelling with categorical features (minimal preprocessing needed). Robust against overfitting.

machine learning prediction

Tree & Ensemble Methods for Prediction

LightGBM

Fast, distributed gradient boosting (also supports RF). Known for speed, low memory usage, and handling large datasets.

machine learning prediction

Tree & Ensemble Methods for Prediction

NGBoost

Extends gradient boosting to probabilistic prediction, providing uncertainty estimates alongside point predictions. Built on scikit-learn.

machine learning prediction

Tree & Ensemble Methods for Prediction

Scikit-learn Ens.

(`RandomForestClassifier`/`Regressor`) Widely-used, versatile implementation of Random Forests. Easy API and parallel processing support.

machine learning prediction

Tree & Ensemble Methods for Prediction

XGBoost

High-performance, optimized gradient boosting library (also supports RF). Known for speed, efficiency, and winning competitions.

machine learning prediction

Tree & Ensemble Methods for Prediction

cuML (RAPIDS)

GPU-accelerated implementation of Random Forests for significant speedups on large datasets. Scikit-learn compatible API.

machine learning prediction

Tree & Ensemble Methods for Prediction