From classical statistical models to reinforcement learning, projects that create business value.
Problem: What is the real impact of our email campaigns on conversions?
Complete A/B testing platform combining Bayesian analysis and Causal ML to go beyond simple correlation and measure causal impact of marketing interventions.
Approach:
Results: On 64,000 e-commerce customers, identified +16% lift for βMensβ emails vs +8% for βWomensβ.
Technologies: Python, PyMC, CausalML, SHAP, FastAPI, Streamlit
Problem: How to optimize prices in real-time based on context and demand?
Project exploring surge pricing strategies used by Uber and Lyft, with methodological progression from econometrics to reinforcement learning.
Progressive approach:
Dataset: ~700K Uber & Lyft rides (Boston, Nov 2018)
Technologies: Python, PyMC, ArviZ, scikit-learn, Streamlit
Problem: How to deploy an ML model end-to-end with modern architecture?
Demonstrative project of a complete ML architecture: from training to deployment, through API and frontend, with R and Python integration.
Architecture:
DuckDB β scikit-learn β Vetiver β FastAPI β Shiny (Python & R)
Highlights:
Technologies: Python, R, scikit-learn, Vetiver, FastAPI, Shiny, Docker, GitHub Actions
Problem: Predict house sale prices in Iowa from their characteristics.
Final project for Harvard Data Science certificate, exploring and comparing many regression approaches.
Models tested:
Results: Systematic performance comparison and identification of most important features.
Technologies: R, RMarkdown, caret, xgboost, randomForest