Sam’s Blog

Sam Anklesaria

Data scientist at Minnesota Department of Human Services. Interested in explainable models and causal inference.

Bayesian and Frequentist Sample Size Estimation

This post reviews two approaches to sample size estimation in A/B/n testing: one Frequentist, and one Bayesian. Say we’re trying to test which variant of an ...

Topic Extraction (Old and New)

How do you find thematic clusters in a large corpus of text documents? There are the standared algorithms baked into sklearn: k-means, nonnegative matrix fac...

Random Walks on Polls

A while ago, the Economist published an explanation of their model of the 2016 US presidential election. This post implements a simplified form of that model.

Converting Dataframes to Pytrees

Numpyro models expect data in the form of a pytrees: nested lists or dictionaries of multidimensional arrays. But datasets usually don’t start that way: we u...

Synthetic Controls in Texas

Replicating a syntheic control study with a Bayesian workflow

GP Matching in Observational Studies

Accounting for confounders using Gaussian Processes

Advent Of Code

I’ve been playing Advent of Code this year. I started writing solutions in R, but soon realized I was better served by a more general purpose language. Julia...

Basu’s Bayesian Bears

Bayesian calibrated survey sampling

Recent Posts