Bayes Experiment Analysis

Dani Mermelstein

This is a Dockerized/containerized skeleton service for applying Bayesian statistics to A/B tests and outputting standardized reports. It leverages and productionizes the BayesABTest package and Quarto publishing system. The service will result in a website with a front page listing all tests conducted and various HTML outputs for each experiment.

An example of this service can be run from Replit 😄! UPDATE JAN 2024: There have been some changes to resources Replit allocates to free repls, so it might no longer run there. Will run on machines with at least 1GB RAM.

Example of the front page:

Example of an experiment result page:

Usage:

Start the service with docker-compose up from the terminal.

It all depends on the experiments.yaml file. This is where metadata is listed for each experiment. This metadata could be expanded to include a summary, logic on whether the experiment has finished running, or anything else that you might want to insert in the final writeup. This service is a skeleton, so really up to you.

Quarto is used to generate all HTML files related to experiments, which might get a little heavy depending on how many experiments you run. A more lightweight version could use a similar method to the generation of index.html where you have a template file to which you insert charts and variable values. Quarto renders every experiment through the analysis_output Jupyter notebook (although this could be switched to a markdown file with embedded code).

The service could generate reporting on a schedule (eg daily) or when new experiments are concluded (ie with some trigger).

The default included analysis is for split tests that are judged on conversion rates and requires a binary outcome (ie conversion happened yes/no). BayesABTest additionally allows for analysis of continuous or discrete variables (eg minutes spend on calls, deposit amounts, account balance) which would only require some minor tweaking. It would also be possible to implement different SQL queries for data gathering, depending on the KPI being measured, or different traditional statistical tests as needed.

Suggested hosting method:

The most stable end state for hosting the output would probably be a static website on S3. That's because this is not an app, and while it is possible to connect to a database (ahem strongly suggested) the output is static HTML files. A static website would also result in improved uptime, less downtime, and not having to figure out what's wrong with the server. That said, the service does default to hosting on a local server which is useful for local testing but can be easily removed.

Hopefully this provides the heavy lifting for you to analyze experiments at scale. Happy testing!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yaml		docker-compose.yaml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

.gitignore

.gitignore

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

docker-compose.yaml

docker-compose.yaml

requirements.txt

requirements.txt

Repository files navigation

Bayes Experiment Analysis

Example of the front page:

Example of an experiment result page:

Usage:

Suggested hosting method:

About

Releases

Packages

Languages

License

mermelstein/experiment_analysis

Folders and files

Latest commit

History

Repository files navigation

Bayes Experiment Analysis

Example of the front page:

Example of an experiment result page:

Usage:

Suggested hosting method:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages