🐾 Animal Shelter Outcome Prediction (edX Choose Your Own)

This project was developed as part of the final Capstone for the HarvardX Data Science Professional Certificate.
It explores the prediction of shelter animal outcomes (e.g. Adoption, Transfer, Euthanasia)
at the Austin Animal Center, based on intake-related features.

🎯 Goal

The objective is to predict shelter animal outcomes based on intake data such as
animal type, condition, and intake type.
Model performance is evaluated using accuracy and a confusion matrix; feature importance is used to interpret the model.
All steps follow the edX Honor Code.
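
As a rough illustration of the two performance measures named above, the following base R sketch builds a confusion matrix and derives accuracy from it. The labels are toy values made up for this example; they are not results from the project.

```r
# Toy labels for illustration only; these are NOT results from the project.
actual <- factor(c("Adoption", "Transfer", "Adoption",
                   "Euthanasia", "Transfer", "Adoption"))
predicted <- factor(c("Adoption", "Adoption", "Adoption",
                      "Transfer", "Transfer", "Adoption"),
                    levels = levels(actual))

# Confusion matrix: rows are predictions, columns are true outcomes.
conf_mat <- table(Predicted = predicted, Actual = actual)

# Accuracy is the share of cases on the diagonal of the matrix.
accuracy <- sum(diag(conf_mat)) / sum(conf_mat)

print(conf_mat)
print(accuracy)
```

Giving `predicted` the same factor levels as `actual` keeps the matrix square even when a class is never predicted.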

📚 Data Source

The dataset comes from the Austin Animal Center and is hosted on Kaggle.
It contains detailed records of animal intakes and outcomes.

⚠️ Note: The dataset is not included in this repository.
Please download it manually from Kaggle and use it locally for testing and report generation.
👉 https://www.kaggle.com/datasets/aaronschlegel/austin-animal-center-shelter-intakes-and-outcomes
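
Once the download is in place, loading it locally might look like the sketch below. The path `data/aac_intakes_outcomes.csv` and the helper `load_shelter_data()` are assumptions made for this example; adjust them to match your local copy and the project's own `01_load_data.R`.

```r
# Hypothetical location of the Kaggle download; change it to your local path.
data_path <- "data/aac_intakes_outcomes.csv"

# Hypothetical helper: read the CSV and fail with a clear message if it is missing.
load_shelter_data <- function(path) {
  if (!file.exists(path)) {
    stop("Dataset not found at '", path, "'. Download it from Kaggle first.")
  }
  read.csv(path, stringsAsFactors = FALSE, na.strings = c("", "NA"))
}

# Only load when the file is actually present.
if (file.exists(data_path)) {
  shelter <- load_shelter_data(data_path)
  str(shelter)
}
```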

🗂️ Project Structure

| File | Description |
|------|-------------|
| `01_load_data.R` | Load and inspect the shelter dataset |
| `02_explore_data.R` | Exploratory data analysis (EDA): distributions, NA overview |
| `03_model_baseline.R` | Baseline model: predict most frequent outcome ("Adoption") |
| `04_model_randomforest.R` | Random Forest model with 5-fold CV and feature importance |
| `05_compare_models.R` | Comparison of baseline vs. Random Forest (accuracy & plots) |
| `06_final_model.R` | Final model application without CV, final evaluation |
| `07_final_pipeline.R` | Complete pipeline with all steps and explanatory comments |
| `chooseyourproject_report.Rmd` | Final R Markdown report (edX-compliant) |
| `chooseyourproject_report.pdf` | Rendered PDF version for submission |
| `chooseyourproject_report.html` | Rendered HTML version |
| `LICENSE` | MIT License for reuse |
| `.gitignore` | Excludes data files and system folders |
| `README.md` | This project overview |
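
The baseline step listed above (always predict the most frequent outcome) can be sketched in a few lines of base R. This is a minimal illustration with made-up labels, not the contents of `03_model_baseline.R`.

```r
# Toy outcomes for illustration only; the real labels come from the Kaggle dataset.
train_outcome <- c("Adoption", "Adoption", "Transfer",
                   "Adoption", "Euthanasia", "Transfer")
test_outcome <- c("Adoption", "Transfer", "Adoption", "Euthanasia")

# The baseline ignores all features and always predicts
# the most frequent class seen in the training data.
majority_class <- names(which.max(table(train_outcome)))

baseline_pred <- rep(majority_class, length(test_outcome))
baseline_accuracy <- mean(baseline_pred == test_outcome)

print(majority_class)
print(baseline_accuracy)
```

Because the prediction never changes, the baseline accuracy equals the share of the majority class in the evaluation data, which gives any real model a floor to beat.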

🔎 Final Results

| Metric | Value |
|--------|-------|
| Baseline Accuracy | 42.18 % |
| Random Forest Accuracy | 58.09 % |
| Absolute Improvement | +15.91 pp |
| Relative Improvement | +37.7 % |
| Final Model Trees | 500 |
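
The two improvement figures follow directly from the two reported accuracies. The short calculation below reproduces them; the variable names are chosen for this example.

```r
# Accuracies as reported in the results table.
baseline_acc <- 0.4218
rf_acc <- 0.5809

# Absolute improvement in percentage points (pp).
absolute_pp <- (rf_acc - baseline_acc) * 100

# Relative improvement: the gain expressed as a share of the baseline accuracy.
relative_pct <- (rf_acc - baseline_acc) / baseline_acc * 100

print(round(absolute_pp, 2))
print(round(relative_pct, 1))
```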

The Random Forest classifier outperformed the naive baseline by 15.91 percentage points.
Most important predictors: `intake_type`, `sex_upon_intake`, `intake_condition`.
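
A Random Forest with 5-fold cross-validation and a variable importance ranking could be set up with `caret` roughly as follows. This is a sketch under stated assumptions, not the project's actual script: the data is randomly generated, the factor levels are invented, and the target column name `outcome_type` is assumed. Only the three predictor names and the 500 trees come from this README.

```r
library(caret)         # train(), trainControl(), varImp()
library(randomForest)  # backend used by method = "rf"

set.seed(42)

# Synthetic stand-in data: random values with invented factor levels.
# Any accuracy obtained here says nothing about the real shelter data.
n <- 300
shelter <- data.frame(
  intake_type      = factor(sample(c("Stray", "Surrender", "Wildlife"), n, replace = TRUE)),
  sex_upon_intake  = factor(sample(c("Male", "Female", "Unknown"), n, replace = TRUE)),
  intake_condition = factor(sample(c("Normal", "Injured", "Sick"), n, replace = TRUE)),
  outcome_type     = factor(sample(c("Adoption", "Transfer", "Euthanasia"), n, replace = TRUE))
)

# 5-fold cross-validation.
cv_control <- trainControl(method = "cv", number = 5)

# ntree is passed through to randomForest(); 500 matches the results table.
rf_fit <- train(
  outcome_type ~ intake_type + sex_upon_intake + intake_condition,
  data      = shelter,
  method    = "rf",
  trControl = cv_control,
  ntree     = 500
)

# Cross-validated accuracy for each mtry value that caret tried.
print(rf_fit$results[, c("mtry", "Accuracy")])

# Variable importance of the final model, scaled by caret to 0-100.
print(varImp(rf_fit))
```

With the formula interface, `caret` expands each factor into dummy columns, so importance is reported per factor level rather than per original variable.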

💻 Requirements

R 4.x or newer
RStudio
Required R packages:
- tidyverse
- caret
- randomForest
- scales
- tidytext (for variable importance visualization)
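
To check which of these packages are missing before running the scripts, a small helper like the one below may be useful. The function name `find_missing_packages()` is made up for this example, and the install call is left commented out so that nothing is installed unintentionally.

```r
# Packages listed in the requirements above.
required_pkgs <- c("tidyverse", "caret", "randomForest", "scales", "tidytext")

# Hypothetical helper: return the packages that are not installed yet.
find_missing_packages <- function(pkgs) {
  setdiff(pkgs, rownames(installed.packages()))
}

missing_pkgs <- find_missing_packages(required_pkgs)

if (length(missing_pkgs) > 0) {
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
  # install.packages(missing_pkgs)  # uncomment to install from CRAN
}
```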

📄 Report Access

The final report submitted for edX is available in two formats:

- PDF: `chooseyourproject_report.pdf`
- HTML: `chooseyourproject_report.html`

The report includes all modeling steps, evaluations, plots, and interpretations.

👩‍💻 Author and License

This project was created by Yvonne Kirschler
and is licensed under the MIT License.

If you reuse code from this repository, please provide proper attribution.

GitHub profile: @alunera-data
LinkedIn: Yvonne Kirschler

This project was developed independently.
ChatGPT (OpenAI) was used to support structure, planning and phrasing.
All modeling, evaluation and reporting were performed and reviewed by the author.
