RBA Claims (EDA & Modeling)

This notebook provides an extensive framework to perform exploratory data analysis (EDA) and modeling on claims data. The analysis and modeling aims to find the statistical significance of various marketing metrics to insuarance claims (sales). Given the nuanced sector in which this analysis was performed on, the results varied and warrant further exploration -- with search based spending showing more "correlation." The amount of unknowns outweighed the amount of data available to attribute a large portion of market spends to actual claims.

Overview

The notebook is structured into several sections:

Import Libraries & Initialize Functions
- Loads essential libraries for data processing (e.g., pandas, numpy), visualization (e.g., matplotlib, seaborn), and modeling (e.g., scikit-learn, statsmodels).
- Initializes utility functions (e.g., for safe division and data processing).
Gather and Process Datasets
- Pulls claims data (from Excel files and/or Snowflake) and RBA data.
- Cleans and preprocesses the data:
  - Date parsing and reindexing.
  - Handling missing values/extrapolation via interpolation.
  - Converting data types and reformatting headers.
Exploratory Data Analysis (EDA)
- Investigates the dataset by performing:
  - Seasonal decomposition to evaluate trends.
  - Autocorrelation analysis.
  - Visualization of the correlation matrix and heatmaps.
  - Joint plots and monthly time series (line) plots.
Modeling
- Implements various regression techniques (e.g., Linear Regression, Ridge, Lasso, Random Forest, Gradient Boosting) for evaluating claims.
- Includes cross-validation and parameter tuning pipelines.
- Evaluates model performance using metrics such as MSE, MAE, R², etc.
Visualization of Model Outputs
- Provides visual insights into model performance.
- Explores the impact of impression lags on overall model accuracy.
- Contains sections for further exploration such as combined LSTM and digital LSTM approaches for sequential data modeling.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.DS_Store		.DS_Store
README.md		README.md
modeling_flowchart.png		modeling_flowchart.png
rba_claims_eda_modeling.ipynb		rba_claims_eda_modeling.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RBA Claims (EDA & Modeling)

Overview

About

Uh oh!

Contributors 2

Uh oh!

Languages

jra333/claims-attribution-analysis

Folders and files

Latest commit

History

Repository files navigation

RBA Claims (EDA & Modeling)

Overview

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors 2

Uh oh!

Languages