No libraries beyond the Anaconda distribution of Python should be needed to run the code here. The code should run without issues on any Python 3.* version.
For this project, I was interested in using Stack Overflow data from 2020 to better understand the differences between female and male developers:
- What is the difference in income between professional female and male developers?
- What was the difference in income in previous years?
- How is salary distributed in terms of professional coding experience?
- Is there a gender bias in female and male incomes?
There are four notebooks available here to showcase work related to the above questions. Each notebook explores the data for the question named in its title. Markdown cells explain the individual steps.
The main findings are summarized in the post available here.
Credit goes to Stack Overflow for the data. You can find the licensing for the data and other descriptive information at the Kaggle link available here. For the bootstrap analysis, the basic function from DataCamp's Statistical Thinking in Python (Part 2) was used and adapted to meet my requirements. Otherwise, feel free to use the code here as you would like!
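For readers unfamiliar with the bootstrap analysis mentioned above, here is a minimal sketch of the general technique: resample each group with replacement, recompute a statistic, and read a confidence interval off the resulting distribution. This is not the exact code from the notebooks. The function names `draw_bs_reps` and `bootstrap_diff_ci`, the choice of the median as the statistic, and the sample numbers are all assumptions made for illustration.

```python
import numpy as np


def draw_bs_reps(data, func, size=1, seed=None):
    """Return `size` bootstrap replicates of `func` applied to `data`.

    Hypothetical helper for illustration; the notebooks may differ.
    """
    rng = np.random.default_rng(seed)
    data = np.asarray(data)
    reps = np.empty(size)
    for i in range(size):
        # Resample with replacement, keeping the original sample size.
        sample = rng.choice(data, size=len(data), replace=True)
        reps[i] = func(sample)
    return reps


def bootstrap_diff_ci(a, b, func=np.median, size=10_000, alpha=0.05, seed=None):
    """Percentile confidence interval for func(a) - func(b).

    Hypothetical helper; the two groups are resampled independently.
    """
    rng = np.random.default_rng(seed)
    a = np.asarray(a)
    b = np.asarray(b)
    diffs = np.empty(size)
    for i in range(size):
        a_sample = rng.choice(a, size=len(a), replace=True)
        b_sample = rng.choice(b, size=len(b), replace=True)
        diffs[i] = func(a_sample) - func(b_sample)
    lower, upper = np.percentile(diffs, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return lower, upper


if __name__ == "__main__":
    # Made-up income figures, NOT values from the survey data.
    group_a = [52_000, 61_000, 58_000, 75_000, 49_000, 66_000]
    group_b = [50_000, 57_000, 63_000, 70_000, 45_000, 59_000]
    print(bootstrap_diff_ci(group_a, group_b, size=2_000, seed=0))
```

If the resulting interval contains zero, the resampled data do not show a clear difference in the chosen statistic between the two groups at the selected level.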