Skip to content
View VictorOwinoKe's full-sized avatar
Block or Report

Block or report VictorOwinoKe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
VictorOwinoKe/README.md

Victor Owino - Data Analyst Portfolio

About Me SQL Project Data Analysis Python Projects BI and Dashboard Projects Publications Contact Me

About Me

Hi, I'm Victor Owino, a seasoned Data and Analytics professional with over 3 years experience in the data engineering, analytics and Bussiness Intelligence (BI). Over the years, I have developed a strong foundation in the life sciences, passion and commitment to leveraging technology and data to craft innovative data-driven solutions across diverse sectors especially in areas like healthcare, education, social impact, retail, and insurance. I've got a knack for finding solutions, and I'm always on the lookout for new challenges in other domains.

I take pride in my strength, which lies in my ability to empower companies and organizations with the right data, in the right format, analyzed to perfection, and transformed into actionable insights for informed decision-making. For me, creating user-centric BI Dashboards and Reports isn't just a skill; it's my passion, turning data into insights that empower decisions. I am deeply committed to enabling everybody in an organization, irrespective of their technical know-how, to work with data comfortably, to feel confident talking about it, and, as a result, make data-informed decisions and build customer experiences powered by data. You can refer to me as a versatile data analyst who handles everything from managing the nitty-gritty data pipelines to crafting beautiful and insightful end-user dashboards. Whether working on a team or independently, I am driven by the thrill of discovering new insights and the satisfaction of using data to solve problems.

Victor led a team in the design and implementation of a highly scalable data pipeline and BI solution from the ground up. The goal was to centralize and optimize data storage and processing, enabling the organization to harness the power of its data for advanced analytics, reporting, and business insights. The project entails an automated data pipelines, ingesting and transforming data from diverse sources into Google BigQuery, a cloud data warehouse. The project culminated in a meticulously curated and user-friendly Power BI dashboard, making data exploration and decision-making an intuitive and accessible process for stakeholders across the organization.

In my leisure hours, I enjoy waltching and playing soccer (Manchester United Fun) and board games like Chess. I also find relaxation in the virtual world, sometimes immersing myself in video games such as FIFA.

This repository is my Project Portfolio showcasing a range of technical prowess, problem-solving thought processes, and domain expertise.

Projects SQL Project Data Analysis Python Projects BI and Dashboard Projects

1.0 SQL Projects

SQL is a must-have skill for any aspiring data practitioner. Many modern companies store vast amounts of their data in various tables of relational databases. To extract the necessary data from a database for further manipulation and data analysis, you must have a good grasp of SQL. This repository serves as a showcase for my SQL projects

1.1 Covid 19 Data Exploration

Code: COVID Portfolio Project

Description: The dataset contains records of Covid-19 cases, deaths and vaccine records by country in 2020-2021. This project includes the following steps: data loading, data cleaning and preprocessing and EDA (exploratory data analysis).

Objective: Using SQL queries to provide insights into infection rates, death percentages, and vaccination progress at both global and regional levels. The results are stored in temporary tables and views for further analysis and visualization.

Skills: Joins, CTE's, Temp Tables, Windows Functions, Aggregate Functions, Creating Views, Converting Data Types

Technology: SQL Server

1.2 SQL Data Cleaning Queries: Nashville Housing.sql

Code: SQL Data Cleaning Queries: Nashville Housing.sql

Description: The dataset contains a list of houses that have been sold in Nashville between 2013 and 2019. This project includes the following steps: data loading, data cleaning and preprocessing.

Objective: To clean and enhance the NashvilleHousing data table by standardizing date formats, populating missing property addresses, breaking full addresses into individual components, converting specific values, removing duplicates, and deleting unused columns.

Skills: DML(Data Manipulation Language), DQL (Data Query Language), DDL (Data Definition Language).

Technology: SQL Server

1.3 Student Mental EDA SQL & Python

Code: Student Mental Analysis.ipynb

Description: Does going to university in a different country affect your mental health? Studying abroad can be both exciting and difficult. But what might be contributing to this? A Japanese international university surveyed its students in 2018 and published a study the following year that was approved by several ethical and regulatory boards.

The study found that international students have a higher risk of mental health difficulties than the general population, and that social connectedness (belonging to a social group) and acculturative stress (stress associated with joining a new culture) are predictive of depression.

Objective: To explore the students data using SQL and Python to find out if we can come to a similar conclusion for international students and see if the length of stay is a contributing factor.

Skills: DML(Data Manipulation Language), DQL (Data Query Language), DDL (Data Definition Language), Python (Pandas)

Technology: SQL Server , Notebook

More SQL Projects Here

2.0 Data Analysis Python Projects SQL Project BI and Dashboard Projects

Learning Python is crucial for anyone interested in working with data. This repository showcases my Python projects, demonstrating my ability to analyze, manipulate and automate data analysis using python

2.1 Student Mental Analysis[EDA + ML]

Code: Student Mental Analysis[EDA + ML]

How do factors, such as CGPA , Course, Year of Study and Age together with other academic factors influence mental health outcomes of collage students?

Description: The transition from high school to university is a critical period in a student's life, often accompanied by significant emotional and mental challenges. This project aims to address the importance of mental health in college students by employing data analysis techniques, specifically focusing on EDA and the implementation of a machine learning model to infer how various academic factors influence college students’ mental health status.

Objective: To perform Exploratory Data analysis (EDA) and Inferential Statistics with a n ML Classifier model to understand the correlation efficient of various associated with mental health challenges among college students. The ultimate the goal is to provide actionable recommendations for targeted support towards curbing the depressive impacts.

Skills: LogisticRegression, EDA, Model Training, RandomForestClassifier, Aggregate Functions, Pandas Data Manipulation, Matplotlib

Technology: Python Notebook

2.2 Optimizing Online Learning Engagement: Udacity A/B Testing Experiment

Code: Udacity A/B Testing Experiment

Can asking students in advance about their time commitment reduce early course cancellations in online education??

Description: In the experiment, Udacity tested a change where if the student clicked "start free trial", they were asked how much time they had available to devote to the course. If the student indicated 5 or more hours per week, they would be taken through the checkout process as usual. If they indicated fewer than 5 hours per week, a message would appear indicating that Udacity courses usually require a greater time commitment for successful completion, and suggesting that the student might like to access the course materials for free. At this point, the student would have the option to continue enrolling in the free trial, or access the course materials for free instead. This screenshot shows what the experiment looks like.

Objective: Ivestigate if seting clearer expectations for students upfront, help in reducing the number of frustrated students who left the free trial because they didn't have enough time—without significantly reducing the number of students to continue past the free trial and eventually complete the course.

Skills: A/B Testing, EDA, scipy, Pandas, Math

Technology: Python Notebook

More Python Data Analysis Projects

3.0 Data Vizualization and Dashboards (Power BI) SQL Project Data Analysis Python Projects

Every day your business generates more data on sales revenue, marketing performance, customer interactions, inventory levels, production metrics, staffing levels, costs, and other KPIs. But with so much data to sift through, it can be difficult for people to see the story it tells.Data visualization brings data to life, making you the master storyteller of the insights hidden within your numbers.

Below are sample projects through live data dashboards, interactive reports, charts, graphs, and other visual representations, that demostrate how we can use data visualization to helps users develop powerful business insight quickly and effectively.

3.1 Healthcare Insurance Analytics Dashboard Solutions

image

Live Dashboard: Healthcare Insurance Analytics Dashboard Solutions

Not only the financial loss is a great concern but also to protect the healthcare system so that they can provide quality and safe care to legitimate patients. Say goodbye to critical pain points in the insurance industry with realtime insurance analytics dashboard powered by Power BI.

Description: Statistics shows that 15% of the total medicare expense are caused due to fraud claims. Insurance companies are the most vulnerable institutions impacted due to these bad practices. Insuarance premium is also increasing day by day due to this bad practice. The dataset includes healthcare claims data with details about procedures, diagnoses, and billing amounts. Preprocessing involves:

  • Identifying anomalies.
  • Creating features for claim patterns.
  • Labeling claims as legitimate or suspicious.

Objective: This project involves building a realtime dashboard to detect fraudulent healthcare claims using historical claims data and data analytics techniques.

Skills: Power Querry, Power BI, Dashboard, Data Vizualization, Insurence Claims

Technology: Microsoft Power BI

3.2 Early Childhood Development Impact Dashboard

"Imagine a world where all children regardless of where they are born, have the opportunity to reach their full potential." image

Description: Every day, thousands of working mothers in East Africa’s informal settlements drop off their young children at unlicensed and congested “babycare” centres.As a result, the health, growth and development of the child is severely compromised. Without supportive care in their early years, these children enter school with physical & learning disabilities that result in them being locked in an intergenerational cycle of poverty that is near impossible to escape. Kidogo is dedicated to chamge this narrative by enhancing access to high-quality, affordable Early Childhood Care & Education in low-income communities across East Africa. Kidogo firmly believes that offering comprehensive and quality childcare and education during the critical first five years of a child's life paves the way for them to grow into content, healthy adults, and valuable contributors to society. "This project utilizes a real dataset that has undergone deidentification, ensuring the protection of individual privacy while allowing for meaningful analysis and insights."

Objective: This Power BI dashboard serves as a real-time analytics tool, aiding Kidogo in measuring the impact of their Theory of Change.

image

image

Skills: Power Querry, Power BI, Dashboard, Data Vizualization, CREDI , ECDI, Profitability

Technology: Microsoft Power BI

More BI and dashbord Projects Here

Speaking and Community Engagements

  • Mentor and Speaker at the Datathon 2023 Kenyan Chapter held at the Moringa School, organised by DTE consultacy, Predictive Analystics, Konza Technopolis and Mescript Analytics.

  • Speaker at Data Science Summit (2022): Presented on the topic of "Using Data Science for Social Impact: Lessons from Humanitarian Projects."

  • Served as a Co-Lead within the local community for the Google Developers Student Club Eldoret Chapter (2019-2020), Student Ambassador for EldoHub (2020).

    Publications

  • Coming soon

Connect with me on social media:

LinkedIn Gmail WhatsApp Facebook

  • LinkedIn: Read my articles to Stay updated on the latest in the tech and data field.
  • Email: Reach out for consulting gigs, job opportunities, or collaboration on startup ideas. I'm open to remote or hybrid opportunities and willing to relocate.
  • WhatsApp: Let's chat over evening coffee while watching a soccer game.(Kevin Systrom and Mike Krieger, the co-founders of Instagram, met and discussed the initial idea for the photo-sharing app at a coffee shop in San Francisco.)

Just for Fun

-- Why do programmers prefer dark mode?
print("Because light attracts bugs! 😂🦟💡")

-- Welcome to the Magical Kingdom Database!

-- Let's create a table to store information about our adorable dragons.
CREATE TABLE Dragons (
    DragonID INT PRIMARY KEY,
    DragonName VARCHAR(50),
    Color VARCHAR(20),
    Age INT,
    LikesIceCream BIT,
    FavoriteGame VARCHAR(50)
);

-- Now, let's insert some cute dragon data!
INSERT INTO Dragons (DragonID, DragonName, Color, Age, LikesIceCream, FavoriteGame)
VALUES 
    (1, 'Sparky', 'Red', 50, 1, 'Hide and Seek'),
    (2, 'Bubbles', 'Blue', 30, 0, 'Tag'),
    (3, 'Sunny', 'Yellow', 40, 1, 'Hopscotch'),
    (4, 'Glimmer', 'Purple', 20, 1, 'Dragonball');

-- Great! Now, let's check who likes ice cream and what games they enjoy.
SELECT DragonName, LikesIceCream, FavoriteGame
FROM Dragons
WHERE LikesIceCream = 1;

-- Uh-oh! Looks like Sparky, Sunny, and Glimmer all have a sweet tooth!

-- Let's organize a dragon game tournament.
-- First, we need to add a column to track the tournament scores.
ALTER TABLE Dragons
ADD COLUMN TournamentScore INT DEFAULT 0;

-- Update scores based on dragon's age because older dragons are wiser, right?
UPDATE Dragons
SET TournamentScore = Age * 2;

-- Now, let's see the leaderboard!
SELECT DragonName, TournamentScore
FROM Dragons
ORDER BY TournamentScore DESC
LIMIT = 3;

-- And the winner is... *drumroll*... Sparky! The wise, ice cream-loving champion!

Pinned

  1. SQL-Porfolio-Projects-DDL-DML SQL-Porfolio-Projects-DDL-DML Public

    SQL is a must-have skill for any aspiring data practitioner. Many modern companies store vast amounts of their data in various tables of relational databases. To extract the necessary data from a d…

    Jupyter Notebook 1

  2. Data-Vizualization-and-Dashboards-Power-BI- Data-Vizualization-and-Dashboards-Power-BI- Public

    This repository showcases my sample projects through live data dashboards, interactive reports, charts, graphs, and other visual representations, that demostrate how we can use data visualization t…

    1

  3. Data-Analysis-Python-Projects Data-Analysis-Python-Projects Public

    Learning Python is crucial for anyone interested in working with data. This repository showcases my Python projects, demonstrating my ability to analyze, manipulate and automate data analysis using…

    Jupyter Notebook 1

  4. Custmer-segmentation-using-RFM-python- Custmer-segmentation-using-RFM-python- Public

    Customer Segmentation using the Recency, Frequency and Monetary Values

    HTML

  5. Vizualizing-the-stock-market-data Vizualizing-the-stock-market-data Public

    python vizualization

    Python 1

  6. CNN-Image-classification-Using-Fashion-MNIST-dataset CNN-Image-classification-Using-Fashion-MNIST-dataset Public

    There's been much speculation in recent years about neural networks technologies and other deep learning algorithms, primarily because of the popularity of several implementations in the sector uti…

    Jupyter Notebook 4 1