Skip to content

LABS Final Project: Yelp_GoogleMaps_Reviews - Roles: Data Engineer, Data Analyst, Machine Learning Engineer, Data Scientist | Bootcamp Henry: Data Science Career | DataFT Cohort 17

Notifications You must be signed in to change notification settings

JohannaRangel/FinalProject_YelpGoogleMaps

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sentiment Analysis on Google Maps and Yelp Reviews for Ulta Beauty

Final Project - Henry Bootcamps Data Science

Important

This activity (purely educational) corresponds to the final project of Bootcamp Henry - Data Science. It is part of our portfolio of practices that has allowed us to improve Data Science skills with real-world problems and data sets.

portada

As a data consulting firm, our commitment goes beyond mere data collection. We believe in the transformative power of data, working tirelessly to turn this belief into a tangible reality for our clients. We invite you to explore the structure of this repository to delve deeper into the intricacies of our project. From datasets and preprocessing to analysis and visualization, the repository is a comprehensive resource that provides insight into our methodology and approach.

Online Reputation: According to Brightlocal 76% of consumers “regularly” read online reviews when browsing for local businesses. Regarding review platforms in 2022 Google is the most used and trusted platform for reviews followed by Yelp.

platform

In this particular project, our experience led us to work hand in hand with Ulta Beauty, one of the leading companies in the beauty industry. We were tasked with performing a Sentiment Analysis on Google Maps and Yelp Reviews. Extensive exploration of these reviews aimed to uncover valuable insights, improve Ulta Beauty's understanding of customer sentiment, and facilitate data-driven decision making.

At Data Insight Pro, we don't just provide data consulting; we build bridges to the future, where informed decision-making drives business success. Our dedication to excellence and innovation places us at the forefront of the industry, and every project is an opportunity to showcase how data can be the compass guiding businesses toward their objectives.

Thank you for exploring our project repository. Feel free to navigate through the different sections to gain a comprehensive understanding of our methodologies and outcomes.

If you have any questions or inquiries, please don't hesitate to reach out.
Contact Us

Repository Structure

├─ assets/ Contains multimedia resources, images, or other assets used in the project.
├─ dashboard/ Includes files related to the creation and development of an interactive dashboard.
├─ datasets/ Stores datasets used in the project, organized into separate folders for csv and parquet files.
├─ deploy_ml/ This folder encompasses all elements related to deploying machine learning models on the Google Cloud. Crucial files such as Dockerfile, main.py, and requirements.txt are located here. You can access the instructions for deploying Streamlit with Google Run from this link. See AboutDeployML.md for more details in this link
│ ├─ csv/ Files generated as part of the data analysis.
│ ├─ parquets/ Containing the raw data provided by Henry as the mandatory starting point for the project.
├─ diagrams/ This folder contains diagrams and visual schematics related to the data pipeline. Explore graphical representations here that illustrate the architecture, process flows, and other relevant visual aspects of the project.
├─ docs/ Contains essential documentation for the project such as: guidelines, generalities, evaluation criteria, instructions for implementing MLs, others.
├─ notebooks/ Here, you will find Jupyter notebooks or other documents related to data exploration and analysis such as ETL/EDA. See AboutNotebooks.md for more details in this link.
├─ sources/ This folder contains Python scripts and modules related to data processing, utilities, and backend functionality. If you're looking for the Jupyter notebooks or documents related to data exploration and analysis, check folder notebooks.
├─ videos/ In this folder, files related to visual material are stored, such as tutorials, demonstrations, or any multimedia content in video format that is relevant to the project.

Projects Displays project management and ongoing tasks.
├─ Gantt
├─ Kanban
├─ Backlog

Wiki See the Wiki for full documentation.
├─ Guidelines
├─ Executive Summary
├─ Technical Report
├─ References
├─ Conclusions & Acknowledgments
├─ Contributors

Final Products: Two Machine Learning models (Sentiment Analysis and Topic Detection) along with an interactive Dashboard featuring 5 key KPIs. Implemented on Google Cloud Platform (GCP) and deployed on Streamlit. Explore the application and enjoy!.

This presentation video offers a concise and professional overview of our final project, highlighting its distinctive features, innovative functionalities (endpoints), and the strategic impact it aims to achieve.Click to watch the video

README.md

lema

About

LABS Final Project: Yelp_GoogleMaps_Reviews - Roles: Data Engineer, Data Analyst, Machine Learning Engineer, Data Scientist | Bootcamp Henry: Data Science Career | DataFT Cohort 17

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published