Skip to content

BASH-EPIC/WINE-DATASET-Hypothesis-Testing-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 

Repository files navigation

WINE-DATASET-Hypothesis-Testing-

This dataset was provided by the professor. This dataset is having 14 variables with 1499 observations. These two variables are in integer format and others are in string format.

image 2. Then I used describe func., as it shows mean, median, mode, S.D and missing values, highest, lowest and missing distinct values. 3. image 3. I tried to remove duplicate values in this plot but there no such duplicate value is present. 4. I use a subset in countries columns to remove one row, using this!= "condition that row removed. As it’s replaced all the unwanted values to spaces and removed one row of countries column as there was no name has been mentioned so I dropped that row. image I created a country variable and created the sum of all the countries, so I’ll get to know which country has the maximum and minimum frequency. 7. I created histograms of points and price. 8. image image In points histogram, it’s showing right-skewed distribution with normal distribution and price histogram are extremely right-skewed as it increases and suddenly decreases at the peak point. 8. I created a scatter point to see what the difference between points and price was and then created a regression line for the clear vision. image It shows that there are a lot of outliers were presented in this graph. That outlier plays an important role so we can’t ignore that outlier. SD are in between 90. 9. This chart shows, how many wines are present and how many times they gets tested. 10. image 10. Replaced all NA values dataset to zero(0). 11. Replaced all the blank values to “Unknown” words in the regional sector. 12. using group by I created a new variable called, "vapes" and I summarise with price and points as its numerical values. image image image This bar chart shows what was wine prices in different countries. In the next plot, I also created a boxplot. 14. I created mean and median for countries using new variables.This graph shows, I create a mean point variable and using the mean points it’s’ showing the graph. 15. image image image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published