Kaggle-Titanic

https://www.kaggle.com/c/titanic

The Titanic challenge on Kaggle is a competition in which the task is to predict the survival or the death of a given passenger based on a set of variables:

Survived: Outcome of survival (0 = No; 1 = Yes)
Pclass: Socio-economic class (1 = Upper class; 2 = Middle class; 3 = Lower class)
Name: Name of passenger
Sex: Sex of the passenger
Age: Age of the passenger (Some entries contain NaN)
SibSp: Number of siblings and spouses of the passenger aboard
Parch: Number of parents and children of the passenger aboard
Ticket: Ticket number of the passenger
Fare: Fare paid by the passenger
Cabin: Cabin number of the passenger (Some entries contain NaN)
Embarked: Port of embarkation of the passenger (C = Cherbourg; Q = Queenstown; S = Southampton)
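
Because Age and Cabin contain NaN entries, a quick missing-value check is a natural first step. The snippet below is a minimal sketch with pandas on a tiny hand-made frame (not real Titanic rows); the median fill for Age and the hypothetical `HasCabin` flag are illustrative assumptions, not necessarily what the notebooks in this repository do.

```python
import numpy as np
import pandas as pd

# Tiny hand-made frame mimicking three of the columns above (not real data).
sample = pd.DataFrame({
    "Age": [22.0, np.nan, 35.0, np.nan],
    "Cabin": [np.nan, "C85", np.nan, np.nan],
    "Embarked": ["S", "C", "S", "Q"],
})

# Count missing entries per column.
missing_counts = sample.isna().sum()

# One possible treatment (an assumption, not the notebooks' method):
# fill Age with the median and replace Cabin with a 0/1 "has a cabin" flag.
cleaned = sample.assign(
    Age=sample["Age"].fillna(sample["Age"].median()),
    HasCabin=sample["Cabin"].notna().astype(int),
).drop(columns="Cabin")

print(missing_counts)
print(cleaned)
```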

Since we're interested in the outcome of survival for each passenger, we can remove the Survived feature from the training data and store it as its own separate variable, outcomes. We will use these outcomes as our prediction targets.
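
Separating the target from the features could look like the sketch below. It uses a small in-memory frame so that it runs on its own; with the real data you would presumably load train.csv with `pd.read_csv` first. The names `full_data` and `features` are illustrative assumptions.

```python
import pandas as pd

# Stand-in for the training data (made-up rows, a few columns only).
full_data = pd.DataFrame({
    "PassengerId": [1, 2, 3],
    "Survived": [0, 1, 1],
    "Pclass": [3, 1, 3],
    "Sex": ["male", "female", "female"],
})

# Store the target separately, then drop it from the feature table.
outcomes = full_data["Survived"]
features = full_data.drop(columns="Survived")

print(features.columns.tolist())
print(outcomes.tolist())
```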

Goal:

It is your job to predict if a passenger survived the sinking of the Titanic or not. For each PassengerId in the test set, you must predict a 0 or 1 value for the Survived variable.

The submission file should have exactly 2 columns:

PassengerId (sorted in any order)
Survived (contains your binary predictions: 1 for survived, 0 for deceased)
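
As a rough illustration of that two-column format, the sketch below writes a submission with Python's standard csv module. The helper `write_submission` is a hypothetical name, and the IDs and predictions are placeholder values, not results from this repository.

```python
import csv
import io


def write_submission(passenger_ids, predictions, out):
    """Write a PassengerId,Survived table to a file-like object."""
    if len(passenger_ids) != len(predictions):
        raise ValueError("ids and predictions must have the same length")
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["PassengerId", "Survived"])
    for pid, pred in zip(passenger_ids, predictions):
        # Only binary predictions are valid for the Survived column.
        if pred not in (0, 1):
            raise ValueError(f"prediction for {pid} must be 0 or 1, got {pred!r}")
        writer.writerow([pid, int(pred)])


# Placeholder ids and predictions, written to memory instead of disk.
buffer = io.StringIO()
write_submission([892, 893, 894], [0, 1, 0], buffer)
submission_text = buffer.getvalue()
print(submission_text)
```

To produce a real file, pass an object opened with `open("submission.csv", "w", newline="")` in place of the in-memory buffer.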

I predicted the survival of the Titanic passengers using the Random Forest and Logistic Regression algorithms, with the help of the following techniques:

Assess Data Quality & Missing Values
Exploratory Data Analysis
Feature selection & Recursive feature elimination
Feature ranking with recursive feature elimination and cross-validation
Model evaluation metrics
Model evaluation based on simple train/test split using train_test_split()
Model evaluation based on K-fold cross-validation using cross_val_score() and cross_validate()
GridSearchCV evaluating using multiple scorers
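
The evaluation steps listed above can be sketched with scikit-learn as follows. This is a minimal example under stated assumptions: it uses synthetic data from `make_classification` in place of the prepared Titanic features, logistic regression as the estimator, and an arbitrary grid for `C`; none of these settings are taken from the notebooks.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFECV
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import GridSearchCV, cross_validate, train_test_split

# Synthetic stand-in for the prepared Titanic features (not real data).
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=4, random_state=0
)

# 1) Simple train/test split evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0, stratify=y
)
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)
holdout_accuracy = accuracy_score(y_test, model.predict(X_test))

# 2) Feature ranking with recursive feature elimination and cross-validation.
selector = RFECV(
    LogisticRegression(max_iter=1000), step=1, cv=5, scoring="accuracy"
)
selector.fit(X_train, y_train)
n_selected = selector.n_features_

# 3) K-fold cross-validation reporting several metrics at once.
cv_results = cross_validate(
    LogisticRegression(max_iter=1000),
    X_train,
    y_train,
    cv=5,
    scoring=["accuracy", "roc_auc"],
)

# 4) Grid search with multiple scorers; `refit` names the scorer
#    used to choose the final model.
grid = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.1, 1.0, 10.0]},
    scoring={"accuracy": "accuracy", "roc_auc": "roc_auc"},
    refit="accuracy",
    cv=5,
)
grid.fit(X_train, y_train)
best_C = grid.best_params_["C"]

print("hold-out accuracy:", holdout_accuracy)
print("features kept by RFECV:", n_selected)
print("mean CV accuracy:", cv_results["test_accuracy"].mean())
print("best C:", best_C)
```

Swapping in `RandomForestClassifier` from `sklearn.ensemble` (with a grid over its own parameters, such as `n_estimators`) should follow the same pattern.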

I have uploaded the train and test datasets, along with my final predictions on the test set from both algorithms.
