lkr0
With all of the dependencies installed, simply run "jupyter notebook" on the command line, from the same directory as the titanic3.xls Once we have read the spreadsheet file into a Pandas dataframe (imagine a hyperpowered Excel table), we can peek at the first five rows of data using the head() command. Before we can feed our data set into a machine learning algorithm, we have to remove missing values and split it into training and test sets. We will feed the training set into the classification algorithm to form a trained model. Interestingly, after splitting by class, the main deciding factor determining the survival of women is the ticket fare that they paid, while the deciding factor for men is their age (with children being much more likely to survive).
Jul-6-2016, 00:46:13 GMT