Logistic Regression
I created dummies for the categorical data in the dataset in order to perform Logistic regression. The data was again split into test and train datasets with a 30 –70 split. The logistic regression model was trained on the training dataset. The model report is shown below.
The precision when predicting white is 0.67 and when predicting black is 0.66. This is to be expected since the number of instances for white and black is way more than the other races. The recall score for white is 0.78 and for black is 0.61.
F1 score for white was 0.72 and for black was 0.61. The image below shows the full classification report.
The total accuracy score for the logistic regression model was 0.648.