The data set and descriptions are available at
http://archive.ics.uci.edu/ml/datasets/Bank+Marketing
Please use bank-additional-full.csv data
Please read the instruction carefully. The field “duration” should NOT be used in the model (think about why).
In R code, you need to include the following: Please check all columns’ types; change types if necessary
Please check whether there are any missing values or other abnormal values
Please check important variables’ statistics and distributions
Please create at least one bar chart, at least one histogram chart, and at least one normalized overlay chart.
Please partition data into training and validation
Please build a decision tree model.
Please use the confusion Matrix function to assess the model’s performance
Please create lift charts to assess the model’s performance.
Recent Comments