Data analytics
1a. What is the average cholesterol for the group with the highest overall cholesterol?
1b. Does this group (identified above) have more males or females?
You have many patients who have still not signed up to use the patient portal, which allows patients to manage several health care related tasks online, such as medication refills, scheduling appointments and messaging their physician.
To increase patient portal adoption, you would like to send a targeted mail campaign to the group of current non-users who are more likely to sign up for the patient portal. Pick an appropriate machine learning program and use the Training.xlsx file to train your model and predict portal adoption for patients listed in Scoring.xlsx
2a. What machine learning algorithm would be appropriate: linear regression or logistic regression?
2b. What is the predicted patient portal adoption status for Patient ID 993? If targeted in the marketing campaign, is she likely to become an adopter or stay non-adopter?
You are interested in learning what factors play a role in patient adoption of patient portals, so you can plan your marketing strategy accordingly. Create a decision tree using the data in Training.xlsx. What is the most important factor determining adoption?
Very briefly, what is your observation regarding this variable, as determined by the decision tree?