Transcript of JSM 2013
Predicting 5-year risk of cardiovascular events using electronic health record data.
Combines health information from clinic/hospital visits, prescription database, and vital records.
A machine learning technique often used for classification problems.
2. Use assumptions on P(X|Y) to make life easier, e.g., independence = "Naive Bayes":
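To make the independence assumption concrete: it treats the predictors as conditionally independent given the class, so P(X|Y) factors into P(X_1|Y) × ... × P(X_p|Y). Below is a minimal sketch for binary predictors, not the speaker's implementation. The function names, the Laplace smoothing parameter `alpha`, and the toy data (two hypothetical risk factors such as high blood pressure and smoking) are all assumptions made for illustration.

```python
import math


def fit_naive_bayes(X, y, alpha=1.0):
    """Estimate P(Y) and P(X_j = 1 | Y) for binary features.

    alpha is a Laplace smoothing constant (an illustrative choice).
    """
    classes = sorted(set(y))
    n_features = len(X[0])
    prior, cond = {}, {}
    for c in classes:
        rows = [x for x, label in zip(X, y) if label == c]
        prior[c] = len(rows) / len(X)
        cond[c] = [
            (sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
            for j in range(n_features)
        ]
    return prior, cond


def predict_proba(x, prior, cond):
    """Return P(Y = c | X = x) for each class c under the independence assumption."""
    log_joint = {}
    for c in prior:
        # log P(Y = c) + sum_j log P(X_j = x_j | Y = c)
        lp = math.log(prior[c])
        for j, xj in enumerate(x):
            p = cond[c][j]
            lp += math.log(p if xj == 1 else 1.0 - p)
        log_joint[c] = lp
    # Normalize over classes (the denominator in Bayes' rule).
    m = max(log_joint.values())
    unnorm = {c: math.exp(v - m) for c, v in log_joint.items()}
    total = sum(unnorm.values())
    return {c: v / total for c, v in unnorm.items()}


# Hypothetical toy data: columns are two binary risk factors, y = 1 means an event.
X_toy = [[1, 1], [1, 0], [0, 1], [0, 0], [1, 1], [0, 0]]
y_toy = [1, 1, 0, 0, 1, 0]
prior, cond = fit_naive_bayes(X_toy, y_toy)
probs = predict_proba([1, 1], prior, cond)
```

Working in log space avoids numerical underflow when many predictors are multiplied together, which matters with the large number of variables an EHR can supply.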
Electronic health record (EHR)
Censored Bayesian networks
Huge sample sizes
Incomplete follow-up and censoring
Opportunities and challenges
Our dataset provides 10 years of health information on >400,000 people.
BP, cholesterol, etc. measured irregularly (if at all), especially among young people.
CVD is multifactorial and several risk factors may interact.
Extending Naive Bayes to censored, time-to-event data
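The transcript does not show how the extension handles censoring, so the following is only a sketch of one standard ingredient such a method needs: an estimate of the probability of an event within the prediction horizon that accounts for incomplete follow-up. It uses the Kaplan-Meier estimator, which is an assumption here rather than a description of the authors' method. The function name and toy data are hypothetical.

```python
def kaplan_meier_event_prob(times, events, horizon):
    """Estimate P(T <= horizon) from right-censored data via Kaplan-Meier.

    times:  follow-up time for each person
    events: 1 if the event was observed at that time, 0 if censored
    """
    surv = 1.0
    # Distinct observed event times up to the horizon, in increasing order.
    event_times = sorted(
        {t for t, e in zip(times, events) if e == 1 and t <= horizon}
    )
    for t in event_times:
        at_risk = sum(1 for u in times if u >= t)
        n_events = sum(1 for u, e in zip(times, events) if u == t and e == 1)
        surv *= 1.0 - n_events / at_risk
    return 1.0 - surv


# Hypothetical toy data: follow-up in years, horizon of 5 years.
times_toy = [1, 2, 3, 4, 6, 7]
events_toy = [1, 0, 1, 0, 1, 0]
risk_5yr = kaplan_meier_event_prob(times_toy, events_toy, horizon=5)
```

In this toy example, counting people censored before year 5 as event-free would give 2/6, and dropping them would give 2/4. The Kaplan-Meier estimate falls between the two because censored people contribute to the risk set only while they are under observation.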
Naive Bayes assumption &
Julian Wolfson, Assistant Professor, Division of Biostatistics
University of Minnesota School of Public Health
(in collaboration with the UMN/HP iPredict Research Group)
firstname.lastname@example.org z.umn.edu/julianw @DrJWolfson
1. Model class probabilities by applying Bayes' rule:
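For reference, Bayes' rule for a class label Y and predictors X can be written as follows. The lowercase x, y and the summation index y' are notation introduced here, matching the P(X|Y) used elsewhere in the talk.

```latex
P(Y = y \mid X = x)
  = \frac{P(X = x \mid Y = y)\, P(Y = y)}
         {\sum_{y'} P(X = x \mid Y = y')\, P(Y = y')}
```

The denominator does not depend on y, so classes can be compared using the numerator alone.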
Yep, this assumption is pretty crazy. Amazingly, Naive Bayes often works pretty well even if the assumption is severely violated!
At the poster session this afternoon:
An app which uses these methods to calculate your five-year cardiovascular risk.