Send the link below via email or IMCopy
Present to your audienceStart remote presentation
- Invited audience members will follow you as you navigate and present
- People invited to a presentation do not need a Prezi account
- This link expires 10 minutes after you close the presentation
- A maximum of 30 users can follow your presentation
- Learn more about this feature in our knowledge base article
open404 (IT Remix)
Transcript of open404 (IT Remix)
Data is useful.
Data is available.
It needs improvement…
…which only happens if we use it.
> than #s
Required by the state legislature
Compiled by MDCH BHDDA
Contains service cost and utilization
Aggregated by CMHSP, disability type
Distributed as PDF
Compiled into master dataset
New variables: Fiscal Year, PIHP, % cost, % use, Service Types and sub-groups
Standardized units to hours
Aggregated by population and region
Reformatted to look at correlations
Inconsistent CPT/HCPCS code use
Ambiguous variable definition
Difficult to make meaningful comparisons of population, service use, and gaps
Little incentive to use it as a tool
Data quality improves through engagement and use.
Reliable information exists in flawed data sets.
Decisions based on data provide significant advantages.
Standardized data collection & reporting
Clear variable definitions
Agreed upon outcome measurements
Easily accessible in machine-readable format
Barriers to innovation:
[ ] Desire (this one’s on you)
Give access to data
Provide free tools
Teach to fish
Ask hard questions
Prioritize new datasets to open
Invest in tech & talent to collect & analyze
What is it?
Availability & Accessibility
Reuse & Redistribution
Why would we want that?
Adapted from Open Knowledge Foundation: http://okfn.org/opendata/
...as you know it
"but the data isn't right..."
"...your observations are to be taken with great pains and accuracy, to be entered intelligibly
for others as well as yourself
Who has more incentive to make sure the data is right than the person the data describes?
- Thomas Jefferson, to Lewis and Clark
...the best of
- Justice Louis Brandeis
McKinsey Global Institute (2013) "Open data: Unlocking innovation and performance with liquid information"
"the use of open and proprietary data in health care could help generate value of
$300 billion to $450 billion
per year in the United States"
Your boat has a lot of holes.
Drowning for the Perfect Data
a.k.a. Hypothesis generation
What does it mean when CLS Daily and Personal Care diverge or converge?
Do the units per case for ACT match the frequency required for fidelity?
One CMH provided CLS Daily to one less person in 2012 than in 2006 but spent $2M more doing it. Why?
Why would a CMH serve more people in ACT than Case Management and then reduce the number of cases in each to < 1/4 of the original number?
Does a steady decrease in health services correspond to a greater use of non-Medicaid health services?
How much do provider admin costs impact the
unit cost of services?
Are any CMHs leading in the move to decrease residential services and move people into community-based settings?
important part...just a start
This presentation and tool would not have been possible without the work of…
RAW :: http://raw.densitydesign.org/
ManyEyes :: http://www-958.ibm.com/software/data/cognos/manyeyes/
RStudio :: http://www.rstudio.com/ide/download/
clean & analyze
SAMHDA :: http://www.icpsr.umich.edu/icpsrweb/SAMHDA/browse
BRFSS :: http://www.cdc.gov/brfss/annual_data/annual_2012.html
Medicare Provider Charge Data :: http://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/Physician-and-Other-Supplier.html
entails business risks, including reputational issues related to the potential release of negative information
Potential benefits include:
new products and services
price/services for consumers
[plug your ideas into a need]
customize actions for consumer populations
better match supply with demand
allow innovative approaches by market entrants
one little thing...*
from "Thinking, Fast and Slow" by Daniel Kahneman (2011, Macmillan)
1 million + 1 million = ?
17 x 24 = ?
Let's try it out...
So, did you really solve the first one?
Screening for human:
Have you ever...
believed a report because it told a coherent story, without asking about the quality of evidence?
ignored evidence that contradicted your preconceived notions?
focused on how data was presented rather than the data itself?
assumed green = good for KPIs
Degrees of Open-ness
Version control (distributed)
Command line = bash shell a la Linux
Stick a Fork in it
Clone it on your desktop
Integrated development environment for R
Create Markdown, Sweave documents for reproducible analysis
Projects integrated with Git/SVN for version control
Hone in on areas where...
high cost services are provided often to a small group
high variation exists across CMHSPs for per-person use of a service
high frequency services are provided with high units/person
Resource Use / Efficiency
Look at instances where:
different service codes are provided in varying amounts within a given service type
year-over-year use of high-cost services is flat or increasing
*Make it useful:
knowledge is power