Juhan Sonin • over 13 years ago
Sythnesized Patient Data Set?
Is there an open, synthesized patient data set of a statistically relevant size (1,000-10,000 records) that contestants can use (not just the single, anonymized record http://www.va.gov/BLUEBUTTON/docs/VA_My_HealtheVet_Blue_Button_Sample_Version_12_All_Data.txt)?
Comments are closed.

5 comments
Ryan Panchadsaram Manager • over 13 years ago
Hi Juhan,
We're working on getting a few example records for participants.
In the meantime, this is a good summary of the fields and sections:
http://blue-button.github.com/challenge/files/health-design-challenge-fields.pdf
Best,
Ryan
Juhan Sonin • over 13 years ago
Thanks Ryan.
In the meantime and most likely for the design, I'll use my own data (which has been open sourced under the Apache 2 license and is mostly structured data).
Ryan Panchadsaram Manager • over 13 years ago
Okay. When you get a chance, give me a peek at the direction you are going. I want to make sure you are submitting something that meets the requirements of the challenge.
My email address is: ryan.panchadsaram@hhs.gov
Good luck!
Jonathan Irwin • over 13 years ago
The data set in the anonymized record is much more detailed, and includes much more information (blood type, occupation, etc) than the summary. Which one would you recommend designing for? Generic or actual? Thanks in advance!
Ryan Panchadsaram Manager • over 13 years ago
Hi Jonathan,
You should design towards the summary:
http://blue-button.github.com/challenge/files/health-design-challenge-fields.pdf
I will be updating this PDF in the next day or two. I will be adding details to the Lab Results section.
Best,
Ryan