José A. Guerrero
Rank 19th
Posts 144
Thanks 21
Joined 27 Jan '11

http://www.heritagehealthprize.com/c/hhp/forums/t/1268/i-found-this-useful

 
S.U.T.
Posts 43
Thanks 7
Joined 5 Sep '11

"Prognostic Indices" -

Study behind paywall: http://jama.ama-assn.org/content/307/2/182.short

Graphic: http://www.eprognosis.org/ (Note: the Charlson index is not mentioned)

NYT summary: http://well.blogs.nytimes.com/2012/01/19/why-doctors-cant-predict-how-long-a-patient-will-live/?ref=health

 

 

 
CreativSolutions
Posts 4
Joined 7 Jan '12

I don't yet know if this data is useful, but I did submit an entry using it, and I want to make sure I can use the data in the future if I find it useful.

4 Attachments —
 
Jeremy Howard (Kaggle)
Posts 166
Thanks 58
Joined 13 Oct '10
From Kaggle

It should be fine to use that data, as long as it can be licensed to competitors to use on this comp, and to HPN to use with the final model. Can you please confirm where you got that data, and how it is licensed?

 
HangZ
Rank 5th
Posts 8
Thanks 1
Joined 1 Nov '11

I am sharing an external data source that might be useful for this competition; it is attached to this post. It is the 2010 census data, which is publicly available from the census.gov website: http://www.census.gov/prod/cen2010/briefs/c2010br-03.pdf

I just contacted census.gov, and this is what I was told:

"Census data is "public domain", you do not need our permission to use it, copy it, publish it, or cite it."

1 Attachment —
 
_JeremyA
Posts 23
Thanks 6
Joined 5 Apr '11

http://www.heritageprovidernetwork.com/?p=medical-groups

http://www.calhospitalcompare.org/

I'll probably use the data located on these webpages as inputs at some point. I assume this is considered 'external data'?

 

~jba

 
DavidChudzicki
Kaggle Admin
Posts 424
Thanks 106
Joined 21 Nov '10
From Kaggle

JeremyA wrote:

http://www.heritageprovidernetwork.com/?p=medical-groups

http://www.calhospitalcompare.org/

I'll probably use the data located on these webpages as inputs at some point. I assume this is considered 'external data'?

 

~jba

 

Yes, anything other than the data sets provided with the competition is "external data."

 
G
Posts 1
Joined 18 Mar '11

In section "7. USE OF OTHER DATA" of the rules it states: "You may not, however, link the Data Sets to records in other external databases such that new demographic, socioeconomic or clinical information about the members in the Data Sets is gained."

Is a concise definition available for what exactly constitutes demographic, socioeconomic, and clinical information in the context of this sentence?

thanks

 
_JeremyA
Posts 23
Thanks 6
Joined 5 Apr '11

G wrote:

In section "7. USE OF OTHER DATA" of the rules it states: "You may not, however, link the Data Sets to records in other external databases such that new demographic, socioeconomic or clinical information about the members in the Data Sets is gained."

Is a concise definition available for what exactly constitutes demographic, socioeconomic, and clinical information in the context of this sentence?

thanks

Do the two links I've provided fall under this rule? The average LoS for California, as well as the in-service provider info from the Heritage Health website, certainly qualify as "new demographic, socioeconomic or clinical information about the members", just not for the purposes of 'Patient Identification/Privacy', which is what I thought the rule was geared towards...?

 

Thanks in Advance,

~jba

 
DavidChudzicki
Kaggle Admin
Posts 424
Thanks 106
Joined 21 Nov '10
From Kaggle

G-- I'm sorry, but that's what we have. I think we'll just have to figure out how that applies on a case-by-case basis.

JeremyA-- We'll need to have a look at that data and think about it with HHN. I'll be sure to give a response by Friday next week (March 16).

Thanks,
David

 
DavidChudzicki
Kaggle Admin
Posts 424
Thanks 106
Joined 21 Nov '10
From Kaggle

JeremyA-- I'm sorry. I'm still trying to find out what HHN thinks of this. I'll be in touch again as soon as I can.

 
_JeremyA
Posts 23
Thanks 6
Joined 5 Apr '11

Don't worry.  I anticipated it might elicit some difficulty.
And there's lots of time left in the competition.

~jba

 

 

 
Kno.e.sis
Posts 4
Joined 28 Nov '11

We haven't submitted any prediction model yet to the competition but I will get to it some time. However, I'm planning to use some external data sources. Please let me know if I should be posting links to these datasets here.

Thanks a lot in advance!
Pramod.

 
DavidChudzicki
Kaggle Admin
Posts 424
Thanks 106
Joined 21 Nov '10
From Kaggle

Yes, you should post links here.

 

7. USE OF OTHER DATA

Entrants may use data other than the Data Sets to develop and test their Prediction Algorithms and Entries provided that (i) such data are freely available to all other Entrants and (ii) the data and/or a link to the data are published in the "External Data" topic in the Forums section of the Website within one (1) week of the date on which an Entry that uses such data is submitted to the Website. Entrants may not use new external data in connection with the development of their Entries after 11:59:59 UTC on April 4, 2012 without the prior written permission of Sponsor. Any third-party service provider, consultant or contractor of Sponsor that received or receives data or other information in connection with work performed for or on behalf of Sponsor may not use such data or other information in connection with the Competition.

You may not, however, link the Data Sets to records in other external databases such that new demographic, socioeconomic or clinical information about the members in the Data Sets is gained. Sponsor reserves the right in its sole discretion to disqualify any Entrant who Sponsor discovers has undertaken or attempted to undertake such linking of the Data Sets.

 
Kno.e.sis
Posts 4
Joined 28 Nov '11

Thanks a lot for the reply!

Here are some of the external data sets I'm planning to use:

Pubmed: http://www.ncbi.nlm.nih.gov/pubmed/ 

datasets on LODD : http://www.w3.org/wiki/HCLSIG/LODD/Data 

ICD9 data: http://www.cdc.gov/nchs/icd/icd9.htm 

Mortality statistics: http://www.cdc.gov/nchs/deaths.htm 

Disease ontology: http://do-wiki.nubic.northwestern.edu/index.php/Main_Page 

 

 

 

 

 