 http://www.heritageprovidernetwork.com/?p=medical-groups http://www.calhospitalcompare.org/ I'll probably use the data located on these webpages as inputs at some point, I assume this is considered 'external data'? ~jba
 JeremyA wrote: http://www.heritageprovidernetwork.com/?p=medical-groups http://www.calhospitalcompare.org/ I'll probably use the data located on these webpages as inputs at some point, I assume this is considered 'external data'? ~jba Yes, anything other than the data sets provided with the competition is "external data."
 In section "7. USE OF OTHER DATA" of the rules it states: "You may not, however, link the Data Sets to records in other external databases such that new demographic, socioeconomic or clinical information about the members in the Data Sets is gained." Is a concise definition available for what exactly constitutes demographic, socioeconomic, and clinical information in the context of this sentence? thanks
 G wrote: In section "7. USE OF OTHER DATA" of the rules it states: "You may not, however, link the Data Sets to records in other external databases such that new demographic, socioeconomic or clinical information about the members in the Data Sets is gained." Is a concise definition available for what exactly constitutes demographic, socioeconomic, and clinical information in the context of this sentence? thanks Do the two links I've provided fall under this rule? The avg LoS for California as well as the in-service provider info from the Heritage Health website certainly qualify as "new demographic, socioeconomic or clinical information about the members", just not for the purposes of 'Patient Identification/Privacy', which is what I thought the rule was geared towards...? Thanks in Advance, ~jba
 G-- I'm sorry, but that's what we have. I think we'll just have to figure out how that applies on a case-by-case basis. JeremyA-- We'll need to have a look at that data and think about it with HHN. I'll be sure to give a response by Friday next week (March 16). Thanks, David
 JeremyA-- I'm sorry. I'm still trying to find out what HHN thinks of this. I'll be in touch again as soon as I can.
 Don't worry.  I anticipated it might elicit some difficulty. And there's lots of time left in the competition. ~jba
 We haven't submitted any prediction model yet to the competition but I will get to it some time. However, I'm planning to use some external data sources. Please let me know if I should be posting links to these datasets here. Thanks a lot in advance! Pramod.
 Thanks a lot for the reply! Here are some of the external data sets I'm planning to use: datasets on LODD : http://www.w3.org/wiki/HCLSIG/LODD/Data  ICD9 data: http://www.cdc.gov/nchs/icd/icd9.htm  Mortality statistics: http://www.cdc.gov/nchs/deaths.htm  Disease ontology: http://do-wiki.nubic.northwestern.edu/index.php/Main_Page
 JeremyA, I'm sorry -- I think we have to say not to use it. -David
 Hi Kaggle Admins, census.gov was already mentioned in this thread… I'm thinking about using other external data from that source with a socioeconomic dimension, like the data linked from this document: http://www.census.gov/hhes/www/income/income.html Would it be ok to integrate that data in my models? -theafh
 Hi Theafh, I'll have to look into it and get back to you within a week, but I fear the answer will be the same as for JeremyA's question. Thanks, David
 Hi, We are planning to leverage the following data and information which is free to the public: Thanks!
 Hi Admins, I just wanted to know what I need to do to get approval for the use of external data sets after the April 4th deadline. In addition, if I have compiled data via automated data mining from published and freely available journal articles, must I provide links to each article, or just provide the compiled dataset? It may be easier to provide the compiled dataset, as the number of articles used would be huge. Cheers!
 As a general rule, external data won't be approved after the deadline.
 Hi, I'm just starting with the contest. Did any external data sources get approved? It didn't look like it from this forum, but I wanted to be sure. Thanks, David
 David Gainer wrote: I'm just starting with the contest. Did any external data sources get approved? It didn't look like it from this forum, but I wanted to be sure. Prior to April 4, 2012 external data didn't need approval (as long as all of the conditions in the rules were satisfied). That's why you see people posting it here (without approval). After that date, external data requires approval, which is unlikely to happen.
 David, Can you provide a final list of specific external data sources that we can use? Some people have listed websites, which seems vague. If any data from any URL posted before the deadline can be used, then can you just verify this? Thanks, John
 If there are particular cases that aren't clear from questions & responses on the forum thread, can you ask about those specifically?
