Partitioning the data into Training and Validation sets?

« Prev
Topic
» Next
Topic
mgomari's image Posts 14
Thanks 7
Joined 5 Apr '11 Email user

I wasn't able to find an answer to this in the rules.

Is it fair to assume that Entrants can partition the available data into Training and Validation sets as they wish?

Further, are Entrants allowed to fine tune these data sets if they see fit, e.g. removal of junk data?

Thanks

 

 
Anthony Goldbloom (Kaggle)'s image
Anthony Goldbloom (Kaggle)
Competition Admin
Kaggle Admin
Posts 382
Thanks 72
Joined 20 Jan '10 Email user
From Kaggle
@mgomari, the answer to both questions is yes.
Thanked by mgomari
 

Reply

Flag alert Flagging is a way of notifying administrators that this message contents inappropriate or abusive content. Are you sure this forum post qualifies?