I cobbled together some R code that will plot the live leaderboard and show you where you are. If you can enhance this or make it more efficient then please let us know.
http://anotherdataminingblog.blogspot.com/2011/06/scraping-up-leaderboard.html
|
Posts 292 Thanks 113 Joined 22 Jun '10 Email user |
I cobbled together some R code that will plot the live leaderboard and show you where you are. If you can enhance this or make it more efficient then please let us know.
http://anotherdataminingblog.blogspot.com/2011/06/scraping-up-leaderboard.html
|
|
Thanks 178 Joined 21 Aug '10 Email user |
Nice! You might also want to look at the ZIP file containing the raw data for the leaderboard that's at the bottom of the leaderboard page. It contains a CSV file that tracks every team's improvement since the start of the competition. Effectively, it gives you the same data as if you had continually scraped the leaderboard page to track changes.
Thanked by
Sarkis
|
|
Thanks 9 Joined 2 Dec '10 Email user |
|
|
Thanks 178 Joined 21 Aug '10 Email user |
Ford Prefect wrote: @Jeff If you're feeling up to it, it would be neat to have an interactive plot with the top teams on one side, so that if you mouse over a team name, then the history of their submissions gets highlighted on the graph. That way we can see who's hovering and who's on the bleeding edge over time.
I tried to get a simple start on this with the arrows in the leaderboard showing trends. One hidden leaderboard feature is that you can specify the "delta" period that you'd like to look at by specifying the time period using "h" for hours, "d" for days, "w" for weeks, and "m" for months. For example: http://www.heritagehealthprize.com/c/hhp/Leaderboard?delta=2w will show trend indicators comparing each team's position from 2 weeks ago to now http://www.heritagehealthprize.com/c/hhp/Leaderboard?delta=12h will show the past half day (as would http://www.heritagehealthprize.com/c/hhp/Leaderboard?delta=0.5d ) I tried to write a basic algorithm that slowly grows the delta period towards one week as a competition progresses. I'd be up for suggestions on tweaking this algorithm (is 1 week too much? too little?) I like the idea of more interactive graphs and would be willing to write the backend code if someone knows of a great graphing (JavaScript) tool that's relatively easy to integrate. |
|
Thanks 9 Joined 2 Dec '10 Email user |
Jeff Moser wrote: I like the idea of more interactive graphs and would be willing to write the backend code if someone knows of a great graphing (JavaScript) tool that's relatively easy to integrate.
Jeff, maybe this suggestion isn't web 2.0 enough, but couldn't you have a simple image map with rollovers? That way, the static images are just swapped and the processing logic is entirely segregated to the backend. |
|
Thanks 8 Joined 3 Jun '11 Email user |
How about this motion chart? http://code.google.com/apis/chart/interactive/docs/gallery/motionchart.html#Example You can create a motion chart that shows how each team evolved its score over time, its automatically animated. Don't know how fast it will be with more than 200 circles though ...
|
|
Thanks 29 Joined 28 May '10 Email user |
Good one, thanks. I have been scraping it regularly for a while:
PDF: http://static.cybaea.net/Kaggle/HHP/history.pdf library("XML", quietly = TRUE)
Thanked by
Sali Mali
|
|
Posts 292 Thanks 113 Joined 22 Jun '10 Email user |
|
|
Thanks 6 Joined 5 Apr '11 Email user |
|
|
Posts 158 Thanks 92 Joined 6 Apr '11 Email user |
|
Flagging is a way of notifying administrators that this message contents inappropriate or abusive content. Are you sure this forum post qualifies?
with —