There will be 3rd, 4th or more blogs poping up talking about the relationaship between SD_John_lily and the current top player Opera Solution by our famous Phil Brierley , AKA, Sali Mali, if I do not spend some of time saying something. SD_John and Lily
are two close teams collaborating each other. We can see it without learning data mining skills. We merged into one to follow the new rules posted by Kaggle.
If the purpose of our Phil's hard work is only to discover SD_John and lily are collaborating teams, it may not be interesting enough to spend the precious time and effort. The main goal here I see is to prove SD_John_Lily are part of Opera Solution's team.
If it was not so sure in the first blog, it is definitely certain in the second blog: "Lily, SD_John and Opera Solutions are all essentially the same entity (and JYL also entered) ....". So, our famous data miner Phil digged out this undoubtable conclusion.
I was thinking revealing our own blogs or linkedIn pages to clear the confusion. Now I feel it is better to leave it as is so Phil can keep digging. This reminds me the story of Robert A. Millikan, who was an experimental physicist, and Nobel laureate in
physics. It was widely believe that he picked 58 data points to support his claim instead of using all raw data on the measurement of electron charge. If SD_John's correlation with JYL is > 0.99 based on Phil's measurement, I bet there must be some teams have
greater than 0.99 correlation with MarketMaker if you repeat the same calculation on MarketMaker against all other teams. The reason I am so sure is because I never know JYL. Get to the know the name for the first time from the blog. When I saw Opera's final
results on the credit competition, I can not help laughing. I know it gives another data point to prove I am part of it. Who knows, maybe in the future. I hear it is a pretty good place for data mining scientists. They always welcome great scientists.
I am suprised that I have not done anything on the HPN competition for more than 2 months. I have to come back to make more submissions. Maybe I should closely follow maketmaker's submission to increase the correlation? To give some hints on my profile:
I participated several data mining competitions in the past (many times the performance was not bad); I had met David, member of MarketMaker years ago in conference, had a lot of respects to his early winning achievements on the KDD cup (great job on this
competition, of course); Had chance to compete with Opera on other contest (outside of Kaggle); I encourage everyone to compete in the CHALEARN Gesture Challenge recently available because I am somehow associated and not eligible to win the prize. There are
not many teams yet, it should be very interesting. ....