According to this presentation
https://www.slideshare.net/DataFestTbilisi/how-to-win-a-mach...
He worked at H2O.ai (though I think he has now been fired). Prior to that (again according to the above):
- Master of Science from Moscow State University
- New Economic School (Moscow)
- Financial Consultant
- Quantitative Researcher
- HFT Fund partner
Overall this seems to be an impressive track record, the type often mentioned on HN as what the top firms would hire from. It's completely unclear why he needed to cheat. Are there other sophisticated cheaters out there in these types of competitions?
Maybe there need to be prizes for 'checking' other people's work?
Edit: I didn't see that the test data was given. See the first reply to this comment.
The issue is that they manually labeled the test data, and then pretended they didn't.
The competition objective is to provide an ML solution that produces labels for the test data, showing your work with code (to prove you didn't just hand-label the data).
Instead, they did manually label the data, and hid their manual labels in the id column of that external data source.
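To make the trick concrete, here is a toy sketch of how labels can ride along in an id column. Everything in it is hypothetical: the ids, the "last character is the label" encoding, and the function names are made up for illustration, and I don't know what encoding was actually used.

```python
# Hypothetical "external data" rows: (id, feature). The ids look random,
# but in this toy example the last character secretly stores the hand label.
external_rows = [
    ("a91f3", 0.12),
    ("77c21", 0.80),
    ("0be44", 0.33),
    ("5d2a1", 0.45),
]


def recover_hidden_label(row_id: str) -> int:
    """Decode the label hidden in the id (toy scheme: last character)."""
    return int(row_id[-1])


def predict(rows):
    """Looks like model output, but only reads the smuggled labels."""
    return [recover_hidden_label(row_id) for row_id, _feature in rows]


predictions = predict(external_rows)
print(predictions)  # [3, 1, 4, 1]
```

Note that the feature column is never used, so the "model" scores perfectly on exactly the rows whose ids were prepared by hand and would fail on any others.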