@IrwinCaladinResearch said in #21:
> Sorry, I forgot to reply to this one and it is a really good point:
>
>
>
> Disclaimer: I already had a beer, so this may be flawed, but thinking about how to get labels for cheating detection properly is a good thought.
>
> Unless Lichess proves me wrong, I would say the percentage of confessions out of flagged cheaters is negligible. Relying on this to get labels is not going to work.
>
> Here is how I would do it and it's wild - ASK PEOPLE TO CHEAT. No, don't stop reading here and start writing a response to call me crazy, hear me out and let me describe how this could work:
> Lichess could reach out to players to sign-up for a "anti-cheater program". The selection of players would have to be as random as possible (this is important!), but you'd probably have to make some restrictions (e.g., you could use only accounts that have existed for more than a year) and I'd rely on Lichess to make the right call here.
Wouldn't the random selection of players to participate in a cheater program create its won bias? You would now be creating a database of cheated games created mostly by people who normally don't cheat. I find it hard to believe they behave the same as notorious cheaters. Not only are they "beginners" in cheating, but by participating in the program, they're likely to "overcompensate" (for instance, follow an engines recommendation more often than a real cheater would).
Maybe it will all work out fine, but I need a bit more convincing this creates a good training set. (Disclaimer: while I've been working in IT/CS for nearly 4 decades, I don't much expertise in ML (and only a little in ML (the language))).
> The players playing against those "programmatic cheaters" would obviously get a refund on their rating (and the programmatic cheater would not gain rating) as well as a message telling them that Lichess is grateful they played the last game against a cheater to improve Lichess mechanisms against actual cheating.
So, if I play against such a cheater, all I get is a message indicating my time has been wasted? Sure, I won't losing rating points (but I didn't gain them either), but I won't be compensated for the time. Now, most people won't care, but if it happens often enough, at least some people will move on. Also, if I were to participate in that program, and immediately after the game Lichess points out that I participated/cheated to my opponent, how does that change her view of me? Will she be less likely to play against me a next time?