Did puzzles rating get inflated with V2 update?

dionmei

My highest was around 1900 in puzzles before, but I usually stayed around the 1800 range. My highest in puzzles after the update is 2100 right now, and I am only 1600 rated in rapid. I try not to look at how high my rating is in either puzzles or regular chess, but I want to know if I am getting better or if it's just inflation. Thank you. Sorry if this was already answered before.

phoenixshade edited

My opinion is that they are. My puzzle rating has jumped by about 400 points since the introduction of v2. I had done about 2800 puzzles on the old system, and I've done around 380 or so on the new and my rating seems to have stabilized, increasing from about 1800 to 2200.

I also notice it qualitatively. Puzzles that under the old system would have been rated around 1700 or 1800 seem to be at least 300 points higher than that now.

One reason I think this may be happening is that all puzzles are rated, even if you choose a custom set like a particular tactical motif. If you already know that every solution is going to be a fork, or an underpromotion, or whatever, you will tend to find it a lot faster and a lot more often than when you're solving a "Healthy Mix." This will inflate the rating of the solver (and deflate those particular problems), such that when they resume doing a "Healthy Mix," their inflated rating gets fed back to the puzzles.

Chesstempo, for this exact reason, makes all custom puzzle sets unrated. You can solve a million "underpromotion" puzzles where you already know that 99.5% of the time you need to make a knight, but it won't affect your rating nor those of the puzzles.

Dead_Can_Dance edited

If you only train certain puzzle types you can easily inflate your rating. but puzzles are for training only so i don't think it matters. i personally really enjoy the new option because now i can train specific motives just like on chesstempo. for me personally this puzzle update just made chesstempo superfluous.

phoenixshade edited

ChessTempo's classification of motifs is more mature and robust. Computers are good at FINDING tactics, but they are not as good at IDENTIFYING them by motif. For that, users contribute, on both sites. The difference is that this has been the case on ChessTempo since the beginning, so they have a several year head start.

The other thing I prefer about ChessTempo puzzles is that, especially on long and difficult-to-spot combinations, many ChessTempo puzzles play out the combination, rather than abortatively sacrifice material worth one less pawn. This forces the player to spot the tactical combination to its conclusion. Again, this is something that arises due to human contribution (via the "alternate move" and "needs more moves" tags).

ChessTempo also has MUCH better tools for reviewing your puzzle history, including tracking your performance by tactical motif. This is an excellent tool for identifying your tactical strengths and weaknesses objectively.

Lastly, ChessTempo makes use of a spaced repetition algorithm to develop and reinforce your weakest skills in an evidence-based way that leads to concrete improvement.

V2 puzzles are a definite improvement in many ways, but they are still a long way from replacing ChessTempo from my perspective.

aPatron

This talk of cheese is making me hungry

fotc77 edited

My rating has gone down, lol

Nvm, its stayed the same

Splorer

The rating deviation value for puzzles got reset with V2, so people should experience some volatility both ways.

dionmei

I'm actually having a hard time maintaining 2000 for my puzzles rating so maybe it is going back to normal. I just want the puzzle difficulty to stay consistent for a given rating.

phoenixshade

@Dead_Can_Dance Yes, one can easily inflate one's rating that way, but that does not explain MY approximately 300-400 point increase, as I ONLY do "Healthy Mix" and so far have only done them at "normal" difficulty. I think what's happening is some solvers ARE artificially increasing their rating with themed sets, then reverting to "Healthy Mix" and getting a lot more wrong, feeding their inflated points back into the puzzle pool, where solvers like me who only do "Healthy Mix" then have an opportunity to pick up more points per puzzle since their ratings are higher.

I wonder if any specific themed sets are more prone to this type of abuse, and if the puzzles included in such a set are currently artificially LOW due to this practice.

In my opinion, anything other than "Healthy Mix" should be unrated in puzzle solving. If you know what specific tactic you are looking for, for you the difficulty of the puzzle is like 400 rating points less.

Roman30061990 edited

#10

to much stupid puzzles
for example #NneD9 - chess problem
I find the winning move Qe7. But here was one more win take on d5.
If some one do not find the puzzle - lichess.org/dNRM90Y8/white#61
puzzle begin after blacks 30 move

This topic has been archived and can no longer be replied to.