lichess.org
Donate

AlphaZero is a hoax.... (READ)

Ummm... so I have some news... I decided to stockfish a couple AlphaZero vs. stockfish games and realized that stockfish was playing terribly. if it was stockfish 8, AlphaZero would have been destroyed.

And now let me admit something:
Don't get me wrong, AlphaZero is great, it became a (about) 2400 in 4 hours, and who knows what it will do within a year.

I am just disappointing in google, ````I know that stockfish was "slightly" handicapped (1 minute per move, only 1 gigabyte).
But still, stockfish (even the old version) is better than that.
I think google just wanted some fame.

Comment "I agree @rajma420" if you agree, or
"I disagree @rajma420 because, (Proof)"
Since you mentioned "proof", can we see some to support your claim?
@rajma420

from the paper itself (arxiv.org/pdf/1712.01815.pdf)

"To evaluate performance in chess, we used Stockfish version 8 (official Linux release) as a
baseline program, using 64 CPU threads and a hash size of 1GB."

So it was stockfish 8.

I am 100% certain that it was a legitimate victory. Go is many many orders of magnitude more complex than chess, and AlphaGo Zero easily mastered it. I wouldn't believe for a second that AlphaZero would have lost to any version of stockfish.
Also how did you even get to play a game with AlphaZero ? It's not like it's embedded in a webapp.
"I am just disappointing" — it says it all.
"I decided to stockfish a couple AlphaZero vs. stockfish games and realized that stockfish was playing terribly."

There's the problem. You're evaluating the games using stockfish and not AlphaZero. I'm sure by now AlphaZero has played a million+ more games against itself and has improved even more.
Dude, I defeated Stockfish like yesterday by moving the difficulty slider to the left! You too can defeat Stockfish! :-)
@rajma420 You are wrong at several points:

1) You cannot just casually analyze Stockfish playing on a big server hardware by a Stockfish running on your home computer. What your home computer is probably showing as "best move" which was not done is likely to be refuted on a higher depth and that was the exact reason why Stockfish didn't play it. When it shows "mistakes" or "blunders" - they are just positions that are already lost no matter what a move to make but you homefish cannot see that at such a low depth.

2) Words like "it became a (about) 2400 in 4 hours, and who knows what it will do within a year" indicate that you either didn't read a paper or didn't understand it. On Figure 1 it is clearly seen that AlphaZero strength stabilized and will not increase significantly during further learning. It may improve, however, if they invent a better learning algorithm.

3) Google does not need a cheap fame that can compromise its reputation if it becomes clear in the future that the paper is a fraud. You can endlessly argue about the fairness of hardware and environment conditions described in the paper, but we have no single reason to suspect they are actually lying. DeepMind convincingly defeated the humans in the game that was previously considered to be unsurmountable for computers and there is nothing suspicious in the fact they can also defeat the best engines in chess. Anyway, we should wait for the peer-reviewed version of the paper to draw definite conclusions.
thibault
1 hour ago
#2

Since you mentioned "proof", can we see some to support your claim?

Forum General Chess Discussion Reaction from Stockfish Author (statement)

Pineta
edited
1 day ago
#1

source: chess.com(news) - any thoughts?

Meanwhile Chess.com also received a lengthy comment from one of the original Stockfish authors, Tord Romstad, which we'll give in full:

The match results by themselves are not particularly meaningful because of the rather strange choice of time controls and Stockfish parameter settings: The games were played at a fixed time of 1 minute/move, which means that Stockfish has no use of its time management heuristics (lot of effort has been put into making Stockfish identify critical points in the game and decide when to spend some extra time on a move; at a fixed time per move, the strength will suffer significantly). The version of Stockfish used is one year old, was playing with far more search threads than has ever received any significant amount of testing, and had way too small hash tables for the number of threads. I believe the percentage of draws would have been much higher in a match with more normal conditions.

On the other hand, there is no doubt that AlphaZero could have played better if more work had been put into the project (although the "4 hours of learning" mentioned in the paper is highly misleading when you take into account the massive hardware resources used during those 4 hours). But in any case, Stockfish vs AlphaZero is very much a comparison of apples to orangutans. One is a conventional chess program running on ordinary computers, the other uses fundamentally different techniques and is running on custom designed hardware that is not available for purchase (and would be way out of the budget of ordinary users if it were).

From another perspective, the apples vs orangutans angle is the most exciting thing about this: We now have two extremely different (both on the hardware and the software side) man-made entities that both display super-human chess playing abilities. That's much more interesting than yet another chess program that does the same thing as existing chess programs, just a little better. Furthermore, the adaptability of the AlphaZero approach to new domains opens exciting possibilities for the future.

For chess players using computer chess programs as a tool, this breakthrough is unlikely to have a great impact, at least in the short term, because of the lack of suitable hardware for affordable prices.

For chess engine programmers -- and for programmers in many other interesting domains -- the emergence of machine learning techniques that require massive hardware resources in order to be effective is a little disheartening. In a few years, it is quite possible that an AlphaZero like chess program can be made to run on ordinary computers, but the hardware resources required to _create_ them will still be way beyond the budget of hobbyists or average sized companies. It is possible that an open source project with a large distributed network of computers run by volunteers could work, but the days of hundreds of unique chess engines, each with their own individual quirks and personalities, will be gone.

This topic has been archived and can no longer be replied to.