Yes, too much compression. It was not trolling so much as perhaps tongue in cheek: going through the hypothesis and its consequences without spoiling at the beginning that the conclusion was losing the fine grain of chess.
I still have no clue what the entropy (information theory, not physics) means here in terms of bits. Maybe I should find out. What is the definition of information there: is it the moves or the positions that are informative? And is it sequential surprise, or positional "entropy" or "information"? Looking again...
So this is about best-move distributions. That kind of information. Trained on many games of what nature? I will guess high-level games; those would reduce the uniform policy to high-level policies (very peaked).
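To make "uniform versus very peaked" concrete for myself, here is a minimal sketch using the textbook Shannon entropy of a move distribution, in bits. I am assuming that is the kind of definition in play there; the 32 legal moves and the 90% weight on one move are made-up numbers, not taken from any real model or position.

```python
import math

def entropy_bits(probs):
    """Shannon entropy H = -sum(p * log2(p)) of a discrete distribution, in bits."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Hypothetical position with 32 legal moves (illustrative number only).
n_moves = 32

# Uniform policy: every legal move is equally likely.
uniform = [1 / n_moves] * n_moves

# Hypothetical peaked policy: one move gets 90%, the others share the remaining 10%.
peaked = [0.9] + [0.1 / (n_moves - 1)] * (n_moves - 1)

uniform_bits = entropy_bits(uniform)
peaked_bits = entropy_bits(peaked)

print(f"uniform policy: {uniform_bits:.2f} bits")
print(f"peaked policy:  {peaked_bits:.2f} bits")
```

With these made-up numbers the uniform policy comes out at 5 bits (log2 of 32) and the peaked one at just under 1 bit, so in this sense "fewer bits" only says the distribution is more concentrated on a few moves.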
Well, that is definitely imitation learning. Not enough patzer games in there. No logic of the board in there.
All the suboptimal, hidden chess is missing. That is what is not played in those games, but that is what needs to be learned. High-level chess is the result of lots of pruning over the years.
And yet, that is how the tradition goes about learning chess: from high-level best chess (well, a good part of it, as a caricature). But somehow here it is buried in notions of bits and entropy.
Am I completely off base here?