well i wonder, if the outcome from a position where a single root search by engine is already having an advantage per its scoring, is not just going to implement the search tree in self play, and each time it update, being of equal strengths on both sides, assuming, is bound to fructify that advantage.. even if the position early score was not accurate chess evaluation of the outcome odds under perfect chess.
If the engine is biased against an opening, even a slight bit, as some position. I would expect self-play to keep amplifying it.
unless that engine score difference is within some margin of error of the engine scoring function (which we don't have a clue about).
Although, maybe doing repeated engine experiment over many different human trusted openings (to be drawish), might give us some clue about the precision..
from the lazy eye of statistics we can forget** that the FEN is a strong factor, and act as if experiments with different FENs was the same event under which to do frequency statistics.
in that context (perhaps needed if self-play does not have more diversity of outcome) we coud figure out what score difference (forget the human error bins. or glyph advantage category, ,keep quantitative at first, engines do it, why not us).
makes for a sure outcome in its direction.. which would mean 100% odds.
Maybe get a sense of variability of outcome for same FEN under non-interfered self-play (with replicates), would help decide. If one FEN is enough. then we could include FENs in the statistical factors...
Recap: hypothesis: should not an engine self play out of a position where that same engine in single root search finds some advantage, result, during self-play continuation from there, at least keep that advance... all side being equal strenght. And given that chess has an attrition bias (eventually either terminal outcome or material attrition, maybe even under random play), isn't an engine which already in its root legal tree search saw an advantage at some deep leaf, going to keep fructifying that advantage.
now how to refute that with experiements. or more likely how to find the conditions under which that is true.
Can we construct the conversion curves that lichess uses for accuracy for engine self play (and are supporte by its own database about relation between engine score and outcome odds, conditional to pair rating average).. see the FAQ on accuracy (i also dug out a maia paper figure where the basis for that FAQ might be).
That is what the previous post raises as background questions to figure out in paralell or to give it some support. one might need to use the starting position depth also as a potential factor.. So starting position depth, position FEN itself, engine single root search at that position, engine parameters, and replicated experiments to terminal outcome.. This could be feasible and might allow to put some quantitative in the above post context.
** This may seem unorthodox, as often statistics don,t spell out factors, and then ask you to forget one of them... But philosophically, classical statistics have always been doing that.. With the "let data speak" motto, where one should not even think about what the statistical model should be having as internal structure, as it could inject some bias into the data analysis, they would actually inject the notion that all non considered factors were equally small factors, and that using a normal distribution would be the occam's razor first set of data expereiment and analysis. And then learn from outliers... or normal test failing.. if the case.. Everything can be assumed "random" at first... if you don't want to look inside (might be a conclusion... ).
well i wonder, if the outcome from a position where a single root search by engine is already having an advantage per its scoring, is not just going to implement the search tree in self play, and each time it update, being of equal strengths on both sides, assuming, is bound to fructify that advantage.. even if the position early score was not accurate chess evaluation of the outcome odds under perfect chess.
If the engine is biased against an opening, even a slight bit, as some position. I would expect self-play to keep amplifying it.
unless that engine score difference is within some margin of error of the engine scoring function (which we don't have a clue about).
Although, maybe doing repeated engine experiment over many different human trusted openings (to be drawish), might give us some clue about the precision..
from the lazy eye of statistics we can forget** that the FEN is a strong factor, and act as if experiments with different FENs was the same event under which to do frequency statistics.
in that context (perhaps needed if self-play does not have more diversity of outcome) we coud figure out what score difference (forget the human error bins. or glyph advantage category, ,keep quantitative at first, engines do it, why not us).
makes for a sure outcome in its direction.. which would mean 100% odds.
Maybe get a sense of variability of outcome for same FEN under non-interfered self-play (with replicates), would help decide. If one FEN is enough. then we could include FENs in the statistical factors...
Recap: hypothesis: should not an engine self play out of a position where that same engine in single root search finds some advantage, result, during self-play continuation from there, at least keep that advance... all side being equal strenght. And given that chess has an attrition bias (eventually either terminal outcome or material attrition, maybe even under random play), isn't an engine which already in its root legal tree search saw an advantage at some deep leaf, going to keep fructifying that advantage.
now how to refute that with experiements. or more likely how to find the conditions under which that is true.
Can we construct the conversion curves that lichess uses for accuracy for engine self play (and are supporte by its own database about relation between engine score and outcome odds, conditional to pair rating average).. see the FAQ on accuracy (i also dug out a maia paper figure where the basis for that FAQ might be).
That is what the previous post raises as background questions to figure out in paralell or to give it some support. one might need to use the starting position depth also as a potential factor.. So starting position depth, position FEN itself, engine single root search at that position, engine parameters, and replicated experiments to terminal outcome.. This could be feasible and might allow to put some quantitative in the above post context.
** This may seem unorthodox, as often statistics don,t spell out factors, and then ask you to forget one of them... But philosophically, classical statistics have always been doing that.. With the "let data speak" motto, where one should not even think about what the statistical model should be having as internal structure, as it could inject some bias into the data analysis, they would actually inject the notion that all non considered factors were equally small factors, and that using a normal distribution would be the occam's razor first set of data expereiment and analysis. And then learn from outliers... or normal test failing.. if the case.. Everything can be assumed "random" at first... if you don't want to look inside (might be a conclusion... ).