@Wodjul said in #70:
> 2. Overlearning
The E. effect? See the NDpatzer blogs; I forget the spelling of the E.
Hammer and nails: hammering training.
There exists a more pragmatic and mathematical language, or formalism, for this: machine learning is built on exactly that question. But it needs two directions of error in generalization from the training set to an unseen test set of input data (positions).
So there are three different sources of thinking about this; I only knew the mathematical one, from ML.
There the problem has been made reproducible and testable, not only qualitative.
The "over" problem also comes with an "under" problem. One has both questions to consider, and finding the room in between is where the research, art, or creative effort of learning theory lives.
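Here is a minimal toy sketch of those two directions of error, not about chess at all; the sine target, the noise level, the split and the polynomial degrees are all invented for illustration, just to make the under/over pair concrete and measurable:

```python
# Toy illustration of the two directions of generalization error: fit
# "evaluators" of increasing capacity and compare training error with the
# error on an unseen test set. All numbers here are made up for illustration.
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 60)
y = np.sin(3 * x) + rng.normal(0, 0.1, x.size)   # target phenomenology + noise

x_train, x_test = x[:30], x[30:]                 # seen vs. unseen inputs
y_train, y_test = y[:30], y[30:]

for degree in (1, 5, 15):                        # too rigid, about right, too flexible
    model = Polynomial.fit(x_train, y_train, degree)
    mse_train = np.mean((model(x_train) - y_train) ** 2)
    mse_test = np.mean((model(x_test) - y_test) ** 2)
    print(f"degree {degree:2d}  train MSE {mse_train:.4f}  test MSE {mse_test:.4f}")

# degree 1 underfits (both errors stay high); degree 15 overfits (training
# error collapses while test error grows); the room in between is what a
# theory of learning is trying to find.
```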
The problem definition needs to be well formed. And I am sorry to say it, but I think the SF NNUE side either does not realize this or does not think it as important to share as the (to me meaningless) Elo numbers. If we don't ask... well, I do.
What is taken as known, what is then used to define the learning objective, how the dataset sampling and construction relate to it, and so on. I wonder whether having always developed the thing at code level, and having it work well enough, is not its own kind of training: one never needs to know why it works, or what it says about the flow of chess information through the learning problem.
One can get lost in pairs of words that mean the same thing in the end. My point is that there are two types of misgeneralization.
Sometimes people say that the NN parameter set obtained after some training is a case of underfitting. This would mean the chosen function space is not flexible enough to express the complexity of the phenomenology function (this is not combinatorial complexity, unless one is careful about what is being counted, and it is not about the number of positions, although that might have some effect in cases of misgeneralization).
The other case, with big computers and big NN sizes (number of layers, number of units per layer, or both, or something else), sometimes gets the opposite word in the pair: overfitting.
Overfitting the data actually means fitting even the quirks of the training data, quirks that may just be measurement error; but in chess we don't have that, at least not when the problem is well stated (and I am still waiting for SF to get its act together on stating it, though I am too lazy to dig it up myself and am waiting for it to come through the grapevine; I have been burned in the past seeking such basic information).
In chess, the generalization problem does not come from inputs having typos, or, in the SF NN case, from the training SF oracle score having error. The unstated truth there is that the phenomenology is, by definition, the SF score given as the target output: that is what the NN has to fit over the big set of input positions. I am basing this on SF blog crumbs. They have a wiki now. The exhaustive-search part has been nicely extracted from its encoding in the programming language (laughing) back up to a higher level, where chess users can hope to understand how the search part is independent of, or modular from, the leaf-evaluation part (the NN being one component).
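As I read those crumbs, the setup is plain supervised regression onto an oracle score. A sketch of that reading follows; this is NOT the real NNUE trainer, and the feature encoding (a flat 0/1 vector per position), the net shape, the fake data, and the hyperparameters are all placeholders I made up:

```python
# Sketch of "the NN fits the SF oracle score over a big set of positions".
# Not the real NNUE architecture or trainer; everything here is a stand-in.
import torch
import torch.nn as nn

N_FEATURES = 768                     # assumed: piece-on-square planes, flattened
net = nn.Sequential(
    nn.Linear(N_FEATURES, 256), nn.ReLU(),
    nn.Linear(256, 32), nn.ReLU(),
    nn.Linear(32, 1),                # one scalar evaluation out
)

# Stand-in data: random "positions" and random "oracle scores".
positions = torch.rand(10_000, N_FEATURES).round()   # fake 0/1 position encodings
oracle_scores = torch.randn(10_000, 1)               # fake deep-search target scores

opt = torch.optim.Adam(net.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()
for step in range(1_000):
    idx = torch.randint(0, positions.shape[0], (256,))   # minibatch of positions
    loss = loss_fn(net(positions[idx]), oracle_scores[idx])
    opt.zero_grad()
    loss.backward()
    opt.step()

# Whether the fitted net generalizes depends entirely on which positions were
# sampled and how representative they are, which is the point made below.
```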
I did not dare risk being disappointed again by the other wiki part. And I had glimpses that not the whole dev team is eager to cooperate with the writer, who did the more ubiquitous exhaustive-search engine model presentation pretty well (though I already had an understanding of it from a previous rabbit hole, which explains my current laziness). Source code is the worst surrogate for a user manual, and so are the readme files. As for the other, more obscure part of SF, which I gather is their current target of improvement: some orbiting repositories have kept earlier misleading sentences, so rumors persisted about NNUE using reinforcement learning, and then about it using Leela's data, in one sibylline sentence in the SF16 blog. Those two, sorry to say, sloppy communications from the SF team to the user population, in spite of the wiki effort, have made me waste ramblings. Something wrong with me.
Anyway, there is a lot of previous work on that question, and many tools; we might just be missing the cultural curiosity, or simply knowledge of their existence. We have lots of chess data sitting on servers, some of it in inert, less informative versions (like the puzzle database, which is missing a lot of the real data), but there is also the lichess opening sequences database (not the explorer, which uses that database internally to attribute a single name to an input position, based on a naming-priority policy over the branches containing it: when many opening sequences, or sequences of named connected segments, contain the position, the shorter one wins the name, or there is also a popularity rule). Anyway, the thing is that there are plenty of position datasets, some well restricted and many, mostly pre-lichess, obscure and non-reproducible.
So misgeneralization in chess is crucial here: repeating the same positions very often, if the training set is itself not representative of the actual wilderness probability of random encounter. (An aside: one might count on swarming convergence around the latest novelty, or argue that within the unspecified axiom-1 duration of the improvement problem, from not yet improved to improved by some notch, the positions in vogue in a given tournament, with its particular number of players, might not be as wild as all of chess. The chance of meeting some old, historical, playable but not currently hot position, one that is not a novelty but is also not a usual encounter nowadays, might be small.)
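To make "representative of the wilderness encounter probability" a bit more concrete, here is a toy reweighting sketch; the position keys and every frequency in it are invented for the example:

```python
# If the training set over-samples some positions relative to how often they
# are actually met, one can at least reweight them toward the "wild" frequencies.
from collections import Counter

training_counts = Counter({"pos_A": 900, "pos_B": 90, "pos_C": 10})   # what was trained on
wild_counts = Counter({"pos_A": 300, "pos_B": 300, "pos_C": 400})     # what is actually met

n_train = sum(training_counts.values())
n_wild = sum(wild_counts.values())

# Importance weight per position: (wild frequency) / (training frequency).
weights = {
    pos: (wild_counts[pos] / n_wild) / (training_counts[pos] / n_train)
    for pos in training_counts
}
print(weights)   # pos_A gets down-weighted, pos_C gets heavily up-weighted
```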
It depends on how much, and at which pragmatic level, one is thinking. So I find that we can get lost there,
and I prefer a clearer approach. That needs discussion, but it also needs another kind of pragmatism: specifying exactly what you put in #2, which is only half the story.
Actually, in chess there is also the notion of a learned pattern taught by examples, because training happens on a constructed or chosen set of many positions (which ones is part of the question, and I think it should be part of the discussion).
There is the first exposure, and then the interaction with others using language or pattern definitions. One can also be trained on the same set and have the same kinds of generalization problems, or actually a positive effect.
I guess there might be four problems in chess theories of learning, if one stops being gung-ho about one magic-bullet theory and actually considers the conscious, many-headed cooperative dwarf still surviving in the culture, which we could call the chess-theory rebuilding effort. (Another problem is being very shy about being critical AND constructive, or, when liking one book or one author, not being able to be surgical about it. In general, I find the lack of cooperation, and of a habit of discussion like here, might have been retarding chess theory and chess learning theory for a while. For some reason, I suspect the confusion between performance and learning.)
But what do I know. I would like to share more, but I think I would need questions. It is hard to know where to start when all we know of each other is that chess is our common interest, and even then, which chess?