Auto fill history planes #452

Dorus · 2018-04-27T22:54:28Z

Instead of going drastic like in #443

It would be cool if somebody added the ability to leela to produce some random history if it is missing. As long as this history includes a capture or pawn move as the last move, it should not affect 3 fold rule.

It might not be possible to do this for all positions, but for any legal position it shouldn't be too hard. The moves do not need to be realistic as long as they are legal right?

killerducky · 2018-05-17T15:27:04Z

There are several graphs made by Trevor in this thread:
https://groups.google.com/forum/#!topic/lczero/PNHkPgV3bCM

They show auto filling with the same position is an improvement over auto filling with zeros. And this is much easier than trying to guess a real history.

mooskagh · 2018-05-18T21:38:20Z

I think this is worth implementing/experimenting when things stabilize (they kind of started to stabilize now, but there is already switching on resignation and switch to lc0 in a queue).
Also instead of doing this, there is also a possibility to drop history completely.

Dorus · 2018-05-19T14:09:48Z

Auto filling with the same position seems like a good first step. Easy to implement and seems to work very well too.

I would propose to add that first, and then benchmark auto-fill vs same-fill, if results are comparable, same-fill should have the preference.

so-much-meta · 2018-05-19T18:40:50Z

I created a PR with some additional details here: #632
And more details about the win-at-chess evaluations here: https://groups.google.com/forum/#!topic/lczero/egceNdtPE1I

While I completely agree that this is less important than waiting for things to stabilize, I also think it's very unfortunate that people keep testing and forming opinions of Leela's strength based on cold-start positions. This is an easy improvement for that. Although, as noted in the PR, if the start position isn't handled as a special case, this will cause Leela to operate slightly different in self-play training data generation and match play at the opening (it looks like copying the history has a slightly flattening effect to the policy near the start position) -- but I don't think that effect will remain significant for long.

gyathaar · 2018-05-19T19:14:09Z

I created a similar PR for for copying last position into history for lc0

#633

so-much-meta · 2018-05-19T19:32:50Z

Regarding dropping history completely... I've seen evidence that Leela's network cares a lot about the previous position, but not much about the others (2 through 7, where 0 is current).

First point of evidence. Input weights to previous position are big, but all others before that are small.
input_weights.gif
[note: red=mean weights to input channel, transformed to (mean*10 - 0.025); blue=std dev of weights to input channel]

Second point of evidence... There just isn't very that much of a difference in the second node evaluation when history 2 through 7 is copied vs left all zero, as seen in the NextMove value comparisons here:
win-at-chess--history-compare.pdf

However - copying is still better than leaving zero for those positions.

The fact that Leela's network seems to care a lot about the previous position and only the previous position suggests that it is in fact using that feature, and that removing history completely will have a detrimental effect.

Furthermore, removing history completely would prevent the network from seeing en passant moves. So I'd think that at a minimum, an en passant plane would have to be added, which would be fairly disruptive.

Dorus · 2018-05-19T23:32:26Z

Beside en passant, you probably also need another plane that indicate if a move is going to be a repetition. You might get away with ignoring 2 folds, but 3 folds are important, else the network completely depend on search for that. Not providing 2 or 3 folds might result in missed draws.

Disruptive changes like that will need a complete new bootstrap of the NN. I'm glad only the first history plane is really used, that indeed indicates it's not making any stupid use of the history plane (like assuming a piece is still on a position because it was there before).

killerducky mentioned this issue May 20, 2018

Leela misses "obvious" en passant captures #634

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto fill history planes #452

Auto fill history planes #452

Dorus commented Apr 27, 2018

killerducky commented May 17, 2018

mooskagh commented May 18, 2018

Dorus commented May 19, 2018

so-much-meta commented May 19, 2018

gyathaar commented May 19, 2018

so-much-meta commented May 19, 2018 •

edited

Loading

Dorus commented May 19, 2018

Auto fill history planes #452

Auto fill history planes #452

Comments

Dorus commented Apr 27, 2018

killerducky commented May 17, 2018

mooskagh commented May 18, 2018

Dorus commented May 19, 2018

so-much-meta commented May 19, 2018

gyathaar commented May 19, 2018

so-much-meta commented May 19, 2018 • edited Loading

Dorus commented May 19, 2018

so-much-meta commented May 19, 2018 •

edited

Loading