Different Neural Network Algorithm #6

anubisthejackle · 2015-04-06T13:26:57Z

It's likely that a convolutional network isn't sufficient for this type of problem. I'm beginning to think that a better solution would be a Long Short-Term Memory (LSTM) network. This means using a different neural network library than ConvNetJS. Synaptic is a quality library that's architecture-free, and it just so happens to have an example of how to set up an LSTM network in its README.md.

davidak · 2015-11-28T22:53:07Z

As you can see in the chart I contributed, there is only a small positive trend.

StateManager.scores.length: 636 (number of games)

experience replay size: 100002
exploration epsilon: 0.01
age: 100004
average Q-learning loss: 0.39981905616281765
smooth-ish reward: 0.5372073053150149
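For anyone reading the numbers above: assuming `exploration epsilon` has its usual epsilon-greedy meaning, a value of 0.01 means the agent picks a random move about 1% of the time and otherwise takes the move with the highest estimated Q-value. The sketch below only illustrates that idea. It is not ConvNetJS's code, and `chooseAction`, the `rng` parameter, and the Q-values are hypothetical names and numbers made up for the example.

```javascript
// Epsilon-greedy action selection (illustrative sketch, not ConvNetJS's code).
// qValues: estimated value of each action
// epsilon: probability of taking a random (exploratory) action
// rng: function returning a number in [0, 1); defaults to Math.random
function chooseAction(qValues, epsilon, rng = Math.random) {
  if (rng() < epsilon) {
    // Explore: pick any action uniformly at random.
    return Math.floor(rng() * qValues.length);
  }
  // Exploit: pick the action with the highest estimated value.
  let best = 0;
  for (let i = 1; i < qValues.length; i++) {
    if (qValues[i] > qValues[best]) best = i;
  }
  return best;
}

// Hypothetical Q-values for the four moves in 2048.
const q = [0.1, 0.7, 0.3, 0.2];
console.log(chooseAction(q, 0.01)); // the greedy move (index 1) about 99% of the time
```

With epsilon this low, the agent is almost always exploiting what it has already learned, so a flat reward curve probably points at the learned value estimates rather than at too much random play.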

anubisthejackle · 2015-11-29T13:45:26Z

I've noticed the same results. I'm largely convinced that the backing network architecture needs to be changed. I've been working on cleaning up the code to make that an easier process.

davidak · 2015-11-30T19:04:25Z

I started another run with 3075 games; same result.

experience replay size: 524667
exploration epsilon: 0.01
age: 524669
average Q-learning loss: 0.3571297194077231
smooth-ish reward: 0.5395184739721711

anubisthejackle · 2015-11-30T19:08:29Z

I'm thinking that switching to a Long Short-Term Memory network will improve the training speed. The convolution network seems to plateau really quickly. I've let this run overnight, and all day long, for multiple days, and I've never gotten 2048.
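For context on what the switch would change, here is a minimal sketch of one time step of a single-unit LSTM cell in plain JavaScript. It follows the standard LSTM equations (input, forget, and output gates plus a candidate value) and is not Synaptic's API. The function `lstmStep`, the weight layout `w`, and all numbers are assumptions made up for illustration.

```javascript
const sigmoid = (x) => 1 / (1 + Math.exp(-x));

// One time step of a single-unit LSTM cell with a scalar input.
// w holds hypothetical weights: for each gate g in {i, f, o, c},
// w[g] = { x: input weight, h: recurrent weight, b: bias }.
function lstmStep(x, hPrev, cPrev, w) {
  const gate = (g, act) => act(w[g].x * x + w[g].h * hPrev + w[g].b);
  const i = gate('i', sigmoid);      // input gate: how much new information to store
  const f = gate('f', sigmoid);      // forget gate: how much old cell state to keep
  const o = gate('o', sigmoid);      // output gate: how much cell state to expose
  const cand = gate('c', Math.tanh); // candidate value for the cell state
  const c = f * cPrev + i * cand;    // new cell state
  const h = o * Math.tanh(c);        // new hidden state (the cell's output)
  return { h, c };
}

// Run a short input sequence through the cell, carrying state between steps.
const weights = {
  i: { x: 0.5, h: 0.1, b: 0 },
  f: { x: 0.2, h: 0.1, b: 1 },
  o: { x: 0.3, h: 0.1, b: 0 },
  c: { x: 0.8, h: 0.2, b: 0 },
};
let state = { h: 0, c: 0 };
for (const x of [1, 0, 0]) {
  state = lstmStep(x, state.h, state.c, weights);
}
console.log(state);
```

The point of the sketch is the carried `c` and `h`: unlike a feed-forward network, which only sees the current input, the cell keeps state from earlier steps. Whether that actually helps on 2048 is still untested here.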

anubisthejackle added the enhancement label Apr 6, 2015
