Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Different Neural Network Algorithm #6

Open
anubisthejackle opened this issue Apr 6, 2015 · 4 comments
Open

Different Neural Network Algorithm #6

anubisthejackle opened this issue Apr 6, 2015 · 4 comments

Comments

@anubisthejackle
Copy link
Owner

It's likely that a convolution network isn't sufficient for this type of problem. I'm beginning to think that a better solution would be a Long Short-Term Network. This means using a different neural network platform than ConvNetJS. Synaptic is a quality system that's architecture free, and just so happens to have examples of how to setup a LST Net on their README.md

@davidak
Copy link
Contributor

davidak commented Nov 28, 2015

like you see in the chart i contributed there is only a small positive trend.

bildschirmfoto 2015-11-28 um 23 48 34

StateManager.scores.length: 636 (number of games)

experience replay size: 100002
exploration epsilon: 0.01
age: 100004
average Q-learning loss: 0.39981905616281765
smooth-ish reward: 0.5372073053150149

@anubisthejackle
Copy link
Owner Author

I've noticed the same results. I'm largely convinced that the backing network architecture needs to be changed. I've been working on cleaning up the code to make that an easier process.

@davidak
Copy link
Contributor

davidak commented Nov 30, 2015

i started another run with 3075 games, same result.

bildschirmfoto 2015-11-30 um 19 55 46

experience replay size: 524667
exploration epsilon: 0.01
age: 524669
average Q-learning loss: 0.3571297194077231
smooth-ish reward: 0.5395184739721711

@anubisthejackle
Copy link
Owner Author

I'm thinking that switching to a Long Short-Term Memory network will improve the training speed. The convolution network seems to plateau really quickly. I've let this run overnight, and all day long, for multiple days, and I've never gotten 2048.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants