<feed xmlns='http://www.w3.org/2005/Atom'>
<title>maze-rlnn, branch master</title>
<subtitle>Reinforcement learning with a neural network as Q-function in a maze (toy example)
</subtitle>
<id>https://windfis.ch/git/maze-rlnn/atom?h=master</id>
<link rel='self' href='https://windfis.ch/git/maze-rlnn/atom?h=master'/>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/'/>
<updated>2016-02-06T16:42:53+00:00</updated>
<entry>
<title>use keras. meh</title>
<updated>2016-02-06T16:42:53+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-02-06T16:42:53+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=03de67dc103c2557265d42f6e5b5d5b94375dd61'/>
<id>urn:sha1:03de67dc103c2557265d42f6e5b5d5b94375dd61</id>
<content type='text'>
</content>
</entry>
<entry>
<title>something with plots</title>
<updated>2016-02-05T00:17:03+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-02-05T00:17:03+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=330f513d059bdff3ff9ff96895a434332eda17f3'/>
<id>urn:sha1:330f513d059bdff3ff9ff96895a434332eda17f3</id>
<content type='text'>
</content>
</entry>
<entry>
<title>nn3.log</title>
<updated>2016-02-03T17:43:29+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-02-03T17:43:29+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=f352a2cdc621b78b2771748c5bc60b212480d08b'/>
<id>urn:sha1:f352a2cdc621b78b2771748c5bc60b212480d08b</id>
<content type='text'>
</content>
</entry>
<entry>
<title>proper frameskip for NN</title>
<updated>2016-02-03T17:09:19+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-02-03T17:09:19+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=f0ebbd21a6ab27e4b97d03a19e06b01eb9a2114a'/>
<id>urn:sha1:f0ebbd21a6ab27e4b97d03a19e06b01eb9a2114a</id>
<content type='text'>
</content>
</entry>
<entry>
<title>dirty hack: do not ever quit early</title>
<updated>2016-01-08T20:01:22+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-01-08T20:01:22+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=d8b98f347d0858128074054dea583772692365d0'/>
<id>urn:sha1:d8b98f347d0858128074054dea583772692365d0</id>
<content type='text'>
</content>
</entry>
<entry>
<title>graphs</title>
<updated>2016-01-08T20:00:41+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-01-08T20:00:41+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=92f25c46b2d3e7e22b62faa4ca450b29757e977a'/>
<id>urn:sha1:92f25c46b2d3e7e22b62faa4ca450b29757e977a</id>
<content type='text'>
</content>
</entry>
<entry>
<title>more doc</title>
<updated>2016-01-08T17:09:36+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-01-08T17:09:36+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=c77a636c82fcf5211beadab90371719f175ef954'/>
<id>urn:sha1:c77a636c82fcf5211beadab90371719f175ef954</id>
<content type='text'>
</content>
</entry>
<entry>
<title>fixed neural network approach:</title>
<updated>2016-01-08T17:07:19+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-01-08T17:07:19+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=2483f393d9a3740b35606ca6acb6cb2df8ffdcd2'/>
<id>urn:sha1:2483f393d9a3740b35606ca6acb6cb2df8ffdcd2</id>
<content type='text'>
- make input values fit into 0..1 range (not 0..5/0..7)
- reward of 0.5 instead of 10
- randomize initial weights much more to avoid degeneration
</content>
</entry>
<entry>
<title>display range fix</title>
<updated>2016-01-08T17:07:01+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-01-08T17:07:01+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=1d8d0cc92e55b8bd51afbbdab14dade5c646e39d'/>
<id>urn:sha1:1d8d0cc92e55b8bd51afbbdab14dade5c646e39d</id>
<content type='text'>
</content>
</entry>
<entry>
<title>--sleep flag</title>
<updated>2016-01-08T17:06:40+00:00</updated>
<author>
<name>Florian Jung</name>
<email>flo@windfisch.org</email>
</author>
<published>2016-01-08T17:06:40+00:00</published>
<link rel='alternate' type='text/html' href='https://windfis.ch/git/maze-rlnn/commit/?id=63a2121abe15a5d7e08d7e7a8f92fdbfbe3b5987'/>
<id>urn:sha1:63a2121abe15a5d7e08d7e7a8f92fdbfbe3b5987</id>
<content type='text'>
</content>
</entry>
</feed>
