From f352a2cdc621b78b2771748c5bc60b212480d08b Mon Sep 17 00:00:00 2001
From: Florian Jung
Date: Wed, 3 Feb 2016 18:43:29 +0100
Subject: nn3.log

---
 doc/doc.tex | 2 ++
 1 file changed, 2 insertions(+)

(limited to 'doc/doc.tex')

diff --git a/doc/doc.tex b/doc/doc.tex
index 71e5e1e..aa85061 100644
--- a/doc/doc.tex
+++ b/doc/doc.tex
@@ -35,6 +35,8 @@ In the graphs, which show the total reward earned as a function of the episodes
 Note that the nn and nn2 runs get a reward of only $0.5$ for reaching the goal, while the old and new runs get $10$.
 Therefore, the nn/nn2 results have been multiplied by $20$ to make them comparable.
 
+Then, there has been done a run with \verb+self.NN.randomize_weights+ in the range of $(-0.27, 0.27)$ instead of $(1,1)$.
+
 \gnuplotloadfile[terminal=pdf]{array_naive_vs_episodic.gnuplot}
 \gnuplotloadfile[terminal=pdf]{array_naive_vs_episodic_deriv.gnuplot}
 
-- 
cgit v1.2.1