Can be solved with Python 2.7 or Mathematica (10.2 or higher)
In this project you'll set up an environment (a 5x5 maze with walls), a robot, and simulator functions. The robot has a fixed starting point and chooses its actions (right, left, up, down) randomly, each with a given probability. Some rewards are placed in the maze. In the second part, you'll implement a Q-learning algorithm and then experiment with different values of Q, alpha and epsilon.
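To make the task concrete, here is a minimal sketch of how the maze simulator and a tabular Q-learning loop could fit together. It is not the solution to the attached exercise. The wall positions, reward cells and values, the terminal goal cell, the discount factor gamma, the step limit, and all function names are assumptions made up for illustration. Reading "values of Q" as the initial Q-table value (q_init) is also an assumption, and exploration here picks uniformly among the four actions rather than using exercise-specific probabilities. The code avoids version-specific syntax so it should run on Python 2.7 as well as Python 3.

```python
import random

SIZE = 5                                     # 5x5 maze, as in the description
START = (0, 0)                               # fixed starting point (position assumed)
WALLS = {(1, 1), (1, 3), (3, 1), (3, 3)}     # hypothetical blocked cells
REWARDS = {(4, 4): 10.0, (2, 2): 1.0}        # hypothetical reward cells
GOAL = (4, 4)                                # assumed terminal cell
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
ACTION_NAMES = sorted(ACTIONS)


def step(state, action):
    """Simulator: apply one action and return (next_state, reward, done)."""
    dr, dc = ACTIONS[action]
    nxt = (state[0] + dr, state[1] + dc)
    # Moves off the grid or into a wall leave the robot where it is.
    if not (0 <= nxt[0] < SIZE and 0 <= nxt[1] < SIZE) or nxt in WALLS:
        nxt = state
    return nxt, REWARDS.get(nxt, 0.0), nxt == GOAL


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTION_NAMES)
    best = max(q[(state, a)] for a in ACTION_NAMES)
    # Break ties randomly so an all-equal table does not favor one action.
    return rng.choice([a for a in ACTION_NAMES if q[(state, a)] == best])


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1,
               q_init=0.0, max_steps=200, seed=0):
    """Tabular Q-learning; returns a dict mapping (state, action) to a value."""
    rng = random.Random(seed)
    q = {}
    for r in range(SIZE):
        for c in range(SIZE):
            for a in ACTION_NAMES:
                q[((r, c), a)] = q_init
    for _ in range(episodes):
        state = START
        for _ in range(max_steps):
            action = choose_action(q, state, epsilon, rng)
            nxt, reward, done = step(state, action)
            # Terminal transitions do not bootstrap from the next state.
            future = 0.0 if done else max(q[(nxt, a)] for a in ACTION_NAMES)
            target = reward + gamma * future
            q[(state, action)] += alpha * (target - q[(state, action)])
            state = nxt
            if done:
                break
    return q
```

Calling q_learning() with different alpha, epsilon and q_init arguments and comparing the resulting tables is one way to run the experiments described above.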
Please find the whole exercise attached.
We'll need all the code for the solution.
9 freelancers are bidding an average of €137 for this job
Hi, my last project here was closely related to this one. I'm an electronics engineer, and I did the same thing in Python. Message me in chat to discuss further. Thank you.