I’ve developed a Java+Swing+Awt code for GridWorld (cfr. Barto-Sutton 1998). It implements the Value Iteration Reinforcement Learning algorithm. Enjoy!

.java .jar