I have found the RL4J examples, but would like more explanation about the toy game. I am creating my own MDP environment (not using a predefined gym game or MALMO). Is there documentation that explains the methods and setup of a RL environment. An example would be an explanation of all the properties in QLearning.QLConfiguration
Thank you,
Sean