Serialization with RL4j

marcus.frex · June 14, 2022, 10:42am

Hello everyone,

I am working with NStepQLearning model but it seems just saving ComputationGraph it is using is not enough because when i restored the model it does not give the same results given before.

Do you guys have any idea how? Or am I missing something?

agibsonccc · June 22, 2022, 12:39pm

@marcus.frex could you clarify your issue a bit? Also please note that rl4j was moved to a contrib module recently (1.0.0-M1.1 and newer) in new releases since it’s not heavily maintained.

Generally saving the weights should be enough. It’s hard to tell without knowing more info though.

marcus.frex · June 22, 2022, 2:33pm

@agibsonccc Probably I am missing something but let’s say a DQNPolicy trained with a specific QLearning configuration does not give the same score after I save it and load after a specific Epoch.

I noticed that even I use same DQNPolicy and run through the same MDP (Environment) separately it still does not return the same score. I event set target update frequency to 1 but it still does not returns the same score. I event saved MultiLayerNetwork manually but it does not gets the same score after I run again the same dataset.

Have you ever experienced something like that? What do you think am I missing?

marcus.frex · June 23, 2022, 7:40am

Ok, I found what am I missing. It seems that on training course it can create random actions to improve NNs abilitiy. All epoc results should get compared with actual data scores too.

Topic		Replies	Views
Where are the RL4J examples? RL4J	4	1140	January 5, 2023
Trying to use QLearning in a custom MDP environment. Chooses action 0 every time, despite the heavy negative reward RL4J	22	1522	May 6, 2021
RL4L Documentation Request RL4J	0	512	April 15, 2021
Migrating RL4J to DL4J Contrib RL4J	2	748	March 3, 2022
Is there a way to load a QLearningDiscreteDense Object after saving it? RL4J	0	212	October 3, 2023

Serialization with RL4j

Related topics