Policy-based algorithm

A question about RL4J. To my knowledge, RL4J currently supports value-based RL algorithms such as DQN (Double-DQN). Is there any plan to develop policy-based algorithms?