Unsupervised reinforcement learning method and apparatus based on Wasserstein distance

X Ji, S He, Y Jiang - US Patent 11,823,062, 2023 - Google Patents
this class of sequential decision problems, it is assumed that an agent needs to perceive
information from an environment (e.g., visual information obtained by a vision sensor of an …