Reinforcement Learning_Code_Blackjack_Monte Carlo Learning
2023-03-25 17:59:30 Source: Bilibili
Blackjack.py
Visualizations of the reward and of the learned policy are shown below, respectively.
The code above is based on the Gymnasium documentation tutorial "Solving Blackjack with Q-Learning" [1], but it solves Blackjack with Monte Carlo learning instead of Q-learning.
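To illustrate the difference from the tutorial's Q-learning agent, here is a minimal sketch of an every-visit Monte Carlo agent. It is not the original Blackjack.py. The class name `MonteCarloAgent`, its parameters, and the sample-average update are assumptions chosen for illustration. The key point is that the Q-values are updated only after an episode has finished, using the full discounted return, rather than after every step.

```python
import random
from collections import defaultdict


class MonteCarloAgent:
    """Hypothetical every-visit Monte Carlo control agent (illustrative only)."""

    def __init__(self, n_actions=2, epsilon=0.1, gamma=1.0, seed=None):
        self.n_actions = n_actions
        self.epsilon = epsilon  # exploration probability
        self.gamma = gamma      # discount factor
        # Q-value estimates and visit counts per (state, action)
        self.q = defaultdict(lambda: [0.0] * n_actions)
        self.visits = defaultdict(lambda: [0] * n_actions)
        self.rng = random.Random(seed)

    def act(self, state):
        """Epsilon-greedy action selection over the current Q estimates."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        values = self.q[state]
        return max(range(self.n_actions), key=values.__getitem__)

    def update(self, episode):
        """Update Q from one finished episode.

        `episode` is a list of (state, action, reward) tuples in the
        order they occurred. Returns are accumulated backwards, and each
        Q-value is moved toward the running average of observed returns.
        """
        g = 0.0
        for state, action, reward in reversed(episode):
            g = reward + self.gamma * g
            self.visits[state][action] += 1
            n = self.visits[state][action]
            self.q[state][action] += (g - self.q[state][action]) / n
```

With Gymnasium, one way to use such an agent would be to create the environment with `gym.make("Blackjack-v1")`, call `env.reset()` and then `env.step(action)` until the episode is terminated or truncated, append each `(observation, action, reward)` to a list, and pass that list to `update` once the episode ends.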
[1] https://gymnasium.farama.org/tutorials/training_agents/blackjack_tutorial/