Reinforcement Learning_Code_Blackjack_Monte Carlo Learning

2023-03-25 17:59:30     来源:哔哩哔哩


(资料图片)

Blackjack.py

Visualization of reward and policy are are respectively shown below.

The above codes are based on Gymnasium Documentation's tutorial "Solving Blackjack with Q-Learning", but solving Backjack with Monte Carlo learning. 

[1] https://gymnasium.farama.org/tutorials/training_agents/blackjack_tutorial/

关键词:

明星

电影