Get Started with Jupyter Notebook
In this tutorial, we use Google Colaboratory to show the basic usage of the common building blocks in Tianshou. You will be guided step by step through how the different modules in Tianshou work together to run a classic deep reinforcement learning (DRL) experiment: the PPO algorithm on the CartPole-v0 environment.
L0: Overview
L1: Batch
L2: Replay Buffer
L4: Policy
L5: Collector
L6: Trainer
L7: Experiment