Tianshou
stable
Index
_ | A | B | C | D | E | F | G | H | I | L | M | N | O | P | Q | R | S | T | U | V | W
_
__call__() (tianshou.exploration.BaseNoise method)
(tianshou.exploration.GaussianNoise method)
(tianshou.exploration.OUNoise method)
__getitem__() (tianshou.data.Batch method)
(tianshou.data.PrioritizedReplayBuffer method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.SegmentTree method)
__len__() (tianshou.data.Batch method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
(tianshou.data.SegmentTree method)
(tianshou.env.BaseVectorEnv method)
(tianshou.env.VectorEnvWrapper method)
__setitem__() (tianshou.data.Batch method)
(tianshou.data.SegmentTree method)
A
A2CPolicy (class in tianshou.policy)
action() (tianshou.env.ContinuousToDiscrete method)
action_spaces (tianshou.env.PettingZooEnv attribute)
Actor (class in tianshou.utils.net.continuous)
(class in tianshou.utils.net.discrete)
actor (tianshou.policy.CQLPolicy attribute)
(tianshou.policy.DiscreteSACPolicy attribute)
(tianshou.policy.REDQPolicy attribute)
(tianshou.policy.SACPolicy attribute)
(tianshou.policy.TD3BCPolicy attribute)
(tianshou.policy.TD3Policy attribute)
actor_optim (tianshou.policy.CQLPolicy attribute)
(tianshou.policy.DiscreteSACPolicy attribute)
(tianshou.policy.REDQPolicy attribute)
(tianshou.policy.SACPolicy attribute)
(tianshou.policy.TD3BCPolicy attribute)
(tianshou.policy.TD3Policy attribute)
actor_pred() (tianshou.policy.CQLPolicy method)
ActorCritic (class in tianshou.utils.net.common)
ActorProb (class in tianshou.utils.net.continuous)
add() (tianshou.data.CachedReplayBuffer method)
(tianshou.data.HERReplayBuffer method)
(tianshou.data.HERReplayBufferManager method)
(tianshou.data.PrioritizedReplayBuffer method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
(tianshou.utils.MovAvg method)
agent_selection (tianshou.env.PettingZooEnv attribute)
agents (tianshou.env.PettingZooEnv attribute)
AsyncCollector (class in tianshou.data)
B
BaseLogger (class in tianshou.utils)
BaseNoise (class in tianshou.exploration)
BasePolicy (class in tianshou.policy)
BaseVectorEnv (class in tianshou.env)
BasicLogger (class in tianshou.utils)
Batch (class in tianshou.data)
BCQPolicy (class in tianshou.policy)
BranchingDQNPolicy (class in tianshou.policy)
BranchingNet (class in tianshou.utils.net.common)
C
C51Policy (class in tianshou.policy)
CachedReplayBuffer (class in tianshou.data)
calc_actor_loss() (tianshou.policy.CQLPolicy method)
calc_pi_values() (tianshou.policy.CQLPolicy method)
calc_random_values() (tianshou.policy.CQLPolicy method)
cat() (tianshou.data.Batch static method)
cat_() (tianshou.data.Batch method)
close() (tianshou.env.BaseVectorEnv method)
(tianshou.env.PettingZooEnv method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.EnvWorker method)
close_env() (tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
collect() (tianshou.data.AsyncCollector method)
(tianshou.data.Collector method)
Collector (class in tianshou.data)
compute_episodic_return() (tianshou.policy.BasePolicy static method)
compute_nstep_return() (tianshou.policy.BasePolicy static method)
compute_q_value() (tianshou.policy.C51Policy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.QRDQNPolicy method)
ContinuousToDiscrete (class in tianshou.env)
CosineEmbeddingNetwork (class in tianshou.utils.net.discrete)
CQLPolicy (class in tianshou.policy)
Critic (class in tianshou.utils.net.continuous)
(class in tianshou.utils.net.discrete)
critic (tianshou.policy.CQLPolicy attribute)
(tianshou.policy.DiscreteSACPolicy attribute)
(tianshou.policy.REDQPolicy attribute)
(tianshou.policy.SACPolicy attribute)
(tianshou.policy.TD3BCPolicy attribute)
(tianshou.policy.TD3Policy attribute)
critic_optim (tianshou.policy.CQLPolicy attribute)
(tianshou.policy.DiscreteSACPolicy attribute)
(tianshou.policy.REDQPolicy attribute)
(tianshou.policy.SACPolicy attribute)
(tianshou.policy.TD3BCPolicy attribute)
(tianshou.policy.TD3Policy attribute)
D
DataParallelNet (class in tianshou.utils.net.common)
DDPGPolicy (class in tianshou.policy)
decode() (tianshou.utils.net.continuous.VAE method)
deprecation() (in module tianshou.utils)
disc() (tianshou.policy.GAILPolicy method)
DiscreteBCQPolicy (class in tianshou.policy)
DiscreteCQLPolicy (class in tianshou.policy)
DiscreteCRRPolicy (class in tianshou.policy)
DiscreteSACPolicy (class in tianshou.policy)
DQNPolicy (class in tianshou.policy)
DummyEnvWorker (class in tianshou.env.worker)
DummyTqdm (class in tianshou.utils)
DummyVectorEnv (class in tianshou.env)
E
empty() (tianshou.data.Batch static method)
empty_() (tianshou.data.Batch method)
EnsembleLinear (class in tianshou.utils.net.common)
EnvWorker (class in tianshou.env.worker)
exploration_noise() (tianshou.policy.BasePolicy method)
(tianshou.policy.BranchingDQNPolicy method)
(tianshou.policy.DDPGPolicy method)
(tianshou.policy.DiscreteSACPolicy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.ICMPolicy method)
(tianshou.policy.MultiAgentPolicyManager method)
F
f() (tianshou.utils.net.discrete.NoisyLinear method)
forward() (tianshou.policy.BasePolicy method)
(tianshou.policy.BCQPolicy method)
(tianshou.policy.BranchingDQNPolicy method)
(tianshou.policy.DDPGPolicy method)
(tianshou.policy.DiscreteBCQPolicy method)
(tianshou.policy.DiscreteSACPolicy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.FQFPolicy method)
(tianshou.policy.ICMPolicy method)
(tianshou.policy.ImitationPolicy method)
(tianshou.policy.IQNPolicy method)
(tianshou.policy.MultiAgentPolicyManager method)
(tianshou.policy.PGPolicy method)
(tianshou.policy.PSRLPolicy method)
(tianshou.policy.RandomPolicy method)
(tianshou.policy.REDQPolicy method)
(tianshou.policy.SACPolicy method)
(tianshou.utils.net.common.BranchingNet method)
(tianshou.utils.net.common.DataParallelNet method)
(tianshou.utils.net.common.EnsembleLinear method)
(tianshou.utils.net.common.MLP method)
(tianshou.utils.net.common.Net method)
(tianshou.utils.net.common.Recurrent method)
(tianshou.utils.net.continuous.Actor method)
(tianshou.utils.net.continuous.ActorProb method)
(tianshou.utils.net.continuous.Critic method)
(tianshou.utils.net.continuous.Perturbation method)
(tianshou.utils.net.continuous.RecurrentActorProb method)
(tianshou.utils.net.continuous.RecurrentCritic method)
(tianshou.utils.net.continuous.VAE method)
(tianshou.utils.net.discrete.Actor method)
(tianshou.utils.net.discrete.CosineEmbeddingNetwork method)
(tianshou.utils.net.discrete.Critic method)
(tianshou.utils.net.discrete.FractionProposalNetwork method)
(tianshou.utils.net.discrete.FullQuantileFunction method)
(tianshou.utils.net.discrete.ImplicitQuantileNetwork method)
(tianshou.utils.net.discrete.IntrinsicCuriosityModule method)
(tianshou.utils.net.discrete.NoisyLinear method)
FQFPolicy (class in tianshou.policy)
FractionProposalNetwork (class in tianshou.utils.net.discrete)
from_data() (tianshou.data.ReplayBuffer class method)
FullQuantileFunction (class in tianshou.utils.net.discrete)
G
GAILPolicy (class in tianshou.policy)
gather_info() (in module tianshou.trainer)
GaussianNoise (class in tianshou.exploration)
get() (tianshou.data.ReplayBuffer method)
(tianshou.utils.MovAvg method)
get_dict_state_decorator() (in module tianshou.utils.net.common)
get_env_attr() (tianshou.env.BaseVectorEnv method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
get_obs_rms() (tianshou.env.VectorEnvNormObs method)
get_prefix_sum_idx() (tianshou.data.SegmentTree method)
get_weight() (tianshou.data.PrioritizedReplayBuffer method)
H
HERReplayBuffer (class in tianshou.data)
HERReplayBufferManager (class in tianshou.data)
HERVectorReplayBuffer (class in tianshou.data)
I
ICMPolicy (class in tianshou.policy)
ImitationPolicy (class in tianshou.policy)
ImplicitQuantileNetwork (class in tianshou.utils.net.discrete)
infos (tianshou.env.PettingZooEnv attribute)
init_weight() (tianshou.data.PrioritizedReplayBuffer method)
IntrinsicCuriosityModule (class in tianshou.utils.net.discrete)
IQNPolicy (class in tianshou.policy)
is_empty() (tianshou.data.Batch method)
L
LazyLogger (class in tianshou.utils)
learn() (tianshou.policy.A2CPolicy method)
(tianshou.policy.BasePolicy method)
(tianshou.policy.BCQPolicy method)
(tianshou.policy.BranchingDQNPolicy method)
(tianshou.policy.C51Policy method)
(tianshou.policy.CQLPolicy method)
(tianshou.policy.DDPGPolicy method)
(tianshou.policy.DiscreteBCQPolicy method)
(tianshou.policy.DiscreteCQLPolicy method)
(tianshou.policy.DiscreteCRRPolicy method)
(tianshou.policy.DiscreteSACPolicy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.FQFPolicy method)
(tianshou.policy.GAILPolicy method)
(tianshou.policy.ICMPolicy method)
(tianshou.policy.ImitationPolicy method)
(tianshou.policy.IQNPolicy method)
(tianshou.policy.MultiAgentPolicyManager method)
(tianshou.policy.NPGPolicy method)
(tianshou.policy.PGPolicy method)
(tianshou.policy.PPOPolicy method)
(tianshou.policy.PSRLPolicy method)
(tianshou.policy.QRDQNPolicy method)
(tianshou.policy.RainbowPolicy method)
(tianshou.policy.RandomPolicy method)
(tianshou.policy.REDQPolicy method)
(tianshou.policy.SACPolicy method)
(tianshou.policy.TD3BCPolicy method)
(tianshou.policy.TD3Policy method)
(tianshou.policy.TRPOPolicy method)
load() (tianshou.utils.WandbLogger method)
load_hdf5() (tianshou.data.ReplayBuffer class method)
load_state_dict() (tianshou.utils.MultipleLRSchedulers method)
log_test_data() (tianshou.utils.BaseLogger method)
log_train_data() (tianshou.utils.BaseLogger method)
log_update_data() (tianshou.utils.BaseLogger method)
M
map_action() (tianshou.policy.BasePolicy method)
map_action_inverse() (tianshou.policy.BasePolicy method)
mean() (tianshou.utils.MovAvg method)
metadata (tianshou.env.PettingZooEnv attribute)
miniblock() (in module tianshou.utils.net.common)
MLP (class in tianshou.utils.net.common)
module
tianshou.exploration
tianshou.utils
tianshou.utils.net.common
tianshou.utils.net.continuous
tianshou.utils.net.discrete
MovAvg (class in tianshou.utils)
MultiAgentPolicyManager (class in tianshou.policy)
MultipleLRSchedulers (class in tianshou.utils)
N
Net (class in tianshou.utils.net.common)
next() (tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
NoisyLinear (class in tianshou.utils.net.discrete)
norm() (tianshou.utils.RunningMeanStd method)
NPGPolicy (class in tianshou.policy)
O
observation_spaces (tianshou.env.PettingZooEnv attribute)
offline_trainer() (in module tianshou.trainer)
offline_trainer_iter (in module tianshou.trainer)
OfflineTrainer (class in tianshou.trainer)
offpolicy_trainer() (in module tianshou.trainer)
offpolicy_trainer_iter (in module tianshou.trainer)
OffpolicyTrainer (class in tianshou.trainer)
onpolicy_trainer() (in module tianshou.trainer)
onpolicy_trainer_iter (in module tianshou.trainer)
OnpolicyTrainer (class in tianshou.trainer)
OUNoise (class in tianshou.exploration)
P
Perturbation (class in tianshou.utils.net.continuous)
PettingZooEnv (class in tianshou.env)
PGPolicy (class in tianshou.policy)
policy_update_fn() (tianshou.trainer.OfflineTrainer method)
(tianshou.trainer.OffpolicyTrainer method)
(tianshou.trainer.OnpolicyTrainer method)
possible_agents (tianshou.env.PettingZooEnv attribute)
post_process_fn() (tianshou.policy.BasePolicy method)
(tianshou.policy.ICMPolicy method)
PPOPolicy (class in tianshou.policy)
prev() (tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
PrioritizedReplayBuffer (class in tianshou.data)
PrioritizedReplayBufferManager (class in tianshou.data)
PrioritizedVectorReplayBuffer (class in tianshou.data)
process_fn() (tianshou.policy.A2CPolicy method)
(tianshou.policy.BasePolicy method)
(tianshou.policy.BranchingDQNPolicy method)
(tianshou.policy.CQLPolicy method)
(tianshou.policy.DDPGPolicy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.GAILPolicy method)
(tianshou.policy.ICMPolicy method)
(tianshou.policy.MultiAgentPolicyManager method)
(tianshou.policy.NPGPolicy method)
(tianshou.policy.PGPolicy method)
(tianshou.policy.PPOPolicy method)
PSRLPolicy (class in tianshou.policy)
Q
QRDQNPolicy (class in tianshou.policy)
R
RainbowPolicy (class in tianshou.policy)
RandomPolicy (class in tianshou.policy)
RayEnvWorker (class in tianshou.env.worker)
RayVectorEnv (class in tianshou.env)
Recurrent (class in tianshou.utils.net.common)
RecurrentActorProb (class in tianshou.utils.net.continuous)
RecurrentCritic (class in tianshou.utils.net.continuous)
recv() (tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
REDQPolicy (class in tianshou.policy)
reduce() (tianshou.data.SegmentTree method)
render() (tianshou.env.BaseVectorEnv method)
(tianshou.env.PettingZooEnv method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
replace_policy() (tianshou.policy.MultiAgentPolicyManager method)
ReplayBuffer (class in tianshou.data)
ReplayBufferManager (class in tianshou.data)
reset() (tianshou.data.Collector method)
(tianshou.data.HERReplayBuffer method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
(tianshou.env.BaseVectorEnv method)
(tianshou.env.PettingZooEnv method)
(tianshou.env.VectorEnvNormObs method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
(tianshou.exploration.BaseNoise method)
(tianshou.exploration.OUNoise method)
(tianshou.utils.net.discrete.NoisyLinear method)
reset_buffer() (tianshou.data.Collector method)
reset_env() (tianshou.data.AsyncCollector method)
(tianshou.data.Collector method)
reset_stat() (tianshou.data.Collector method)
restore_data() (tianshou.utils.BaseLogger method)
(tianshou.utils.LazyLogger method)
(tianshou.utils.TensorboardLogger method)
(tianshou.utils.WandbLogger method)
rewards (tianshou.env.PettingZooEnv attribute)
rewrite_transitions() (tianshou.data.HERReplayBuffer method)
RunningMeanStd (class in tianshou.utils)
S
SACPolicy (class in tianshou.policy)
sample() (tianshou.data.ReplayBuffer method)
(tianshou.utils.net.discrete.NoisyLinear method)
sample_indices() (tianshou.data.HERReplayBuffer method)
(tianshou.data.PrioritizedReplayBuffer method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
sample_noise() (in module tianshou.utils.net.discrete)
save_data() (tianshou.utils.BaseLogger method)
(tianshou.utils.LazyLogger method)
(tianshou.utils.TensorboardLogger method)
(tianshou.utils.WandbLogger method)
save_hdf5() (tianshou.data.HERReplayBuffer method)
(tianshou.data.HERReplayBufferManager method)
(tianshou.data.ReplayBuffer method)
seed() (tianshou.env.BaseVectorEnv method)
(tianshou.env.PettingZooEnv method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
SegmentTree (class in tianshou.data)
send() (tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
set_agent_id() (tianshou.policy.BasePolicy method)
set_batch() (tianshou.data.HERReplayBuffer method)
(tianshou.data.HERReplayBufferManager method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
set_beta() (tianshou.data.PrioritizedReplayBuffer method)
(tianshou.data.PrioritizedVectorReplayBuffer method)
set_env_attr() (tianshou.env.BaseVectorEnv method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.DummyEnvWorker method)
(tianshou.env.worker.EnvWorker method)
(tianshou.env.worker.RayEnvWorker method)
(tianshou.env.worker.SubprocEnvWorker method)
set_eps() (tianshou.policy.DQNPolicy method)
(tianshou.policy.ICMPolicy method)
set_exp_noise() (tianshou.policy.DDPGPolicy method)
set_obs_rms() (tianshou.env.VectorEnvNormObs method)
set_postfix() (tianshou.utils.DummyTqdm method)
shape (tianshou.data.Batch property)
ShmemVectorEnv (class in tianshou.env)
soft_update() (tianshou.policy.BasePolicy method)
split() (tianshou.data.Batch method)
stack() (tianshou.data.Batch static method)
stack_() (tianshou.data.Batch method)
state_dict() (tianshou.utils.MultipleLRSchedulers method)
std() (tianshou.utils.MovAvg method)
step() (tianshou.env.BaseVectorEnv method)
(tianshou.env.PettingZooEnv method)
(tianshou.env.VectorEnvNormObs method)
(tianshou.env.VectorEnvWrapper method)
(tianshou.env.worker.EnvWorker method)
(tianshou.utils.MultipleLRSchedulers method)
SubprocEnvWorker (class in tianshou.env.worker)
SubprocVectorEnv (class in tianshou.env)
sync_weight() (tianshou.policy.BCQPolicy method)
(tianshou.policy.CQLPolicy method)
(tianshou.policy.DDPGPolicy method)
(tianshou.policy.DiscreteCRRPolicy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.REDQPolicy method)
(tianshou.policy.SACPolicy method)
(tianshou.policy.TD3Policy method)
T
TD3BCPolicy (class in tianshou.policy)
TD3Policy (class in tianshou.policy)
TensorboardLogger (class in tianshou.utils)
terminations (tianshou.env.PettingZooEnv attribute)
test_episode() (in module tianshou.trainer)
tianshou.exploration (module)
tianshou.utils (module)
tianshou.utils.net.common (module)
tianshou.utils.net.continuous (module)
tianshou.utils.net.discrete (module)
to_numpy() (in module tianshou.data)
(tianshou.data.Batch method)
to_torch() (in module tianshou.data)
(tianshou.data.Batch method)
to_torch_as() (in module tianshou.data)
train() (tianshou.policy.BCQPolicy method)
(tianshou.policy.CQLPolicy method)
(tianshou.policy.DDPGPolicy method)
(tianshou.policy.DiscreteBCQPolicy method)
(tianshou.policy.DQNPolicy method)
(tianshou.policy.ICMPolicy method)
(tianshou.policy.REDQPolicy method)
(tianshou.policy.SACPolicy method)
(tianshou.policy.TD3Policy method)
training (tianshou.policy.A2CPolicy attribute)
(tianshou.policy.BasePolicy attribute)
(tianshou.policy.BCQPolicy attribute)
(tianshou.policy.BranchingDQNPolicy attribute)
(tianshou.policy.C51Policy attribute)
(tianshou.policy.CQLPolicy attribute)
(tianshou.policy.DDPGPolicy attribute)
(tianshou.policy.DiscreteBCQPolicy attribute)
(tianshou.policy.DiscreteCQLPolicy attribute)
(tianshou.policy.DiscreteCRRPolicy attribute)
(tianshou.policy.DiscreteSACPolicy attribute)
(tianshou.policy.DQNPolicy attribute)
(tianshou.policy.FQFPolicy attribute)
(tianshou.policy.GAILPolicy attribute)
(tianshou.policy.ICMPolicy attribute)
(tianshou.policy.ImitationPolicy attribute)
(tianshou.policy.IQNPolicy attribute)
(tianshou.policy.MultiAgentPolicyManager attribute)
(tianshou.policy.NPGPolicy attribute)
(tianshou.policy.PGPolicy attribute)
(tianshou.policy.PPOPolicy attribute)
(tianshou.policy.PSRLPolicy attribute)
(tianshou.policy.QRDQNPolicy attribute)
(tianshou.policy.RainbowPolicy attribute)
(tianshou.policy.RandomPolicy attribute)
(tianshou.policy.REDQPolicy attribute)
(tianshou.policy.SACPolicy attribute)
(tianshou.policy.TD3BCPolicy attribute)
(tianshou.policy.TD3Policy attribute)
(tianshou.policy.TRPOPolicy attribute)
(tianshou.utils.net.common.ActorCritic attribute)
(tianshou.utils.net.common.BranchingNet attribute)
(tianshou.utils.net.common.DataParallelNet attribute)
(tianshou.utils.net.common.EnsembleLinear attribute)
(tianshou.utils.net.common.MLP attribute)
(tianshou.utils.net.common.Net attribute)
(tianshou.utils.net.common.Recurrent attribute)
(tianshou.utils.net.continuous.Actor attribute)
(tianshou.utils.net.continuous.ActorProb attribute)
(tianshou.utils.net.continuous.Critic attribute)
(tianshou.utils.net.continuous.Perturbation attribute)
(tianshou.utils.net.continuous.RecurrentActorProb attribute)
(tianshou.utils.net.continuous.RecurrentCritic attribute)
(tianshou.utils.net.continuous.VAE attribute)
(tianshou.utils.net.discrete.Actor attribute)
(tianshou.utils.net.discrete.CosineEmbeddingNetwork attribute)
(tianshou.utils.net.discrete.Critic attribute)
(tianshou.utils.net.discrete.FractionProposalNetwork attribute)
(tianshou.utils.net.discrete.FullQuantileFunction attribute)
(tianshou.utils.net.discrete.ImplicitQuantileNetwork attribute)
(tianshou.utils.net.discrete.IntrinsicCuriosityModule attribute)
(tianshou.utils.net.discrete.NoisyLinear attribute)
TRPOPolicy (class in tianshou.policy)
truncations (tianshou.env.PettingZooEnv attribute)
U
unfinished_index() (tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
update() (tianshou.data.Batch method)
(tianshou.data.HERReplayBuffer method)
(tianshou.data.HERReplayBufferManager method)
(tianshou.data.PrioritizedReplayBuffer method)
(tianshou.data.ReplayBuffer method)
(tianshou.data.ReplayBufferManager method)
(tianshou.policy.BasePolicy method)
(tianshou.utils.DummyTqdm method)
(tianshou.utils.RunningMeanStd method)
update_weight() (tianshou.data.PrioritizedReplayBuffer method)
V
VAE (class in tianshou.utils.net.continuous)
value_mask() (tianshou.policy.BasePolicy static method)
VectorEnvNormObs (class in tianshou.env)
VectorEnvWrapper (class in tianshou.env)
VectorReplayBuffer (class in tianshou.data)
W
wait() (tianshou.env.worker.DummyEnvWorker static method)
(tianshou.env.worker.EnvWorker static method)
(tianshou.env.worker.RayEnvWorker static method)
(tianshou.env.worker.SubprocEnvWorker static method)
WandbLogger (class in tianshou.utils)
write() (tianshou.utils.BaseLogger method)
(tianshou.utils.LazyLogger method)
(tianshou.utils.TensorboardLogger method)
(tianshou.utils.WandbLogger method)