Open
Description
- `break_when_all_done` in `env.rollout()` [Feature Request] `break_when_all_done` in `env.rollout()` #2355
- Partial steps in env [Feature Request] Partial steps in env #2356
- `BatchedEnv`: pass the indices of envs where a step should be done
- `BatchedEnv`: index a `BatchedEnv`
- Stateless env: discard done parts of the TD
- Scores [Feature] MCTS Scoring functions #2358
- PUCT (missing tests)
- UCB (missing tests)
- UCB1_TUNED
- EXP3
- PUCT_VARIANT
- `TensorSpec.enumerate()`
- Storage
- Hashing functions [Feature] TensorDictMap hashing functions #2304
- Query modules [Feature] TensorDictMap Query module #2305
- Map [Feature] TensorDictMap #2306
- MCTSForest [Feature] MCTSForest #2307
- Policy classes [Feature] MCTS policy #2359
- MCTSPolicyBase
- MCTSPolicy
- AlphaGoPolicy
- AlphaStarPolicy
- MuZeroPolicy
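
For reference, the two scores flagged above as missing tests (UCB and PUCT) can be sketched in plain Python as follows. This is only an illustration of the textbook formulas, not the TorchRL implementation: the function names, argument names, and default constants (`ucb1_score`, `puct_score`, `c`, `c_puct`) are hypothetical, and giving unvisited children an infinite UCB1 score is an assumed convention.

```python
import math


def ucb1_score(mean_value, parent_visits, child_visits, c=math.sqrt(2)):
    """UCB1: mean value plus an exploration bonus that shrinks with visits.

    Hypothetical helper; with c = sqrt(2) this is the classic
    mean + sqrt(2 * ln(N_parent) / N_child).
    """
    if child_visits == 0:
        # Assumed convention: unvisited children are always tried first.
        return math.inf
    return mean_value + c * math.sqrt(math.log(parent_visits) / child_visits)


def puct_score(q_value, prior, parent_visits, child_visits, c_puct=1.0):
    """PUCT as used in AlphaGo-style search.

    Q + c_puct * P * sqrt(N_parent) / (1 + N_child). Hypothetical helper.
    """
    return q_value + c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)


def select_child(children, parent_visits):
    """Return the index of the child with the highest PUCT score.

    `children` is a list of (q_value, prior, visit_count) tuples.
    """
    scores = [puct_score(q, p, parent_visits, n) for q, p, n in children]
    return max(range(len(scores)), key=scores.__getitem__)
```

A test for either score could pin down a few hand-computed values (for example, an unvisited child under UCB1, or a PUCT score with small integer visit counts) before checking the batched tensor version against this scalar reference.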