DTLossΒΆ
- class torchrl.objectives.DTLoss(*args, **kwargs)[source]ΒΆ
TorchRL implementation of the Online Decision Transformer loss.
Presented in βDecision Transformer: Reinforcement Learning via Sequence Modelingβ <https://arxiv.org/abs/2106.01345>
- Parameters:
actor_network (ProbabilisticActor) β stochastic actor
- Keyword Arguments:
loss_function (str) β loss function to use. Defaults to
"l2"
.reduction (str, optional) β Specifies the reduction to apply to the output:
"none"
|"mean"
|"sum"
."none"
: no reduction will be applied,"mean"
: the sum of the output will be divided by the number of elements in the output,"sum"
: the output will be summed. Default:"mean"
.
- default_keysΒΆ
alias of
_AcceptedKeys