scope_rl.ope.weight_value_learning.function.DiscreteStateActionWeightFunction
class scope_rl.ope.weight_value_learning.function.DiscreteStateActionWeightFunction(n_actions, state_dim, hidden_dim=100, enable_gradient_reversal=False, device='cuda:0')
State-action weight function (for discrete action spaces).
Bases: torch.nn.Module
Imported as: scope_rl.ope.weight_value_learning.function.DiscreteStateActionWeightFunction
Parameters:
n_actions (int (> 0)) – Number of actions.
state_dim (int (> 0)) – Dimension of the state space.
hidden_dim (int (> 0), default=100) – Hidden dimension of the network.
enable_gradient_reversal (bool, default=False) – Whether to enable the gradient reversal layer (for loss maximization).
device (str, default="cuda:0") – Device used by torch.
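The enable_gradient_reversal option is easier to picture with a small example. The following is a minimal sketch only, not the SCOPE-RL implementation: ToyDiscreteWeight, GradientReversal, and parameter_grads are hypothetical names, and the two-layer architecture, the softplus output, the placement of the reversal on the output, and the forward(state, action) signature are all assumptions. Only the constructor arguments mirror the parameter list above. The sketch shows the general technique: a gradient reversal layer is the identity in the forward pass and negates gradients in the backward pass, so an optimizer that minimizes a loss ends up maximizing it with respect to this network's parameters.

```python
import torch
from torch import nn


class GradientReversal(torch.autograd.Function):
    """Identity in the forward pass; negates the gradient in the backward pass."""

    @staticmethod
    def forward(ctx, x):
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output.neg()


class ToyDiscreteWeight(nn.Module):
    """Hypothetical stand-in for illustration; NOT the SCOPE-RL class."""

    def __init__(self, n_actions, state_dim, hidden_dim=100,
                 enable_gradient_reversal=False, device="cpu"):
        super().__init__()
        self.enable_gradient_reversal = enable_gradient_reversal
        # Assumed architecture: one hidden layer, one output per action.
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, n_actions),
            nn.Softplus(),  # assumption: keeps the weights positive
        )
        self.to(device)

    def forward(self, state, action):
        weights = self.net(state)  # shape: (batch, n_actions)
        # Pick the weight of the action taken in each row.
        w = weights.gather(1, action.view(-1, 1)).squeeze(1)  # shape: (batch,)
        if self.enable_gradient_reversal:
            w = GradientReversal.apply(w)
        return w


def parameter_grads(enable_gradient_reversal):
    """Return parameter gradients of sum(w) for a fixed seed and batch."""
    torch.manual_seed(0)
    model = ToyDiscreteWeight(
        n_actions=3, state_dim=4, hidden_dim=8,
        enable_gradient_reversal=enable_gradient_reversal,
    )
    state = torch.randn(5, 4)
    action = torch.tensor([0, 2, 1, 1, 0])
    model(state, action).sum().backward()
    return [p.grad.clone() for p in model.parameters()]
```

Calling parameter_grads(False) and parameter_grads(True) yields gradients of equal magnitude and opposite sign, while the forward outputs are identical. The sketch defaults to device="cpu" so it runs without a GPU; the documented class defaults to "cuda:0".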
Methods