scope_rl.ope.weight_value_learning.function.DiscreteQFunction#
- class scope_rl.ope.weight_value_learning.function.DiscreteQFunction(n_actions, state_dim, hidden_dim=100, device='cuda:0')[source]#
Q Function (for discrete action space).
Bases:
torch.nn.ModuleImported as:
scope_rl.ope.weight_value_learning.function.DiscreteQFunction- Parameters:
Methods
all
argmax
expectation
max
Methods