scope_rl.ope.weight_value_learning.function.DiscreteQFunction

class scope_rl.ope.weight_value_learning.function.DiscreteQFunction(n_actions, state_dim, hidden_dim=100, device='cuda:0')

Q-function for a discrete action space.

Bases: torch.nn.Module

Imported as: scope_rl.ope.weight_value_learning.function.DiscreteQFunction

Parameters:
  • n_actions (int (> 0)) – Number of actions.

  • state_dim (int (> 0)) – Dimension of the state space.

  • hidden_dim (int (> 0), default=100) – Hidden dimension of the network.

  • device (str, default="cuda:0") – Device used by torch (e.g., "cuda:0" or "cpu").

Methods

all

argmax

expectation

max
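
The method names above suggest the usual operations on a discrete-action Q-function: all action values for a state, the greedy value and action, and the expected value under a policy's action distribution. The sketch below illustrates that reading in plain Python. It is not SCOPE-RL's implementation: `ToyDiscreteQ`, its per-state (unbatched) method signatures, the `seed` argument, and the random one-hidden-layer network are all hypothetical and chosen only for illustration. The real class is a torch.nn.Module, and its method signatures should be checked against the library source.

```python
import random
from typing import List, Sequence


class ToyDiscreteQ:
    """Hypothetical stand-in for a discrete-action Q-function (not SCOPE-RL code)."""

    def __init__(self, n_actions: int, state_dim: int, hidden_dim: int = 4, seed: int = 0):
        rng = random.Random(seed)
        # Random weights for a one-hidden-layer network: state -> hidden -> n_actions.
        self.w1 = [[rng.uniform(-1, 1) for _ in range(state_dim)] for _ in range(hidden_dim)]
        self.w2 = [[rng.uniform(-1, 1) for _ in range(hidden_dim)] for _ in range(n_actions)]

    def all(self, state: Sequence[float]) -> List[float]:
        """Q(s, a) for every action a, as a list of length n_actions."""
        hidden = [max(0.0, sum(w * s for w, s in zip(row, state))) for row in self.w1]
        return [sum(w * h for w, h in zip(row, hidden)) for row in self.w2]

    def max(self, state: Sequence[float]) -> float:
        """Greedy value: max over actions of Q(s, a)."""
        return max(self.all(state))

    def argmax(self, state: Sequence[float]) -> int:
        """Greedy action: the index of the largest Q(s, a)."""
        q_values = self.all(state)
        return max(range(len(q_values)), key=q_values.__getitem__)

    def expectation(self, state: Sequence[float], action_probs: Sequence[float]) -> float:
        """Expected value: sum over actions of pi(a | s) * Q(s, a)."""
        return sum(p * q for p, q in zip(action_probs, self.all(state)))


q_func = ToyDiscreteQ(n_actions=3, state_dim=2)
state = [0.5, -1.0]
q_values = q_func.all(state)                 # one value per action
greedy_action = q_func.argmax(state)         # index into q_values
uniform_value = q_func.expectation(state, [1 / 3, 1 / 3, 1 / 3])
```

Under this reading, `max` always equals the entry of `all` selected by `argmax`, and `expectation` with a one-hot distribution reduces to the Q-value of the chosen action.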
