SCOPE-RL Package Reference#
dataset module#
Abstract base class for logged dataset. |
|
Class to handle synthetic dataset generation. |
policy module#
Wrapper class to convert greedy policy into stochastic. |
|
Meta class to handle Offline Learning (ORL). |
ope module#
pipeline#
Meta class to create input for Off-Policy Evaluation (OPE). |
|
Meta class to handle standard and cumulative distribution OPE. |
|
Meta class to handle Off-Policy Selection (OPS) and evaluation of OPE/OPS. |
OPE estimators#
Abstract base class for Off-Policy Estimator. |
|
Off-Policy Estimators for discrete action cases. |
|
Off-Policy Estimators for continuous action cases (designed for deterministic evaluation policies). |
|
State(-Action) Marginal Off-Policy Estimators for discrete action cases. |
|
State(-Action) Marginal Off-Policy Estimators for continuous action cases (designed for deterministic evaluation policies). |
|
Cumulative Distribution Off-Policy Estimators for discrete action cases. |
|
Cumulative Distribution Off-Policy Estimators for continuous action cases (designed for deterministic evaluation policies). |
weight and value learning methods#
Abstract base class for weight and value learning. |
|
Weight and Value Functions. |
|
|
Augmented Lagrangian method for weight/value function learning (discrete action cases). |
|
Augmented Lagrangian method for weight/value function learning (continuous action cases). |
|
Minimax weight function learning (discrete action cases). |
|
Minimax weight function learning (continuous action cases). |
|
Minimax value function learning (discrete action cases). |
|
Minimax value function learning (continuous action cases). |
others#
On-Policy performance comparison. |
others#
Useful tools. |