scope_rl.ope.estimators_base.BaseCumulativeDistributionOPEEstimator#
- class scope_rl.ope.estimators_base.BaseCumulativeDistributionOPEEstimator[source]#
Base class for Cumulative Distribution OPE estimators.
Imported as:
scope_rl.ope.BaseCumulativeDistributionOPEEstimatorNote
This abstract base class also implements the following private methods.
- _aggregate_trajectory_wise_statistics_discrete:
Calculate trajectory-wise summary statistics based on step-wise observations in the case of discrete action spaces.
- _aggregate_trajectory_wise_statistics_continuous:
Calculate trajectory-wise summary statistics based on step-wise observations in the case of continuous action spaces.
- _target_value_given_idx:
Obtain the reward value corresponding to the given idx when estimating the CDF.
- property _kernel_function:
Dictionary containing names and functions of kernels.
key: [ gaussian, epanechnikov, triangular, cosine, uniform, ]
Methods
Estimate the conditional value at risk (CVaR) of the reward under the evaluation policy.
Estimate the cumulative distribution function (CDF) of the policy value.
Estimate the interquartile range of the reward under the evaluation policy.
Estimate the mean of the reward under the evaluation policy.
Estimate the variance of the reward under the evaluation policy.
- abstract estimate_cumulative_distribution_function()[source]#
Estimate the cumulative distribution function (CDF) of the policy value.
- abstract estimate_variance()[source]#
Estimate the variance of the reward under the evaluation policy.
Methods