basicgym.envs.simulator.function#

Mathematical Functions used in Synthetic System .

Classes

`RewardFunction`	Class to define the reward function.
`StateTransitionFunction`	Class to define the state transition function.

class basicgym.envs.simulator.function.StateTransitionFunction(state_dim, action_dim, random_state=None)[source]#

Class to define the state transition function.

Bases: basicgym.BaseStateTransitionFunction

Imported as: basicgym.envs.simulator.StateTransitionFunction

Tip

Use BaseStateTransitionFunction to define a custom StateTransitionFunction.

Parameters:

state_dim (int) – Dimension of the state.
action_dim (int) – Dimension of the action (context).
random_state (int, default=None (>= 0)) – Random state.

Attributes:

random_state

Methods

step(state, action)

Update the state based on the presented action.

step(state, action)[source]#

Update the state based on the presented action.

Parameters:

state (array-like of shape (state_dim, )) – Current state.
action (array-like of shape (action_dim, )) – Indicating the action chosen by the agent.

Returns:

state – Next state.

Return type:

array-like of shape (state_dim, )

class basicgym.envs.simulator.function.RewardFunction(state_dim, action_dim, reward_type='continuous', reward_std=0.0, random_state=None)[source]#

Class to define the reward function.

Bases: basicgym.BaseRewardFunction

Imported as: basicgym.envs.simulator.RewardFunction

Tip

Use BaseRewardFunction to define a custom RewardFunction.

Parameters:

state_dim (int) – Dimension of the state.
action_dim (int) – Dimension of the action (context).
reward_type ({"continuous", "binary"}, default="continuous") – Reward type.
reward_std (float, default=0.0 (>=0)) – Noise level of the reward. Applicable only when reward_type is “continuous”.
random_state (int, default=None (>= 0)) – Random state.

Attributes:

random_state

Methods

`mean_reward_function`(state, action)	Linear expected immediate reward function.
`sample_reward`(state, action)	Sample reward.

mean_reward_function(state, action)[source]#

Linear expected immediate reward function.

Parameters:

state (array-like of shape (state_dim, )) – State in the RL environment.
action (array-like of shape (action_dim, )) – Indicating the action chosen by the agent.

Returns:

mean_reward_function – Expected immediate reward function conditioned on the state and action.

Return type:

float