Module ilpyt.runners

The runner coordinates the interaction between the agent and the environment. It collects transitions (state, action, reward, next state) over specified intervals of time. We can have the runner generate a collection of transitions for us by calling generate_batch (specify number of steps) and generate_episodes (specify number of episodes).

Expand source code
"""
The runner coordinates the interaction between the agent and the environment. 
It collects transitions (state, action, reward, next state) over specified 
intervals of time. We can have the runner generate a collection of transitions 
for us by calling `generate_batch` (specify number of steps) and 
`generate_episodes` (specify number of episodes).
"""

Sub-modules

ilpyt.runners.runner

The runner coordinates the agent-environment interaction loop. It collects transitions (state, action, reward, next state) over specified intervals …