tf_agents.trajectories.boundary

Create a Trajectory transitioning between StepTypes LAST and FIRST.

View aliases

Main aliases

tf_agents.trajectories.trajectory.boundary

tf_agents.trajectories.boundary(
    observation: tf_agents.typing.types.NestedSpecTensorOrArray,
    action: tf_agents.typing.types.NestedSpecTensorOrArray,
    policy_info: tf_agents.typing.types.NestedSpecTensorOrArray,
    reward: tf_agents.typing.types.NestedSpecTensorOrArray,
    discount: tf_agents.typing.types.SpecTensorOrArray
) -> tf_agents.trajectories.Trajectory

All inputs may be batched.

The input discount is used to infer the outer shape of the inputs, as it is always expected to be a singleton array with scalar inner shape.

Args

observation (possibly nested tuple of) Tensor or np.ndarray; all shaped [B, ...], [T, ...], or [B, T, ...]. action (possibly nested tuple of) Tensor or np.ndarray; all shaped [B, ...], [T, ...], or [B, T, ...]. policy_info (possibly nested tuple of) Tensor or np.ndarray; all shaped [B, ...], [T, ...], or [B, T, ...]. reward (possibly nested tuple of) Tensor or np.ndarray; all shaped [B, ...], [T, ...], or [B, T, ...]. discount A floating point vector Tensor or np.ndarray; shaped [B], [T], or [B, T] (optional).

Returns
A `Trajectory` instance.

tf_agents.trajectories.boundary Stay organized with collections Save and categorize content based on your preferences.

View aliases

Args

Returns

tf_agents.trajectories.boundary