Module: tf_agents.bandits.policies.mixture_policy

A policy class that chooses among a set of policies and takes its actions from the chosen one.

This mixture policy takes a list of policies and randomly chooses one of them for every observation. The probability of choosing each policy is defined by the mixture_distribution.
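The per-observation choice described above can be sketched in plain Python. This is a minimal illustration of the idea only, not the tf_agents implementation: the function mixture_action, the two sub-policies, and the weights argument (standing in for mixture_distribution) are all hypothetical names introduced here. Only the key string 'mixture_agent_id' comes from this module's constants.

```python
import random

# Key string taken from this module's constants.
MIXTURE_AGENT_ID = 'mixture_agent_id'


def mixture_action(observations, policies, weights, rng):
    """Hypothetical helper: pick one sub-policy per observation and apply it.

    `weights` plays the role of the mixture distribution; `policies` is a
    list of callables mapping an observation to an action.
    """
    actions, chosen = [], []
    for obs in observations:
        # Draw a sub-policy index independently for every observation.
        idx = rng.choices(range(len(policies)), weights=weights, k=1)[0]
        actions.append(policies[idx](obs))
        chosen.append(idx)
    # Report which sub-policy produced each action.
    return actions, {MIXTURE_AGENT_ID: chosen}


def always_zero(obs):
    """Hypothetical sub-policy that always picks action 0."""
    return 0


def greedy(obs):
    """Hypothetical sub-policy that picks the index of the largest value."""
    return max(range(len(obs)), key=lambda i: obs[i])


policies = [always_zero, greedy]
observations = [[0.1, 0.9], [0.7, 0.2], [0.3, 0.4]]
actions, info = mixture_action(
    observations, policies, weights=[0.25, 0.75], rng=random.Random(0))
```

Here info[MIXTURE_AGENT_ID][i] records which sub-policy produced actions[i], so each action can be traced back to the policy that generated it.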

Classes

class MixturePolicy: A policy that chooses from a set of policies to decide the action.

Other Members

MIXTURE_AGENT_ID: 'mixture_agent_id'

SUBPOLICY_INFO: 'subpolicy_info'

absolute_import: Instance of __future__._Feature

division: Instance of __future__._Feature

print_function: Instance of __future__._Feature