Commit
Update CategoricalPolicy.md
AqwamCreates authored Nov 28, 2024
1 parent 02913c0 commit 424d579
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/API/QuickSetups/CategoricalPolicy.md
@@ -14,7 +14,7 @@ CategoricalPolicy.new(numberOfReinforcementsPerEpisode: integer, epsilon: number

#### Parameters:

- * numberOfReinforcementsPerEpisode: The number of reinforcements to decay the epsilon value.
+ * numberOfReinforcementsPerEpisode: The number of reinforcements to be considered as a single episode.

* epsilon: The higher the value, the more likely it focuses on exploration over exploitation. The value must be set between 0 and 1. Exploration means choosing a random action to try to get better overall average performance, while exploiting means choosing an action that has the potential to increase the performance.
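
One common reading of the `epsilon` description is epsilon-greedy action selection. The sketch below is a language-agnostic illustration in Python, not the library's implementation: the function name `choose_action` and the `action_values` list are hypothetical, and it assumes "exploiting" means picking the action with the highest current value estimate.

```python
import random


def choose_action(action_values, epsilon, rng=random):
    """Pick an action index: explore with probability epsilon, otherwise exploit.

    Hypothetical helper for illustration only; not part of CategoricalPolicy.
    """
    if not 0.0 <= epsilon <= 1.0:
        raise ValueError("epsilon must be between 0 and 1")
    if rng.random() < epsilon:
        # Explore: choose any action uniformly at random.
        return rng.randrange(len(action_values))
    # Exploit: choose the action with the highest current value estimate.
    return max(range(len(action_values)), key=lambda i: action_values[i])
```

With `epsilon` set to 0 the sketch always exploits, and with `epsilon` set to 1 it always explores, which matches the "higher value means more exploration" wording above.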
