Skip to content

Commit

Permalink
Update DeepReinforcementLearningAlgorithmsProperties.md
Browse files Browse the repository at this point in the history
  • Loading branch information
AqwamCreates authored Oct 22, 2024
1 parent 20152e9 commit 4367008
Showing 1 changed file with 7 additions and 7 deletions.
14 changes: 7 additions & 7 deletions docs/Tutorials/DeepReinforcementLearningAlgorithmsProperties.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,13 +11,13 @@
| Deep Expected State-Action-Reward-State-Action | 1 | Temporal Difference | On-Policy | Yes | No | No | Yes | No |
| Double Deep Expected State-Action-Reward-State-Action V1 (Randomly Chosen Network) | 1 (2 Model Parameters) | Temporal Difference | On-Policy | Yes | No | No | Yes | No |
| Double Deep Expected State-Action-Reward-State-Action V2 (Target Network) | 1 (2 Model Parameters) | Temporal Difference | On-Policy | Yes | No | No | Yes | No |
| Actor-Critic | 2 (Actor + Critic) | Monte Carlo | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Advantage Actor-Critic | 2 (Actor + Critic) | Monte Carlo | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Asynchronous Advantage Actor-Critic | 2 (Actor + Critic) | Monte Carlo | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Proximal Policy Optimization | 2 (Actor + Critic) | Monte Carlo | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Proximal Policy Optimization with Clipped Objective | 2 (Actor + Critic) | Monte Carlo | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Vanilla Policy Gradient | 2 (Actor + Critic) | Monte Carlo | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| REINFORCE | 1 | Monte Carlo | On-Policy | No | Yes | Yes | Yes | Yes |
| Actor-Critic | 2 (Actor + Critic) | Both | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Advantage Actor-Critic | 2 (Actor + Critic) | Both | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Asynchronous Advantage Actor-Critic | 2 (Actor + Critic) | Both | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Proximal Policy Optimization | 2 (Actor + Critic) | Both | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Proximal Policy Optimization with Clipped Objective | 2 (Actor + Critic) | Both | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| Vanilla Policy Gradient | 2 (Actor + Critic) | Both | On-Policy | Yes (Actor) | Yes (Critic) | Yes | Yes | Yes |
| REINFORCE | 1 | Both | On-Policy | No | Yes | Yes | Yes | Yes |

## Additional Notes:
1. **Deep Q Learning**:
Expand Down

0 comments on commit 4367008

Please sign in to comment.