1. Finished defining the Actor and Critic models design.
2. Completed the DDPG implementation with the OU noise process and a replay buffer.
3. Finished setting up the soft-target updates for the agents
4. Completed and successfully trained the RL agents.
5. Completed the software documentation.
Discussions
Become a Hackaday.io Member
Create an account to leave a comment. Already have an account? Log In.