Add SERL launcher with a Franka Arm from state example #3

Leo428 · 2023-12-13T05:32:38Z

Hi all,

This PR includes:

the edgeml launcher codes from youliang (with unused parts removed)
replay buffer implementations for fast online RL training. This part is taken from jaxrl-5, which we used in the ICRA paper. The memory-efficient buffer supports faster CPU-to-GPU transfer by packing obs and next obs for pixel images and performs pre-fetching to speed up training further. However, in this PR, the test example is with state-only training.
an example of Franka Arm lifting a cube in simulation, with separate actor and learner nodes. This example should start picking up a cube around 8 minutes (20k steps) and should converge around 15 minutes on a 4090 machine.
I also removed the isort hook in pre-commit due to a format conflict with black formatter hook. I checked this with youliang already.

Thank you so much for reviewing!

Notes:

I followed the installation instructions for jaxrl-minimal. But I found that I have to upgrade the chex and tensorflow versions to run training correctly. Maybe jaxrl-minimal needs a pip requirement update at some point?
I tested the code with https://github.com/rail-berkeley/jaxrl_minimal/tree/serl_dev. This branch is based on and identical to the main branch. I will add DrQ and encoder-related changes later this week once this PR is through.

youliangtan

Thanks for this PR. Some high-level thoughs.

Install edgeml from the original repo. This can easily be done by including the git path and commit hash in setup.py. This will automatically pull and install the code during pip3 install -e . in serl_launcher. The main idea here is that edgeml will get maintained overtime, and we try to prevent duplication of code when possible.

After these changes, i can try run the simulation code.

youliangtan · 2023-12-13T05:55:58Z

serl_launcher/setup.py

+
+setup(
+    name="edgeml",
+    version="0.1.2",


serl_launcher package

Cool, I can change that.

youliangtan · 2023-12-13T05:58:01Z

serl_launcher/setup.py

+        "typing_extensions",
+        "opencv-python",
+        "lz4",
+    ],


Since Iam maintaining edgeml, so it is better not to duplicate code

install_requires=[ ....... TODO DEPENDENCIES "'edgeml @ git+https://github.com/youliangtan/edgeml.git@COMMIT_HASH', ],

Would this still work if I added additional code for an efficient online RL replay buffer in the /data folder?

Also, for SERL's use cases, some of the features in edgeml are not used. Shall we meet some times tomorrow? I'd love to hear your suggestions in person.

yes, you will just add it to serl_launcher/data and the custom code will import edgeml as library.

It is okay that features are not used. Think edgeml as a numpy library, you only use classes that you required. If you make custom changes to edgeml, either open a PR, or just have the code in serl_launcher.

youliangtan · 2023-12-13T05:58:58Z

README.md

+
+2. Install RL library
+    - the examples here use jaxrl-minimal as the RL library.
+    - To install jaxrl-minimal, `git clone https://github.com/rail-berkeley/jaxrl_minimal/tree/serl_dev`, the `serl_dev` branch is based off the latest `main` branch.


I see no diff in serl_dev and main in jaxrl_minimal, is that what it suppose to?

This branch is created for future use, we are going to add in encoder stuff, DrQ, and VICE.

eventually, we should merge changes back to the main branch though. This is for our convenience when testing and running code for now.

youliangtan · 2023-12-13T06:01:05Z

serl_launcher/edgeml/data/serl_dataset.py

@@ -0,0 +1,195 @@
+from functools import partial


This seems to be similar to the one in jaxrl_m. If there's diff, place it in serl_dev, or keep it here for now (both options are good)

Right, it can be in either places. I'm not placing it in jaxrl_minimal for now because I'm not sure if anyone else's code depends on the specifics of the dataset and replay_buffer implementation in jaxrl_minimal. Although I highly recommend this version of the memory efficient replay buffer as it is fast and compact when doing image-based online RL training.

youliangtan · 2023-12-13T06:02:41Z

serl_launcher/edgeml/data/serl_memory_efficient_replay_buffer.py

+from flax.core import frozen_dict
+from gymnasium.spaces import Box
+
+


Since this package is now call serl_launcher, the path should be:

serl_launcher/serl_launcher/data/memory_efficient_replay_buffer.py

Leo428 · 2023-12-13T22:05:50Z

Hi @youliangtan,

I made the changes per your requests.

Some side notes: When running jaxrl-minimal, you might run into some bugs related to chex and tensorflow versions. You can run pip install --upgrade chex and pip install --upgrade tensorflow to fix those errors.

Signed-off-by: youliang <[email protected]>

youliangtan · 2023-12-14T00:50:10Z

I managed to test-run the environment and run the actor and learner bash scripts.

Did minor code cleanups, and also updated the Readme doc. srry for the nitpicking, but is a good practice to use code block rather than code line in markdown (easier for users to copy)

0170394

Note: need to update chex to 0.1.85, will need to update jaxrl_m soon

youliangtan

Overall this works well. I set it up and ran it with a clean env. Havnt try to run long enough since i only tried it on a CPU setup. Can do it for the next iteration

Leo428 · 2023-12-14T00:55:08Z

Nice! with CPU it might take a bit longer to converge.

Thank you for the code improvements!

Leo428 added 5 commits December 12, 2023 19:34

add serl replay buffers

1692643

add edgeml code as serl launcher

5212034

comment out isort due to conflicts with black hook

1d6e230

add lift cube sim example from state

9bcd7b9

update README to include franka arm sim from state

4d86b1b

Leo428 requested review from youliangtan and jianlanluo December 13, 2023 05:35

add jax install instruction on gpu & tpu in README

ffee7ee

youliangtan suggested changes Dec 13, 2023

View reviewed changes

remove edgeml code and use edgeml as a library

138c030

youliangtan added 2 commits December 13, 2023 16:43

further cleanup code, and test run

0170394

Signed-off-by: youliang <[email protected]>

nit

afc5a62

Signed-off-by: youliang <[email protected]>

youliangtan approved these changes Dec 14, 2023

View reviewed changes

youliangtan merged commit 76c771b into main Dec 15, 2023
1 check passed

youliangtan deleted the serl_launcher_state branch December 15, 2023 15:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SERL launcher with a Franka Arm from state example #3

Add SERL launcher with a Franka Arm from state example #3

Leo428 commented Dec 13, 2023 •

edited

Loading

youliangtan left a comment

youliangtan Dec 13, 2023

Leo428 Dec 13, 2023

youliangtan Dec 13, 2023

Leo428 Dec 13, 2023

youliangtan Dec 13, 2023

youliangtan Dec 13, 2023

Leo428 Dec 13, 2023

Leo428 Dec 13, 2023

youliangtan Dec 13, 2023

Leo428 Dec 13, 2023

youliangtan Dec 13, 2023

Leo428 commented Dec 13, 2023

youliangtan commented Dec 14, 2023

youliangtan left a comment

Leo428 commented Dec 14, 2023

		from flax.core import frozen_dict
		from gymnasium.spaces import Box

Add SERL launcher with a Franka Arm from state example #3

Add SERL launcher with a Franka Arm from state example #3

Conversation

Leo428 commented Dec 13, 2023 • edited Loading

youliangtan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Leo428 commented Dec 13, 2023

youliangtan commented Dec 14, 2023

youliangtan left a comment

Choose a reason for hiding this comment

Leo428 commented Dec 14, 2023

Leo428 commented Dec 13, 2023 •

edited

Loading