[DRAFT] C++ Export #228

Gregwar · 2022-04-01T12:48:55Z

Description

This is a draft, I suggest we keep the conversation in the associated issue:
DLR-RM/stable-baselines3#836

Motivation and Context

I have raised an issue to propose this change (required for new features and bug fixes)

DLR-RM/stable-baselines3#836

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist:

I've read the CONTRIBUTION guide (required)
I have updated the changelog accordingly (required).
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.
I have reformatted the code using make format (required)
I have checked the codestyle using make check-codestyle and make lint (required)
I have ensured make pytest and make type both pass. (required)

…o export_cpp

Gregwar · 2022-04-06T16:47:02Z

For some application, I would also be interrested in having the values predictions available, so I am adding them as well, some questions:

Are the Q/V networks decent estimations of the true functions or not ? (answer may depend on algorithm involved)
ContinuousCritic has several q_networks
- Is that only for the twin trick ?
- Is it ok in production to only look at the value of the first network, or should we compute both and get the min ?

Gregwar · 2022-04-06T16:48:34Z

Also, is there a consistent pattern across all algorithms to access values functions just like predict for the action ?

with flatten)

Gregwar · 2022-04-06T21:18:16Z

Another question/note:

If we use the normalize hyperparameter while training, I guess we will need to use the normalization during the inference, right? That would imply getting some data from the env wrapper and exporting it as well

araffin · 2022-04-06T21:24:44Z

I guess we will need to use the normalization during the inference, right?

yes, we already save the mean and std for observation in a separate file (vecnormalize.pkl) but should be easy to export.

araffin · 2022-04-06T21:26:08Z

onsistent pattern across all algorithms to access values functions just like predict for the action ?

not really... Only between algorithms of the same family...

For some application, I would also be interrested in having the values predictions available,

I agree but I would not focus on that for now (value functions are not needed for inference, unless it is DQN).
I would do it in a follow up PR, once we have the basic feature ready and working reliably.

araffin · 2022-04-06T21:27:45Z

re the Q/V networks decent estimations of the true functions or not ?

that's hard to give only one answer, we surely hope they are, but that doesn't mean it is always the case.

Is that only for the twin trick ?

yes

Is it ok in production to only look at the value of the first network

for rough estimation, only one is needed.

araffin · 2022-04-08T16:14:30Z

@Gregwar I could successfully test it =)
I had to do some tweaks to use it with conda env:

export CMAKE_PREFIX_PATH="${HOME}/.local/lib/libtorch:/home/user/miniconda3"

where the conda prefix can be retrieved with ${CONDA_PREFIX:-"$(dirname $(which conda))/../"}

I also had to comment out the hardcoded set (Python_EXECUTABLE "/usr/bin/python3.8") (I don't think that's needed)

Gregwar · 2022-04-11T09:31:47Z

@Gregwar I could successfully test it =)
That is nice!

I had to do some tweaks to use it with conda env:
Yes you are right this should not be there

I think I will have to setup some (automated) tests to check for consistency between Python and C++ predictions. It will be hard to be sure to cover all the cases else, I am digging in the code of all the possible algorithms and I might miss some information (there are many possible options as well like using images as input that should get normalized, using SDE, the Wrapping "normalize" that is not handled at all currently etc.)

araffin · 2022-04-11T10:09:21Z

It will be hard to be sure to cover all the cases else,

Let's do a first working version that covers only some algorithms (let's say PPO, DQN and SAC) and only covers basic case (MLP, no images, no additional feature like normalization or SDE).

Once that's working and merged, we can work on adding additional features, I would start with normalization and then image support ;)

Gregwar · 2022-06-26T15:57:28Z

Hello,

Sorry for the lag, we are currently working on our humanoid robots for RoboCup, we integrate DRL algorithms in the robots for the first year.

We spent some time investigating and finally using OpenVino runtime because of our robots architecture (we use ONNX as intermediary representation OpenVino's model exporter).

In first place I thought of implementing pre/post processing in C++ but it is actually a better idea to use it in the PyTorch module that is being traced or exported. We can't provide a lot of runtime-specific implementation, so we could focus on libtorch as first intended and provide ONNX possibility for people that want to use something else.

araffin · 2022-09-26T13:10:48Z

@Gregwar could you give me access to your repo so I can push changes? (mainly merging master with this branch)

zoythum · 2024-01-18T16:35:12Z

@araffin Is there any update on this PR? I would be interested in exporting SB3 models into C++ executables but I am not sure on how to approach this problem

araffin · 2024-01-18T17:08:55Z

I would be interested in exporting SB3 models into C++ executables but I am not sure on how to approach this problem

For inference, you can have a look at https://stable-baselines3.readthedocs.io/en/master/guide/export.html
and DLR-RM/stable-baselines3#1349 (comment)

Gregwar added 5 commits March 30, 2022 18:08

Exporting model to C++ (wip)

797f622

CPP Export (wip)

a309c43

CPP Export (wip)

6b708ab

CPP Export (+pybind to test)

1b46083

Reformating

ef4b1ef

Gregwar mentioned this pull request Apr 1, 2022

[Question] C++ Inference DLR-RM/stable-baselines3#836

Closed

2 tasks

araffin and others added 5 commits April 1, 2022 15:58

Merge branch 'master' into export_cpp

6558af0

Outputing value function (wip)

de9d721

Merge branch 'export_cpp' of github.com:Gregwar/rl-baselines3-zoo int…

f509ff6

…o export_cpp

Formatting

e758ce6

Adding feature extractors in exported models (not tested, wip)

a0dcb77

Unqqueezing since feature extractor is now embedded in model (and starts

87f0aa9

with flatten)

Gregwar and others added 5 commits April 6, 2022 23:35

Features for TD3

3d38afd

Features extractors wip

29dedc0

Tracing value modules

e21cc8b

Reformat and minor cleanup

b4195ff

Merge branch 'master' into export_cpp

5256aaa

Fix import order

f7b8c15

Merge branch 'master' into export_cpp

5baa4b3

araffin added the Maintainers on vacation Maintainers are on vacation so they can recharge their batteries, we will be back soon ;) label Jun 29, 2022

araffin removed the Maintainers on vacation Maintainers are on vacation so they can recharge their batteries, we will be back soon ;) label Aug 13, 2022

araffin self-assigned this Aug 13, 2022

Merge branch 'master' into export_cpp

afdd8ba

Merge branch 'master' into export_cpp

6247367

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DRAFT] C++ Export #228

[DRAFT] C++ Export #228

Gregwar commented Apr 1, 2022

Gregwar commented Apr 6, 2022 •

edited

Loading

Gregwar commented Apr 6, 2022

Gregwar commented Apr 6, 2022

araffin commented Apr 6, 2022 •

edited

Loading

araffin commented Apr 6, 2022 •

edited

Loading

araffin commented Apr 6, 2022

araffin commented Apr 8, 2022

Gregwar commented Apr 11, 2022

araffin commented Apr 11, 2022 •

edited

Loading

Gregwar commented Jun 26, 2022

araffin commented Sep 26, 2022

zoythum commented Jan 18, 2024

araffin commented Jan 18, 2024

[DRAFT] C++ Export #228

Are you sure you want to change the base?

[DRAFT] C++ Export #228

Conversation

Gregwar commented Apr 1, 2022

Description

Motivation and Context

Types of changes

Checklist:

Gregwar commented Apr 6, 2022 • edited Loading

Gregwar commented Apr 6, 2022

Gregwar commented Apr 6, 2022

araffin commented Apr 6, 2022 • edited Loading

araffin commented Apr 6, 2022 • edited Loading

araffin commented Apr 6, 2022

araffin commented Apr 8, 2022

Gregwar commented Apr 11, 2022

araffin commented Apr 11, 2022 • edited Loading

Gregwar commented Jun 26, 2022

araffin commented Sep 26, 2022

zoythum commented Jan 18, 2024

araffin commented Jan 18, 2024

Gregwar commented Apr 6, 2022 •

edited

Loading

araffin commented Apr 6, 2022 •

edited

Loading

araffin commented Apr 6, 2022 •

edited

Loading

araffin commented Apr 11, 2022 •

edited

Loading