
Training SAC with raw image as input #25

Open
ChunJyeBehBeh opened this issue Mar 7, 2020 · 8 comments
Labels: question (Further information is requested)

Comments

@ChunJyeBehBeh

ChunJyeBehBeh commented Mar 7, 2020

The policies that I have tried are DDPG and SAC. I used the master branch, and below are the two commands to reproduce the error:
python train.py --algo sac -n 5000
python train.py --algo ddpg -n 5000

  • TensorFlow version == 1.15.0
  • stable-baselines == 2.9.0

Thanks for this great repo. It is a very good starting point for learning reinforcement learning in the autonomous driving area. I have successfully trained a SAC model using the VAE as input.

Now I want to try using raw images as input. I have set N_COMMAND_HISTORY to zero, and I am using the master branch. For the first 300 steps, the steering and throttle vary between -1 and 1 because random actions are sampled:
https://github.com/araffin/learning-to-drive-in-5-minutes/blob/fb82bc77593605711289e03f95dcfb6d3ea9e6c3/algos/custom_sac.py#L89
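For context, the linked line implements the usual off-policy warm-up: before `learning_starts` steps, actions are sampled uniformly from the action space instead of coming from the policy. A rough paraphrase (the function and argument names here are illustrative, not the repo's exact code):

```python
def select_action(model, obs, step, learning_starts=300):
    # Warm-up phase: sample uniformly from the action space, so steering
    # and throttle vary randomly in [-1, 1].
    if step < learning_starts:
        return model.env.action_space.sample()
    # Afterwards, the (possibly still untrained) policy picks the action.
    action, _ = model.predict(obs)
    return action
```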

But after that, the policy keeps outputting an extreme value, either -1 or 1, for the steering. So the donkey car goes out of the lane quickly, and this keeps repeating without showing any learning progress.

The image below shows that the episode length drops from 95 to 50 steps after the policy starts to output the actions.
[image: episode length plot]

Below is the plot of the throttle value output [SAC with raw image input]. It stays constant at 1 after a few episodes.
[figure: throttle output, SAC with raw image input]

Below is the plot of the throttle value output [SAC with VAE input]. The model tries to learn how to steer and varies the output between -1 and 1.
[figure: throttle output, SAC with VAE input]

Sorry for opening so many issues.

@araffin
Owner

araffin commented Mar 7, 2020

Hello,
What policy are you using?
Please fill in the issue template completely.

@ChunJyeBehBeh
Author

The policies that I used are DDPG and SAC. I have updated the issue above. Thanks for your reply!

@araffin
Owner

araffin commented Mar 7, 2020

I meant to say "policy architecture". It seems that you are not using a CNN if you are using the default hyperparameters... This explains your results.

araffin added the question (Further information is requested) label on Mar 7, 2020
@ChunJyeBehBeh
Author

ChunJyeBehBeh commented Mar 7, 2020

Yes, I am using the default hyperparameters... May I know which part I should change in order to use raw images to train a SAC model?

In sac.yml, should I change the policy from policy: 'MlpPolicy' to policy: 'CnnPolicy'?

@araffin
Owner

araffin commented Mar 7, 2020

I would recommend that you read the stable-baselines documentation and look at the rl zoo; there are plenty of examples of RL with images.
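For instance, a minimal sketch of training SAC directly on image observations with stable-baselines 2.x (CarRacing-v0 is only a stand-in for the donkey car env here, and the hyperparameter values are illustrative):

```python
import gym

from stable_baselines import SAC
from stable_baselines.sac.policies import CnnPolicy

# Any env with image observations and a continuous action space will do.
env = gym.make('CarRacing-v0')

# CnnPolicy replaces the default MLP with a CNN feature extractor.
model = SAC(CnnPolicy, env, buffer_size=50000, learning_starts=300, verbose=1)
model.learn(total_timesteps=5000)
```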

@ChunJyeBehBeh
Author

ChunJyeBehBeh commented Mar 21, 2020

Hello, I changed the policy to CnnPolicy and increased the layers with policy_kwargs: "dict(layers=[64, 64, 64, 64])". However, I still didn't manage to train the agent with raw image input... Are there any other parameters that I missed?
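One thing worth checking (an assumption based on the stable-baselines custom-policy docs, not verified on this repo): with CnnPolicy, the layers kwarg only sizes the fully-connected layers after the CNN feature extractor, so increasing it does not change the convolutional part. The extractor itself can be swapped via the cnn_extractor policy kwarg, roughly like this:

```python
import gym
import numpy as np
import tensorflow as tf

from stable_baselines import SAC
from stable_baselines.a2c.utils import conv, conv_to_fc, linear
from stable_baselines.sac.policies import CnnPolicy

def custom_cnn(scaled_images, **kwargs):
    # A CNN extractor in the style of the default nature_cnn (illustrative).
    activ = tf.nn.relu
    layer_1 = activ(conv(scaled_images, 'c1', n_filters=32, filter_size=8,
                         stride=4, init_scale=np.sqrt(2), **kwargs))
    layer_2 = activ(conv(layer_1, 'c2', n_filters=64, filter_size=4,
                         stride=2, init_scale=np.sqrt(2), **kwargs))
    flattened = conv_to_fc(layer_2)
    return activ(linear(flattened, 'fc1', n_hidden=256, init_scale=np.sqrt(2)))

env = gym.make('CarRacing-v0')  # stand-in for the donkey car env
model = SAC(CnnPolicy, env, policy_kwargs=dict(cnn_extractor=custom_cnn))
```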

@Adnan-annan

@ChunJyeBehBeh Did you manage to train without the VAE?

@eliork

eliork commented Jan 21, 2021

@ChunJyeBehBeh @Adnan-annan I am also trying to train without the VAE. Did you have any success yet? Would you mind sharing your results and the methods you've tried?
