Borrowed from Damcy's code here.
The original code does not work at all for large buffer sizes. This version ensures correctness while also handling large buffers; it has been tested with a buffer size of 1,000,000.
- In rank_based.py, the docstring of Experience.store gives a simple description of how to store replay memory; you can also refer to rank_based_test.py.
- It is more convenient to store each replay as the tuple (state_1, action_1, reward, state_2, terminal). With this format, every entry in Experience is a complete transition and can be sampled freely.
- Runs with Python 3 or Python 2.7.
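To illustrate the (state_1, action_1, reward, state_2, terminal) format above, here is a minimal, self-contained sketch (the `ReplayBuffer` name and its methods are mine for illustration, not the repo's `Experience` class): because each tuple already carries its successor state, every slot in the buffer is a complete, sampleable transition.

```python
import random
from collections import deque

class ReplayBuffer:
    """Hypothetical minimal ring buffer of full transitions (not the repo's code)."""

    def __init__(self, capacity):
        # deque with maxlen evicts the oldest transition automatically
        self.buffer = deque(maxlen=capacity)

    def store(self, state, action, reward, next_state, terminal):
        self.buffer.append((state, action, reward, next_state, terminal))
        return True  # mirrors the True/False convention of Experience.store

    def sample(self, batch_size):
        # every stored entry is valid, so uniform sampling needs no legality check
        return random.sample(list(self.buffer), batch_size)

buf = ReplayBuffer(capacity=3)
for t in range(5):
    buf.store(state=t, action=0, reward=1.0, next_state=t + 1, terminal=False)
print(len(buf.buffer))  # prints 3: only the newest transitions are kept
```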
Uses a binary heap as a priority queue, and builds an Experience class to store and retrieve samples.
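The priority-queue idea can be sketched with Python's standard `heapq` (a simplified stand-in for the repo's heap, which I have not reproduced here; `heapq` is a min-heap, so priorities are negated to pop the highest-priority sample first):

```python
import heapq

class MaxHeapPQ:
    """Illustrative max-priority queue over (priority, experience id) pairs."""

    def __init__(self):
        self._heap = []

    def push(self, priority, e_id):
        # negate the priority so the min-heap behaves as a max-heap
        heapq.heappush(self._heap, (-priority, e_id))

    def pop_max(self):
        neg_p, e_id = heapq.heappop(self._heap)
        return -neg_p, e_id

pq = MaxHeapPQ()
for e_id, p in enumerate([0.5, 2.0, 1.2]):
    pq.push(p, e_id)
print(pq.pop_max())  # prints (2.0, 1): the largest priority comes out first
```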
Interface:
* All interfaces are in rank_based.py
* init conf: please read Experience.__init__ for details; all parameters can be set through the input conf
* store a replay sample: Experience.store
    params: [in] experience, the sample to store
    returns: bool, True on success, False on failure
* sample replay samples: Experience.sample
    params: [in] global_step, used to calculate beta
    returns:
        experience, list of samples
        w, list of importance-sampling weights
        rank_e_id, list of experience ids, used to update priority values
* update priority values: Experience.update
    params:
        [in] indices, rank_e_ids
        [in] delta, new TD-errors
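The beta passed through Experience.sample comes from the importance-sampling correction in the paper. A worked sketch of the math (the function names `sampling_weights` and `anneal_beta`, and the defaults `beta_zero=0.5`, `total_steps=100000`, are my own illustrative assumptions, not this repo's API): priorities are rank-based, p_i = 1/rank(i), P(i) = p_i^alpha / sum_k p_k^alpha, and weights are w_i = (N * P(i))^(-beta), normalized by max(w).

```python
def sampling_weights(n, alpha, beta):
    """Rank-based probabilities and normalized IS weights for n samples."""
    p = [(1.0 / rank) ** alpha for rank in range(1, n + 1)]
    total = sum(p)
    probs = [x / total for x in p]                 # P(i), sums to 1
    w = [(n * pi) ** (-beta) for pi in probs]      # raw IS weights
    w_max = max(w)
    return probs, [x / w_max for x in w]           # normalize so max weight is 1

def anneal_beta(global_step, beta_zero=0.5, total_steps=100000):
    # beta is annealed linearly from beta_zero toward 1, as in the paper
    return min(1.0, beta_zero + (1.0 - beta_zero) * global_step / total_steps)

probs, weights = sampling_weights(n=4, alpha=0.7, beta=anneal_beta(50000))
```

High-rank (large TD-error) samples get high probability but small weight, so the gradient bias introduced by prioritized sampling is corrected.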
You can find an implementation of the proportional variant here: proportional
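For contrast with the rank-based approach used here, the proportional variant draws each sample with probability proportional to (|TD-error| + eps)^alpha. A hedged, self-contained sketch of that idea (this is not the linked implementation, which uses a sum-tree for efficiency):

```python
import random

def proportional_sample(priorities, alpha=0.6, eps=1e-6):
    """Return one index drawn with probability proportional to (p + eps)^alpha."""
    scaled = [(p + eps) ** alpha for p in priorities]
    r = random.uniform(0, sum(scaled))
    acc = 0.0
    for i, s in enumerate(scaled):
        acc += s
        if r <= acc:
            return i
    return len(priorities) - 1  # guard against floating-point rounding

idx = proportional_sample([0.1, 5.0, 0.2])  # index 1 is chosen most often
```

A real implementation replaces the linear scan with a sum-tree so sampling and priority updates are O(log N).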
- "Prioritized Experience Replay" http://arxiv.org/abs/1511.05952
- Atari by @Kaixhin: an Atari agent that uses Torch to implement the rank-based algorithm.