Keep replay buffer on disk (not in memory), allowing it to grow to any size. #151

me-unsolicited · 2021-04-22T05:05:09Z

Hello, please consider this pull request which I implemented based on this comment.

Mainly I added a GameHistoryDao class which creates "replay_buffer.db" with a simple key->value table and stores the games there like a dictionary. If this approach is good, then it can be optimized further by separating reanalysed_predicted_root_values, priorities, and game_priority into their own columns so it can avoid serializing/deserializing the full observation history at each update. However, I think that would take more invasive changes to the existing code.

ahainaut · 2021-04-25T20:02:46Z

Hi @me-unsolicited ,
Thank you for this new feature. After reviewing and testing the code, we found that it slows considerably the time of training. So we will have to wait to merge this PR until we find a way to speed the training keeping the replay buffer on disk.

…ainer methods

me-unsolicited · 2021-04-27T01:48:02Z

@ahainaut ,
Thanks for the feedback! I made some improvements and it runs much faster now.

Changes:

Use SQL to efficiently sample from the prioritized replay buffer.
Store priorities and predicted values in separate columns from the full object.

me-unsolicited added 2 commits April 21, 2021 22:34

Keep replay buffer on disk instead of in memory, using a sqlite database

f18a35b

Update appropriately for database backed records

a22b6f5

me-unsolicited added 11 commits April 26, 2021 19:30

Split GameHistory data into multiple columns for quick access

4b46ed6

Fix column name

2415a6c

Assemble game history properly when accessing replay buffer with cont…

0f45259

…ainer methods

Change spelling 'reanalyzed' to 'reanalysed'

b513a76

Add missing SQL functions

4f4915d

Add missing column to SELECT statement

23ac3a9

Add missing commas in SQL

6f69e45

Don't store numpy arrays

9084cb4

Check for numpy array type before converting to list

9a25c09

Coerce game_priority to a regular float before saving

dceed59

Fix SQL syntax errors with concatenated strings

641ff90

me-unsolicited added 4 commits April 26, 2021 21:55

Serialize reanalysed_predicted_root_values before update

423e2b0

Sample the database more efficiently

f47e164

Use fast update for reanalysed_predicted_root_values

f9a3307

Change spelling 'reanalyzed' to 'reanalysed'

729daaa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep replay buffer on disk (not in memory), allowing it to grow to any size. #151

Keep replay buffer on disk (not in memory), allowing it to grow to any size. #151

me-unsolicited commented Apr 22, 2021

ahainaut commented Apr 25, 2021

me-unsolicited commented Apr 27, 2021

Keep replay buffer on disk (not in memory), allowing it to grow to any size. #151

Are you sure you want to change the base?

Keep replay buffer on disk (not in memory), allowing it to grow to any size. #151

Conversation

me-unsolicited commented Apr 22, 2021

ahainaut commented Apr 25, 2021

me-unsolicited commented Apr 27, 2021