Ch 16. Using native python list instead of deque for storing experience, increases sampling performance considerably. #204

psnilesh · 2018-03-28T14:58:22Z

First of all, thanks for an amazing book and even more awesome github repo. It really help me pickup Deep RL. I tried training Breakout-v0 using your code and I found that the training slowed considerably as time went by. Replay buffer size was set to 1 million. Since collections.deque take O(n) time for random access, I believe it's not suitable for random sampling. I wrote a simple replay buffer that plays well with the rest of the code and uses native python list for storing samples. The performance gain is considerably large. I hope you find this change useful.

ageron · 2018-04-04T12:56:50Z

Thanks @NileshPS , that's a very helpful contribution! :)
I took a quick look and it seems great, but I don't have time to test this right now. I'll merge as soon as I can test the code.
Thanks again!
Aurélien

ageron · 2018-05-09T13:34:26Z

Works great, thanks again @NileshPS ! :)

psnilesh added 2 commits March 28, 2018 19:57

use list with circular indexing instead of deque as the replay buffer

de510e8

use ReplayMemory

b70c1df

ageron merged commit db8cd86 into ageron:master May 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ch 16. Using native python list instead of deque for storing experience, increases sampling performance considerably. #204

Ch 16. Using native python list instead of deque for storing experience, increases sampling performance considerably. #204

psnilesh commented Mar 28, 2018 •

edited

Loading

ageron commented Apr 4, 2018

ageron commented May 9, 2018

Ch 16. Using native python list instead of deque for storing experience, increases sampling performance considerably. #204

Ch 16. Using native python list instead of deque for storing experience, increases sampling performance considerably. #204

Conversation

psnilesh commented Mar 28, 2018 • edited Loading

ageron commented Apr 4, 2018

ageron commented May 9, 2018

psnilesh commented Mar 28, 2018 •

edited

Loading