Improve Utilization of GPU · notadamking/RLTrader#10

(14 comments) (3 reactions) (1 assignee)Python (1,592 stars) (535 forks)batch import

enhancementgood first issuehelp wanted

Description

This library achieves very high success rates, though it takes a very long time to optimize and train. This could be improved if we could figure out a way to utilize the GPU more during optimization/training, so the CPU can be less of a bottleneck. Currently, the CPU is being used for most of the intermediate environment calculations, while the GPU is used within the PPO2 algorithm during policy optimization.

I am currently optimizing/training on the following hardware:

AMD Threadripper 1920X 12 Core (24 Thread) CPU
Nvidia RTX 2080 8GB GPU
16 GB 3000 Mhz RAM

The bottleneck on my system is definitely the CPU, which is surprising as this library takes advantage of the multi-threaded benefits of the Threadripper, and my GPU is staying around 1-10% utilization. I have some ideas on how this could be improved, but would like to start a conversation.

Increase the size of the policy network (i.e. increase the number of hidden layers or increase the number of nodes in each layer)
Do less work in each training loop, so the GPU loop is called more often.

I would love to hear what you guys think. Any ideas or knowledge is welcome to be shared here.

Contributor guide

Tech stack: pythontensorflow
Domain: machine learningai
Issue type: performance
Difficulty: 4
Estimated time: over 1 week
Activity status: stale
Clarity: mostly clear
Prerequisites: PythonTensorFlowReinforcement learning basics
Newbie friendliness: 20
Research direction: Review the training loop implementation in the repository, focusing on how the environment steps are computed on CPU and how the PPO2 algorithm is executed on GPU. Use profiling tools to measure CPU and GPU utilization. Explore techniques such as increasing batch sizes, parallel environment instances, or using TensorFlow's tf.data API to reduce CPU GPU synchronization overhead. The existing comments in issue #10 discuss potential approaches.