How to allocate a fixed amount of GPU memory?

jonbakerfish · September 10, 2018, 1:09am

Is there a way to allocate a fixed amount of GPU memory for a program in MXNet (like in TensorFlow)? Currently, I have a training script which takes days to run, and its GPU memory usage fluctuates from time to time. When other users accidentally use the same GPU, my program crashes with an OOM error.

Sergey · September 10, 2018, 11:33pm

There are currently only a limited number of things you can do to manage GPU memory. Take a look at the available environment variables: https://mxnet.incubator.apache.org/faq/env_var.html#memory-options
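As a rough sketch of how those variables might be applied: the helper below builds an environment with two of the memory options from that page, MXNET_GPU_MEM_POOL_RESERVE and MXNET_GPU_MEM_POOL_TYPE, so a training script can be launched with them. The helper name, the example values, and train.py are hypothetical. Note that, as far as I know, these options only tune the memory pool; they do not pin a fixed allocation.

```python
import os


def mxnet_memory_env(reserve_percent=5, pool_type="Naive"):
    """Return a copy of the current environment with MXNet memory-pool options set.

    reserve_percent: percentage of GPU memory left outside MXNet's pool.
    pool_type: pooling strategy name (for example "Naive" or "Round").
    """
    if not 0 <= reserve_percent <= 100:
        raise ValueError("reserve_percent must be between 0 and 100")
    env = dict(os.environ)
    # Environment variables are strings, so convert the number explicitly.
    env["MXNET_GPU_MEM_POOL_RESERVE"] = str(reserve_percent)
    env["MXNET_GPU_MEM_POOL_TYPE"] = pool_type
    return env


# Hypothetical usage; train.py stands in for your own script:
#   import subprocess, sys
#   subprocess.run([sys.executable, "train.py"], env=mxnet_memory_env(10, "Round"))
```

The variables have to be in place before MXNet is imported, which is why the sketch passes them to a child process instead of setting them mid-run.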

zhreshold · September 14, 2018, 6:52pm

If you are using gluon, you can try net.hybridize(static_alloc=True)
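A minimal sketch of what that could look like, assuming MXNet 1.3 or later with Gluon. The helper name, the layer sizes, and the input shape are made up for illustration, and the demo function is only defined, not run.

```python
def hybridize_static(net):
    """Hybridize a Gluon HybridBlock with static memory allocation enabled."""
    # static_alloc=True asks the cached graph to allocate its buffers once
    # and reuse them across calls instead of reallocating on each forward pass.
    net.hybridize(static_alloc=True)
    return net


def demo():
    """Hypothetical usage; requires MXNet to be installed."""
    import mxnet as mx
    from mxnet.gluon import nn

    # Fall back to the CPU when no GPU is visible.
    ctx = mx.gpu(0) if mx.context.num_gpus() > 0 else mx.cpu()
    net = nn.HybridSequential()
    net.add(nn.Dense(128, activation="relu"), nn.Dense(10))
    net.initialize(ctx=ctx)
    hybridize_static(net)
    out = net(mx.nd.ones((32, 64), ctx=ctx))
    return out.shape
```

This should make memory usage steadier for fixed input shapes, but it does not reserve the GPU against other processes.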

jonbakerfish · September 19, 2018, 5:01am

For those who have root privileges, you can change the Compute Mode of the target GPU to “EXCLUSIVE_PROCESS”, which means only one context is allowed per device, usable from multiple threads at a time. Please refer to man nvidia-smi, e.g.:

sudo nvidia-smi -c 3 -i 1
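Here -c 3 selects EXCLUSIVE_PROCESS and -i 1 selects the GPU with index 1. For anyone scripting this, a small sketch that builds the same command by mode name (the helper names are hypothetical; the mode numbers follow man nvidia-smi):

```python
import subprocess

# Compute mode numbers as listed in man nvidia-smi.
COMPUTE_MODES = {"DEFAULT": 0, "PROHIBITED": 2, "EXCLUSIVE_PROCESS": 3}


def compute_mode_command(gpu_index, mode="EXCLUSIVE_PROCESS", use_sudo=True):
    """Build the nvidia-smi argument list that sets a GPU's compute mode."""
    argv = ["nvidia-smi", "-c", str(COMPUTE_MODES[mode]), "-i", str(gpu_index)]
    return ["sudo"] + argv if use_sudo else argv


def set_compute_mode(gpu_index, mode="EXCLUSIVE_PROCESS", use_sudo=True):
    """Run the command; raises CalledProcessError if nvidia-smi fails."""
    return subprocess.run(compute_mode_command(gpu_index, mode, use_sudo), check=True)
```

Calling set_compute_mode(1, "DEFAULT") would switch the GPU back to shared use once the training run is done.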

TriLoon · September 19, 2019, 6:59am

There is a member variable reserve_ in the ResourceManage-related code (defaults to 5%). However, I am not sure whether it works or not.
