NDArray.concat failed to concatenate two arrays on different GPUs?

mg0880gm · May 15, 2018, 5:49am

The following code raised an error:

import mxnet as mx
a = mx.ndarray.array([[1,2,3],[4,5,6]],ctx=mx.gpu(0))
b = mx.ndarray.array([[1,2,3],[4,5,6]],ctx=mx.gpu(1))
mx.ndarray.concat(a,b,dim=1)

raise MXNetError(py_str(_LIB.MXGetLastError()))
mxnet.base.MXNetError: [05:42:09] /mxnet-1.2.0/3rdparty/mshadow/mshadow/./stream_gpu-inl.h:62: Check failed: e == cudaSuccess CUDA: an illegal memory access was encountered

The goal is to concatenate multiple NDArray objects and convert the result into a NumPy array. The NDArray objects are predictions from a model trained on multiple GPUs with model parallelism. Converting each NDArray to a NumPy array with asnumpy() and then calling numpy.concatenate() did not seem efficient, so I checked whether I could concatenate with mxnet.ndarray.concat first and then convert the merged array to a NumPy array. Any suggestions?

Coffeered · May 15, 2018, 6:40am

Actually, you can first make sure they are in the same context by using as_in_context().
Using asnumpy() works because it copies each array to CPU memory, so the concatenation then happens on the CPU.

For the context issue, you can simply use copyto(), if I understand you correctly.

Sergey · May 15, 2018, 7:24pm

Unfortunately, the arrays have to be in the same context before you can perform an operation on them. The simplest way to do that is to call the as_in_context(ctx) method with the context you want them to be in. This context doesn't need to be the CPU. Here is an example:

import mxnet as mx
a = mx.ndarray.array([[1,2,3],[4,5,6]],ctx=mx.gpu(0))
b = mx.ndarray.array([[1,2,3],[4,5,6]],ctx=mx.gpu(1))
b_copy = b.as_in_context(a.context)
mx.ndarray.concat(a,b_copy,dim=1)

Which produces:

[[ 1.  2.  3.  1.  2.  3.]
[ 4.  5.  6.  4.  5.  6.]]
<NDArray 2x6 @gpu(0)>

mg0880gm · May 17, 2018, 12:56am

The answers are clear and straightforward. Thank you, guys.
