Homework 1.3 MxNet GPU

lklkylx · January 28, 2019, 2:28am

I can’t install mxnet for gpu. I have SSH into AWS GPU. I have tried installing both CUDA version 10.13 and 9 but when I run mxnet-CU90, it does not work. Does anyone know how to install the gpu version of mxnet?

joanne.chen · January 28, 2019, 6:16am

I’m having the same issue. I think it is highly unreasonable for the instructors to expect us to complete this homework with the confusing and vague installation guidelines we’ve been given and the inadequate instruction we’ve had on these topics

astonzhang · January 28, 2019, 6:45am

For AWS instances pre-installed with CUDA 10.*, you need to install mxnet-cu100 instead of mxnet-cu90

Alternatively, can you follow instructions at
https://en.d2l.ai/chapter_appendix/aws.html

You can use a clean AWS instance, install CUDA 9.0 by yourself, then use mxnet-cu90

yopourquoi · January 28, 2019, 8:49am

I did installed mxnet-cu100 and updated environment.yml to pip: mxnet-cu100.
However, I get the following error when I tried to import mxnet.

import mxnet

Traceback (most recent call last):

File “<stdin>”, line 1, in <module>

File “/home/ubuntu/miniconda3/envs/gluon/lib/python3.6/site-packages/mxnet/init.py”, line 24, in <module>

from .context import Context, current_context, cpu, gpu, cpu_pinned

File “/home/ubuntu/miniconda3/envs/gluon/lib/python3.6/site-packages/mxnet/context.py”, line 24, in <module>

from .base import classproperty, with_metaclass, _MXClassPropertyMetaClass

File “/home/ubuntu/miniconda3/envs/gluon/lib/python3.6/site-packages/mxnet/base.py”, line 213, in <module>

_LIB = _load_lib()

File “/home/ubuntu/miniconda3/envs/gluon/lib/python3.6/site-packages/mxnet/base.py”, line 204, in _load_lib

lib = ctypes.CDLL(lib_path[0], ctypes.RTLD_LOCAL)

File “/home/ubuntu/miniconda3/envs/gluon/lib/python3.6/ctypes/init.py”, line 348, in init

self._handle = _dlopen(self._name, mode)

OSError: libcudart.so.10.0: cannot open shared object file: No such file or directory

astonzhang · January 28, 2019, 10:29pm

Can you just follow the instructions at:
https://en.d2l.ai/chapter_appendix/aws.html

and use mxnet-cu90 after installing CUDA 9.0

zzseany · January 28, 2019, 10:39pm

Similar question here. After we complete all the installations should we upload the homework1.ipynb to gpu instance to get the display for !nvidia-smi. Is there a way to keep the file at local but still get the output from !nvidia-smi?

smolix · January 28, 2019, 10:59pm

We have so far provided a number of ways for getting this done:

On your own GPU enabled machine
Using Colab (it’s literally two lines of code that you need to add to the notebook)
Using the DL AMI as per the slides (they’re tested for CUDA 9.2).

Beyond that, Aston is putting up detailed instructions for CUDA 10.0 which is still quite new and thus not supported by default everywhere. Apologies if you’re having trouble. We will post an update shortly.

smolix · January 29, 2019, 12:36am

Please have a look at the detailed walkthrough (tested as of this afternoon):

smolix · January 29, 2019, 1:10am

You need to take care of the /usr/local/cuda link to point to /usr/local/cuda-10.0. Otherwise MXNet won’t know where to find the right version of CUDA.

Topic		Replies	Views
MXNet install instructions on AWS DL AMI Courses	1	2129	January 29, 2019
Dead Kernel When Importing MXNet (HW1, Q3) Courses	4	1109	January 29, 2019
Python Stopped working error on GPU Gluon	2	1009	January 19, 2019
Mxnet R Installation for GPU (Windows) not working	6	1673	August 24, 2018
Mxnet version for cuda 10.2 in windows 10?	3	7451	May 6, 2020

Homework 1.3 MxNet GPU

Related Topics