当前位置:网站首页>cuDNN installation
cuDNN installation
2022-06-24 04:58:00 【vanguard】
NVIDIA cuDNNis a GPU-accelerated library of primitives for deep neural networks.
- Hardware preparation ( Power Supply + a main board + processor + Fan + Memory + External storage /NVMESSD/HDD+Nvidia The graphics card )
- Operating system and tool installation (Ubuntu20.04+update+net-tools+ssh+vim+python3-pip+samba+git+xrdp+virtualenv)
- Graphics card driver and NVIDIA software installation (Driver+CUDA+cuDNN+TensorRT)
- Rely on software and framework installation (tensorflow-gpu+pytorch+opencv-python+yolo...)
- Container or direct training models and reasoning (docker+nvidia-docker...)
cuDNN Installation process ( Now you need to log in to get this link )
wget https://developer.download.nvidia.cn/compute/machine-learning/cudnn/secure/8.2.2/11.4_07062021/cudnn-11.4-linux-x64-v8.2.2.26.tgz?zVO0xngn9RHkR6idYHi7_WjTxJhRatqOB0Tsrbzn-y1zIokHbv0PQO_U8XLu7aMydM33JWOczvkirvAZ9BNN-aqsIyCpxg5Vc_sbF6AF8K6lGSXQ-CZXUe6IBt-5mcsMERGmkvQACeYRwKLqk7xy76mzV9epqp5_EgFkNFt7RcvA0T97ozdTs6e63yabuR5LkFx-de-Oa6IPbuU tar xvf * sudo cp -a include/cudnn.h /usr/local/cuda/include/ sudo cp -a lib64/libcudnn* /usr/local/cuda/lib64/ # nvidia-smi # nvcc -V
Difficulty or CUDA Installation
https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#removing-cuda-tk-and-driver
# To remove CUDA Toolkit: sudo apt-get --purge remove "*cublas*" "*cufft*" "*curand*" \ "*cusolver*" "*cusparse*" "*npp*" "*nvjpeg*" "cuda*" "nsight*" # To remove NVIDIA Drivers: sudo apt-get --purge remove "*nvidia*" # To clean up the uninstall: sudo apt-get autoremove
The drive shall be installed separately as far as possible , Because some do not rely on CUDA But depending on the driver, especially if you want to replace the native driver , Set the environment variable after installation
export PATH=/usr/local/cuda-11.4/bin${PATH:+:${PATH}}
export LD_LIBRARY_PATH=/usr/local/cuda-11.4/lib64\
${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
source ~/.bashIf not installed cuDNN, May skip GPU Use :
2021-08-26 19:55:22.789937: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudnn.so.8'; dlerror: libcudnn.so.8: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/local/cuda-11.4/lib64 2021-08-26 19:55:22.790001: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1835] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform. Skipping registering GPU devices... 2021-08-26 19:55:22.790631: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. 2021-08-26 19:55:23.528475: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:185] None of the MLIR Optimization Passes are enabled (registered 2)
install cuDNN after , Can be used , Can also pass nvidia-smi Observe the usage of video memory, etc
2021-08-30 16:57:03.457415: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. 2021-08-30 16:57:05.198665: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1510] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 17540 MB memory: -> device: 0, name: NVIDIA GeForce RTX 3090, pci bus id: 0000:02:00.0, compute capability: 8.6 2021-08-30 16:57:06.848155: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:185] None of the MLIR Optimization Passes are enabled (registered 2) Epoch 1/5 2021-08-30 16:57:10.171347: I tensorflow/stream_executor/cuda/cuda_blas.cc:1760] TensorFloat-32 will be used for the matrix multiplication. This will only be logged once.
Mon Aug 30 17:17:31 2021 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 NVIDIA GeForce ... Off | 00000000:02:00.0 Off | N/A | | 35% 50C P2 109W / 350W | 23055MiB / 24265MiB | 1% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ | 1 NVIDIA GeForce ... Off | 00000000:82:00.0 Off | N/A | | 34% 44C P0 110W / 350W | 0MiB / 24268MiB | 2% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | 0 N/A N/A 5741 C python 23053MiB | +-----------------------------------------------------------------------------+
cuDNN
边栏推荐
- Here's all you want to know about takin data (1) startup command
- LeetCode 1662. Check whether two string arrays are equal
- Use of golang testing framework test
- How does the VPS server upload data? Is the VPS server free to use?
- Problem: SQL create stored procedure
- Deep learning NLP from RNN LSTM Gru seq2seq to attention classification and analysis
- 『渗透基础』Cobalt Strike基础使用入门_Cobalt Strike联动msfconsole
- Ext4 file system jam caused by MEM CGroup OOM
- Naming of tables in MySQL
- What are the advantages of ECS? Is ECS better than VM?
猜你喜欢

Analyzing the superiority of humanoid robot in the post human era

SAP MTS/ATO/MTO/ETO专题之十:ETO模式 Q+空模式 未估价库存 策略自定义

解析后人类时代类人机器人的优越性
Advanced authentication of uni app [Day12]

Apipost interface assertion details

Idea创建Servlet 后访问报404问题

Facebook内部通告:将重新整合即时通讯功能

Idea creates a servlet and accesses the 404 message

Leetcode (question 1) - sum of two numbers

『渗透基础』Cobalt Strike基础使用入门_Cobalt Strike联动msfconsole
随机推荐
014_ TimePicker time selector
问题:sql创建存储过程
Replication of variables in golang concurrency
After purchasing Tencent ECs, how to solve packet loss in Internet access?
提pr,push 的时候网络超时配置方法
Detailed explanation of tcpip protocol
How to add a domain name to ECS? What are the advantages of ECS?
Shutter - how to copy certain elements from a map to a new map in dart/shutter?
TCPIP协议详解
What are the functions of ASP files on ECs? What if the ECS cannot run ASP with a low version?
Is it useful to build an industrial knowledge map platform?
Specificity and correlation of Worthington deoxyribonuclease I
Bi-sql basic cognition
Popularization of children's programming education in specific scenarios
What is the experience of developing an ice 3D music player in 3 minutes?
Qiming cloud sharing: tips on esp32c3 simple IO and serial port
Tencent cloud audio and video award-winning evaluation | leave online messages or submit evaluation, win Dajiang UAV /iphone/switch and other awards
When remote, your resolution is lower than a × B. Some items may not be displayed on the screen
Jimureport building block report - expression introduction
Use of go testing framework gomock