-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
require 'nccl' doesn't work #5
Comments
You only need one of those files, either libnccl.1.dylib or libnccl.so.1. Error message you were getting from nccl " 'libnccl.so not found in library path" is confusing, sorry, your actual problem is that libnccl.so.1 can not be loaded successfully. "undefined symbol: __cudaRegisterFatBinary:" indicates cuda install problems or torch compiled without cuda support. Does require 'cutorch' work? |
I have the same issue (library loading fails with undefined symbol). cutorch and other cuda packages work just fine. |
These symbols are linked into the library as undefined: ~/nccl/build/lib [master*]$ nm libnccl.so | grep FatBinary
U __cudaRegisterFatBinary
U __cudaUnregisterFatBinary |
Are you using nccl built from source, or deb package? I can't repro with nccl built from source. |
I built it from source 😕 |
I'm using CUDA 7.5.18 on Ubuntu 14.04.1 btw |
I managed to solve this issue. It seems that the gcc linker didn't add |
Your solution worked for me, too. Thanks a lot! |
Thanks, guys. Closing. |
I am still facing the issue. Can you please explain the steps in detail. I am new to torch as well as nccl and so I am unable to understand exactly what I need to solve this issue. |
can you please explain the steps in detail.,thank. |
Hello!
I was trying to use nccl for torch. Installing worked fine for both the Nvidia library and the torch bindings. However when I try
require 'nccl'
, I get this error message:Problem is, I already added libnccl.so to my LD_LIBRARY_PATH and double-checked that it's really there. So I tried to manually load the two files, which are loaded in ffi.lua. But then I get these errors:
Indeed this file doesn't exist in my nccl/lib directory. Do I have to create it manually, since it didn't show up automatically after installing nccl? How could I do so?
For the second file I get this error:
How can I solve this problem?
Thanks!
The text was updated successfully, but these errors were encountered: