Bug 15328 - nvidia-current-cuda-opencl doesn't work
Summary: nvidia-current-cuda-opencl doesn't work
Status: RESOLVED FIXED
Alias: None
Product: Mageia
Classification: Unclassified
Component: RPM Packages (show other bugs)
Version: Cauldron
Hardware: x86_64 Linux
Priority: Normal normal
Target Milestone: ---
Assignee: Anssi Hannula
QA Contact:
URL:
Whiteboard: MGA5TOO
Keywords:
: 14462 (view as bug list)
Depends on:
Blocks:
 
Reported: 2015-02-19 22:06 CET by Bert Aerts
Modified: 2019-12-31 15:20 CET (History)
4 users (show)

See Also:
Source RPM: nvidia-current
CVE:
Status comment:


Attachments

Description Bert Aerts 2015-02-19 22:06:57 CET
Description of problem:

The nVIDIA driver that is installed by Mageia 5 beta 3 is 346.35 and is OK.
But I installed also nvidia-current-cuda-opencl-346.35-5.mga5.nonfree.x86_64.rpm
And also installed the CUDA Toolkit directly from nVIDIA website 
su
cuda_6.5.14_linux_64.run -override
(-override is to be able to use Mageia's gcc 4.9.2)
The simple toolkit sample deviceQuery returns 30 devices and gives error.
[bert@Dell7720Mageia5 release]$ pwd
/home/bert/Apps/NVIDIA_CUDA-6.5_Samples/bin/x86_64/linux/release
[bert@Dell7720Mageia5 release]$ ./deviceQuery

Then I created my own rpm's for nVIDIA with the script of Anssi:
http://onse.fi/nvidia-mgabuild/
nvidia-current-cuda-opencl-346.35-1custom.nvidia.mga5.x86_64.rpm and 4 others

and installed them with
su
rpm -Uvh --oldpackage *.rpm
removed the 2 nvidia-current-kernel
reboot

Now CUDA samples work perfectly.
CUDA in Mathematica 10.0.2 also works great.
With the official package Mathematica behaved extremely strange.

But I have no clue on what could be wrong with the offical rpm :-(

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.


Reproducible: 

Steps to Reproduce:
Thierry Vignaud 2015-02-19 23:27:33 CET

CC: (none) => tmb
Source RPM: (none) => nvidia-current

Morgan Leijström 2015-02-24 10:37:52 CET

CC: (none) => fri

Comment 1 Morgan Leijström 2015-02-24 13:07:18 CET
Thanks. I followed your instruction and now I too have CUDA and OpenCL :)
Florian Hubold 2015-02-24 22:58:21 CET

CC: (none) => doktor5000

Daniel Osmari 2015-03-15 22:40:26 CET

CC: (none) => danielosmari

Samuel Verschelde 2015-06-06 00:44:29 CEST

Summary: Mageia 5 beta 3 nvidia-current-cuda-opencl-346.35-5.mga5.nonfree.x86_64.rpm is NOK => nvidia-current-cuda-opencl doesn't work
Severity: normal => major
See Also: (none) => https://bugs.mageia.org/show_bug.cgi?id=15346
Assignee: bugsquad => anssi.hannula

Comment 2 Samuel Verschelde 2015-06-06 00:44:57 CEST
Is this bug still present in latest cauldron?

Keywords: (none) => NEEDINFO

Comment 3 Morgan Leijström 2015-06-06 15:10:37 CEST
WOOHO! Great, it works :)
I cleaned skip.list, autoupdated and watched the locally built packages be replaced, rebooted, and the machine still crunch BOINC project with GPU (GPUGRID)
Excerpt from start of BOINC log:
lör  6 jun 2015 14:56:07 |  | Starting BOINC client version 7.2.42 for x86_64-pc-linux-gnu
lör  6 jun 2015 14:56:07 |  | CUDA: NVIDIA GPU 0: GeForce GTX 760 (driver version unknown, CUDA version 7.0, compute capability 3.0, 2047MB, 1751MB available, 2469 GFLOPS peak)
lör  6 jun 2015 14:56:07 |  | OpenCL: NVIDIA GPU 0: GeForce GTX 760 (driver version 346.72, device version OpenCL 1.1 CUDA, 2047MB, 1751MB available, 2469 GFLOPS peak)
lör  6 jun 2015 14:56:07 |  | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz [Family 6 Model 42 Stepping 7]
lör  6 jun 2015 14:56:07 |  | OS: Linux: 3.19.8-desktop-2.mga5

I close this. If anyone have problems please repoen.

Keywords: NEEDINFO => (none)
Resolution: (none) => FIXED
Status: NEW => RESOLVED

Comment 4 Morgan Leijström 2016-01-15 13:38:28 CET
It was still working before christmas.
Then GPUGRID, the only BOINC project i run which use CUDA, ran out of work.
Then we got a new kernel and other updates. (i block nothing in skip.list)
Now GPUGRID send work but all fail with computation error
I also had one and fist time ever complete system freeze yesterday

There are currently several open mageia issues regarding
CUDA and OpenCL, you see when searching here.

This is part of BOINC log when it starts: 

fre 15 jan 2016 11:07:09 |  | Starting BOINC client version 7.2.42 for x86_64-pc-linux-gnu
fre 15 jan 2016 11:07:09 |  | log flags: file_xfer, sched_ops, task
fre 15 jan 2016 11:07:09 |  | Libraries: libcurl/7.40.0 OpenSSL/1.0.2e zlib/1.2.8 libidn/1.32 libssh2/1.4.3
fre 15 jan 2016 11:07:09 |  | Data directory: /var/lib/boinc
fre 15 jan 2016 11:07:09 |  | CUDA: NVIDIA GPU 0: GeForce GTX 760 (driver version unknown, CUDA version 7.0, compute capability 3.0, 2047MB, 1769MB available, 2469 GFLOPS peak)
fre 15 jan 2016 11:07:09 |  | OpenCL: NVIDIA GPU 0: GeForce GTX 760 (driver version 346.96, device version OpenCL 1.1 CUDA, 2047MB, 1769MB available, 2469 GFLOPS peak)
fre 15 jan 2016 11:07:09 |  | Host name: svarten
fre 15 jan 2016 11:07:09 |  | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz [Family 6 Model 42 Stepping 7]
fre 15 jan 2016 11:07:09 |  | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 popcnt tsc_deadline_timer aes xsave avx lahf_lm ida arat epb pln pts dtherm tpr_shadow vnmi flexpriority ept vpid xsaveopt
fre 15 jan 2016 11:07:09 |  | OS: Linux: 4.1.15-desktop-1.mga5
fre 15 jan 2016 11:07:09 |  | Memory: 7.78 GB physical, 3.91 GB virtual

Version: Cauldron => 5
Severity: major => normal
Status: RESOLVED => REOPENED
Resolution: FIXED => (none)

Comment 5 Thomas Backlund 2016-01-15 13:53:43 CET
Well you are using upstream cuda, so complain upstream
Comment 6 Morgan Leijström 2016-01-15 14:27:59 CET
Can you explain why you think that?  (see #3 and 4)
Comment 7 Morgan Leijström 2016-01-15 14:49:50 CET
To refresh, I uninstalled nvidia-current-cuda-opencl, and was going to uninstall nvidia-cuda-toolkit when i noticed it was not installed - Strange.

- maybe because some clash with wine i have been trying 
Bug 15346 - Lib64openCL1 with latest update of Wine conflicts with nvidia-cuda, also a Wine dependency.

- Or Bug 17059 - cuda conflicts with opencl 

At this point i rebooted and verified BOINC do not find CUDA.

Installing only nvidia-current-cuda-opencl and rebooting do not make BOINC find CUDA

Then i installed also nvidia-cuda-toolkit
- see https://bugs.mageia.org/show_bug.cgi?id=14462#c19 
...and rebooted, and now BOINC find CUDA and GPUGRID use it successfully :)

BOINC however do not list OpenCL anymore (like it did in #3 and 4 above)
- maybe bug 17059 -  but i do not need OpenCL currently. 

So i close this again; it is probably the other bugs i mentioned that struck.
Comment 8 Thomas Backlund 2016-01-15 14:52:50 CET
(In reply to Morgan Leijström from comment #6)
> Can you explain why you think that?  (see #3 and 4)


both comment 3 and 4 shows:

driver version unknown, CUDA version 7.0,


we only have 6.5 in mga5 and cauldron

Anyway, I guess I should try and sort out the cuda mess
Comment 9 Morgan Leijström 2016-01-15 14:58:06 CET
Strange, BOINC again reports version 7 
Anyhow it works now.
Probably best to leave this open
Comment 10 Morgan Leijström 2016-01-15 16:27:35 CET
uninstalled nvidia-cuda-toolkit, and installed lib64opencl1

(  file /usr/lib64/libOpenCL.so.1 of nvidia-cuda-toolkit-6.5.14-6.mga5.nonfree.x86_64 conflicts that file of lib64opencl1-2.1.3-3.mga5.x86_64  )

And now BOINC finds both OpenCL and CUDA - and GPUGRID works OK  :)
...At the moment ;)
Comment 11 Morgan Leijström 2016-01-25 21:05:25 CET
Another user solved it by manually linking
https://forums.mageia.org/en/viewtopic.php?f=15&t=8711&p=61839#p61839
Comment 12 Morgan Leijström 2016-07-09 13:19:02 CEST
That same system got upgraded online to cauldron and boinc just still works :)
Comment 13 Morgan Leijström 2016-10-07 14:18:29 CEST
.. since that, my son have borrewed the Nvidia card a few weeks, and now when i put it back Boinc could not use it.

This time i sucessfully tried the solution provided by Yann
https://bugs.mageia.org/show_bug.cgi?id=14462#c19

I did not try building locally as per #0

I also did not try #11

And did not try linking like have worked for another user:
https://forums.mageia.org/en/viewtopic.php?f=15&t=8711&p=61839#p61839


so... there are several working possibilities

Can we have one default working solution please?

Whiteboard: (none) => MGA5TOO
Version: 5 => Cauldron

Comment 14 Morgan Leijström 2016-10-07 14:20:03 CEST
*** Bug 14462 has been marked as a duplicate of this bug. ***
Comment 15 Morgan Leijström 2019-12-31 15:20:35 CET
Working on fresh install of Mageia 7, nvidia-current, installed boinc and nvidia-current-cuda-opencl and it just works. boinc-client-7.14.2-3.mga7.src.rpm

64 bit, and i let boincmgr start boinc after i log into Plasma.

Status: REOPENED => RESOLVED
Resolution: (none) => FIXED


Note You need to log in before you can comment on or make changes to this bug.