Bug 21973

Summary: Mageia 6 unusable with all CPUs at 100% when running dnetc (third-party software)
Product: Mageia Reporter: Paul Éric Despretz <paul-eric.despretz>
Component: RPM Packages Assignee: Kernel and Drivers maintainers <kernel>
Status: RESOLVED OLD QA Contact:
Severity: normal    
Priority: Normal CC: marja11, ouaurelien
Version: 6   
Target Milestone: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Source RPM: kernel-desktop-4.9.56-1.mga6-1-1.mga6 CVE:
Status comment:
Attachments: lshw from the machine

Description Paul Éric Despretz 2017-11-03 16:46:08 CET
Description of problem: Running the Distributed.net dnetc program makes the machine unusable.
Same as bug 16151

Version-Release number of selected component (if applicable):


How reproducible: always on my machine... :=(

Steps to Reproduce:
1. Install Distributed.net dnetc and try to run it under kernel-desktop-4.9.56-1.mga6-1-1.mga6.

Extract from "journalctl -r" (newest entries first):
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff9178b675>] ret_from_fork+0x25/0x30
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff91096f20>] ? kthread_park+0x60/0x60
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff91097006>] kthread+0xe6/0x100
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910ddbc0>] ? __note_gp_changes+0xb0/0xb0
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910de0a1>] rcu_gp_kthread+0x4e1/0x8e0
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910e4fa0>] ? del_timer_sync+0x50/0x50
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff91789951>] schedule_timeout+0x1d1/0x490
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff91786386>] schedule+0x36/0x80
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff91785ec0>] ? __schedule+0x220/0x6b0
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: Call Trace:
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  0000000000000246 0000000000000006 ffff9717ed719c80 ffffa24cc1907db0
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  ffff9717ef598f40 ffffa24cc1907d68 ffffffff91785ec0 ffffa24cc1907d38
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  ffff9717de9ac380 0000000000000000 ffff9717ed769c80 ffff9717ed719c80
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: rcu_sched       R  running task        0     7      2 0x00000000
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: rcu_sched kthread starved for 240012 jiffies! g1241 c1240 f0x2 RCU_GP_WAIT_FQS(3) ->state=0x0
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  <EOI> 
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff9178c022>] apic_timer_interrupt+0x82/0x90
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff9104c91d>] smp_apic_timer_interrupt+0x3d/0x50
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff9104c008>] local_apic_timer_interrupt+0x38/0x60
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910e7ef8>] hrtimer_interrupt+0xa8/0x190
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910e773e>] __hrtimer_run_queues+0xee/0x270
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910f6e3d>] tick_sched_timer+0x3d/0x70
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910f6826>] tick_sched_handle.isra.12+0x36/0x50
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910e6caf>] update_process_times+0x2f/0x60
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910f6e00>] ? tick_sched_do_timer+0x30/0x30
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910dfdce>] rcu_check_callbacks+0x87e/0x880
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  [<ffffffff910a4b7d>] sched_show_task+0xcd/0x130
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  <IRQ> 
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: Call Trace:
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  00000000000004d9 0000000000000672 00000000000004d9 0000000000000000
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  ffff9717ef543ec0 ffffffff910dfdce 0000000000000000 ffff9717ef543e88
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:  ffff9717ef543e58 ffffffff910a4b7d ffff9717ef559c80 ffffffff91c4a740
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: dnetc           R  running task        0  1492   1254 0x00000000
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: All QSes seen, last rcu_sched kthread activity 240009 (4294930856-4294690847), jiffies_till_next_fqs=3, r
nov. 03 16:12:05 Abraracourcix.despretz.org kernel:         (detected by 5, t=240008 jiffies, g=1241, c=1240, q=1650)
nov. 03 16:12:05 Abraracourcix.despretz.org kernel: INFO: rcu_sched detected stalls on CPUs/tasks:
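
The figures in the stall messages above can be checked with a short script. The following is a minimal Python sketch, not part of the original report: it pulls the jiffies counts out of the two rcu_sched lines and converts the starvation time to seconds. The tick rate (ASSUMED_HZ = 1000) is an assumption, since the report does not state the kernel's CONFIG_HZ value, and the function and variable names are illustrative only.

```python
import re

# Assumption: kernel tick rate. The report does not state CONFIG_HZ;
# 1000 is only a guess and should be checked against the kernel config.
ASSUMED_HZ = 1000

STARVED_RE = re.compile(r"rcu_sched kthread starved for (\d+) jiffies")
ACTIVITY_RE = re.compile(
    r"last rcu_sched kthread activity (\d+) \((\d+)-(\d+)\)"
)


def summarize_rcu_stall(lines, hz=ASSUMED_HZ):
    """Collect the jiffies figures from RCU stall lines of a kernel log."""
    summary = {}
    for line in lines:
        starved = STARVED_RE.search(line)
        if starved:
            jiffies = int(starved.group(1))
            summary["starved_jiffies"] = jiffies
            # Convert ticks to seconds using the assumed tick rate.
            summary["starved_seconds"] = jiffies / hz
        activity = ACTIVITY_RE.search(line)
        if activity:
            reported, newer, older = (int(g) for g in activity.groups())
            summary["activity_jiffies"] = reported
            # The two counters in parentheses should differ by the
            # reported value.
            summary["activity_consistent"] = (newer - older == reported)
    return summary


# Two lines copied from the journal extract (timestamp and host removed).
sample = [
    "kernel: rcu_sched kthread starved for 240012 jiffies! "
    "g1241 c1240 f0x2 RCU_GP_WAIT_FQS(3) ->state=0x0",
    "kernel: All QSes seen, last rcu_sched kthread activity 240009 "
    "(4294930856-4294690847), jiffies_till_next_fqs=3, r",
]

if __name__ == "__main__":
    print(summarize_rcu_stall(sample))
```

Under the assumed tick rate, 240012 jiffies corresponds to roughly 240 seconds without the rcu_sched kthread running, and the two counters in parentheses differ by exactly the reported 240009 jiffies.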

I can use my machine successfully with this workaround:
1) boot in recovery mode to disable the dnetc computing program
2) install the latest available Mageia 5 kernel
urpmi kernel-desktop-4.4.92-1.mga5-1-1.mga5.x86_64.rpm kernel-desktop-devel-4.4.92-1.mga5-1-1.mga5.x86_64.rpm
3) re-enable the automatic startup of dnetc
4) reboot

All is OK with the kernel-desktop-4.4.92. :=)
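
Before re-enabling dnetc it can help to confirm which kernel actually booted. The following is a hedged Python sketch, not part of the original report: it extracts the version from a kernel release string and compares it with the two versions named here. The release string format ("4.9.56-desktop-1.mga6") is an assumption about what "uname -r" prints on Mageia, and the function names are hypothetical.

```python
import re

# Versions taken from this report: 4.9.56 stalled, 4.4.92 worked.
REPORTED_BAD = (4, 9, 56)
REPORTED_GOOD = (4, 4, 92)


def kernel_version(release):
    """Return (major, minor, patch) from a kernel release string."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognized kernel release: {release!r}")
    return tuple(int(part) for part in match.groups())


def describe(release):
    """Say what this report tells us about the given kernel release."""
    version = kernel_version(release)
    if version == REPORTED_BAD:
        return "reported as stalling with dnetc"
    if version == REPORTED_GOOD:
        return "reported as working with dnetc"
    return "not covered by this report"


if __name__ == "__main__":
    # Assumed release strings; on a live system platform.release()
    # from the standard library returns the running kernel's release.
    for release in ("4.9.56-desktop-1.mga6", "4.4.92-desktop-1.mga5"):
        print(release, "->", describe(release))
```

The check compares exact versions only, because the report gives no information about other kernels in either series.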
Comment 1 Paul Éric Despretz 2017-11-03 16:48:18 CET
Created attachment 9771 [details]
lshw from the machine
Comment 2 Marja Van Waes 2017-11-03 22:20:21 CET
Assigning to the kernel and drivers maintainers, for them to decide what to do with this report.

CC: (none) => marja11
Assignee: bugsquad => kernel

Comment 3 Aurelien Oudelet 2020-08-05 16:45:41 CEST
This message is a reminder that Mageia 6 is end of life.

Mageia has stopped maintaining and issuing updates for Mageia 6. This bug is therefore being closed as OLD (EOL).

Package Maintainer: If you wish for this bug to remain open because you plan to 
fix it in a currently maintained version, simply change the 'version' to a later 
Mageia version prior to Mageia 6's end of life.

Bug Reporter: Thank you for reporting this issue. We are sorry that we were not 
able to fix it before Mageia 6 reached end of life.
If you would still like to see this bug fixed and are able to reproduce it against a later version of Mageia, you are encouraged to click on "Version" and change it to that version of Mageia.

Although we aim to fix as many bugs as possible during every release's lifetime, 
sometimes those efforts are overtaken by events. Often a more recent Mageia 
release includes newer upstream software that fixes bugs or makes them obsolete.

--
Mageia Bugsquad

Resolution: (none) => OLD
CC: (none) => ouaurelien
Status: NEW => RESOLVED