Bug 7879 - System crash after waking up
Summary: System crash after waking up
Status: RESOLVED OLD
Alias: None
Product: Mageia
Classification: Unclassified
Component: RPM Packages
Version: 2
Hardware: x86_64 Linux
Priority: Normal
Severity: critical
Target Milestone: ---
Assignee: Mageia Bug Squad
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-10-23 03:59 CEST by Richard Giroux
Modified: 2013-11-23 16:15 CET
0 users

See Also:
Source RPM:
CVE:
Status comment:


Attachments
var/log/messages from bad/good boot (175.42 KB, text/plain)
2012-10-28 03:04 CET, Richard Giroux

Description Richard Giroux 2012-10-23 03:59:58 CEST
Description of problem:
Some time after resuming from s2disk, I started the Python CLI from Konsole (KDE). The system crashed completely to a (sorry) "black screen of death"...

I wrote down on paper (so it is incomplete) parts of the on-screen bug output that I thought would not be in /var/log/messages; it at least gives something to go on:
BUG: unable to handle kernel NULL pointer at 0000000000000078
IP: [...] raw_spin_lock
PGD
Oops SMP
CPU1
Modules linked:...
Pid 1903, comm: X tainted ... 3.3.8-desktop-2.mga2
ttm_vo_reserve+0x3e/0a0
? nouveau_gem_ioctl_push_buf
drm_ioctl
nouveau_gem_ioctl_new
do_vfs_ioctl


Rebooting, I was then able to get this from /var/log/messages, for the time corresponding with the crash:

Oct 22 20:48:25 localhost sensord: Chip: nouveau-pci-0100
Oct 22 20:48:25 localhost sensord: Adapter: PCI adapter
Oct 22 20:48:25 localhost sensord:   temp1: 0.0 C
Oct 22 20:53:32 localhost kernel: [ 6319.268447] sleep[17336]: segfault at 7f9e021c5780 ip 00007f9e09fb3f71 sp 00007fffc877b6d0 error 4 in ld-2.14.1.so[7f9e09fa6000+1e000]
Oct 22 20:53:32 localhost ksmtuned[2519]: /usr/sbin/ksmtuned: line 120: 17336 Segmentation fault      sleep $KSM_MONITOR_INTERVAL
Oct 22 20:56:58 localhost kernel: [ 6524.694669] BUG: Bad page map in process kscreenlocker  pte:8000000120317045 pmd:1205b0067
Oct 22 20:56:58 localhost kernel: [ 6524.694674] page:ffffea000480c5c0 count:1 mapcount:-1 mapping:ffff8800b65e9690 index:0xe65
Oct 22 20:56:58 localhost kernel: [ 6524.694676] page flags: 0x200000000020038(uptodate|dirty|lru|mappedtodisk)
Oct 22 20:56:58 localhost kernel: [ 6524.694682] addr:00007fc4c06dc000 vm_flags:08100071 anon_vma:ffff88006fe070d8 mapping:ffff880163637360 index:e7
Oct 22 20:56:58 localhost kernel: [ 6524.694688] vma->vm_ops->fault: filemap_fault+0x0/0x480
Oct 22 20:56:58 localhost kernel: [ 6524.694713] vma->vm_file->f_op->mmap: ext4_file_mmap+0x0/0x60 [ext4]
Oct 22 20:56:58 localhost kernel: [ 6524.694716] Pid: 14642, comm: kscreenlocker Tainted: G           O 3.3.8-desktop-2.mga2 #1
Oct 22 20:56:58 localhost kernel: [ 6524.694719] Call Trace:
Oct 22 20:56:58 localhost kernel: [ 6524.694724]  [<ffffffff81128139>] print_
Oct 22 21:07:52 localhost kernel: imklog 5.8.10, log source = /proc/kmsg started.

Is it really due to ksmtuned or kscreenlocker? That was not obvious from the "crash" screen dump.
It seems clearer from /var/log/messages, but I am not able to track this bug any further.
And why is qemu running on my computer?
Can you help me?
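
For narrowing down which messages belong to a crash, a small filter over /var/log/messages can help. The following is only a sketch: the regular expression assumes the classic syslog layout seen in the excerpt above, and the names KERNEL_LINE, MARKERS and crash_events are made up for illustration.

```python
import re

# Matches classic syslog kernel lines such as:
#   Oct 22 20:56:58 localhost kernel: [ 6524.694669] BUG: Bad page map ...
KERNEL_LINE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s[\d:]{8})\s\S+\skernel:\s"
    r"\[\s*(?P<uptime>[\d.]+)\]\s(?P<msg>.*)$"
)

# Substrings that usually mark the start of a kernel problem report.
MARKERS = ("BUG:", "Oops", "segfault at")


def crash_events(lines):
    """Yield (timestamp, uptime_seconds, message) for suspicious kernel lines."""
    for line in lines:
        m = KERNEL_LINE.match(line)
        if m and any(marker in m.group("msg") for marker in MARKERS):
            yield m.group("stamp"), float(m.group("uptime")), m.group("msg")


if __name__ == "__main__":
    # Reading the system log normally requires root privileges.
    with open("/var/log/messages", errors="replace") as log:
        for stamp, uptime, msg in crash_events(log):
            print(stamp, uptime, msg)
```

Lines from other daemons (sensord, ksmtuned, and so on) are skipped, so the output lists only the kernel events and their timestamps. It does not say which process is at fault.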

Version-Release number of selected component (if applicable):

Linux localhost.localdomain 3.3.8-desktop-2.mga2 #1 SMP Mon Jul 30 21:35:06 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux

How reproducible:
Can't reproduce it

Steps to Reproduce:
1.
2.
3.
Comment 1 Richard Giroux 2012-10-25 03:39:09 CEST
The crash happened again, in a similar wake-up situation.

This time /var/log/messages shows:
Oct 24 21:14:56 localhost avahi-daemon[1645]: New relevant interface eth0.IPv4 for mDNS.
Oct 24 21:14:56 localhost avahi-daemon[1645]: Registering new address record for 192.168.2.20 on eth0.IPv4.
Oct 24 21:14:56 localhost kernel: [31055.233643] ip[10731]: segfault at 0 ip 000000000042aad0 sp 00007fff9c3983d0 error 4 in ip[400000+3e000]
Oct 24 21:14:56 localhost kernel: [31055.237980] ip[10732]: segfault at 0 ip 000000000042aad0 sp 00007fff13f15660 error 4 in ip[400000+3e000]
Oct 24 21:14:57 localhost NetworkManager[1826]: <info> (eth0): writing resolv.conf to /sbin/resolvconf
Oct 24 21:14:57 localhost NetworkManager[1826]: <info> (eth0): device state change: ip-config -> activated (reason 'none') [70 100 0]
Oct 24 21:14:57 localhost NetworkManager[1826]: <info> Policy set 'System eth0' (eth0) as default for IPv4 routing and DNS.
Oct 24 21:14:57 localhost NetworkManager[1826]: <info> Activation (eth0) successful, device activated.
Oct 24 21:14:57 localhost NetworkManager[1826]: <info> Activation (eth0) Stage 5 of 5 (IPv4 Commit) complete.
Oct 24 21:14:57 localhost dbus[1882]: [system] Activating service name='org.freedesktop.nm_dispatcher' (using servicehelper)
Oct 24 21:14:57 localhost dbus-daemon[1882]: dbus[1882]: [system] Activating service name='org.freedesktop.nm_dispatcher' (using servicehelper)
Oct 24 21:14:57 localhost dbus-daemon[1882]: dbus[1882]: [system] Successfully activated service 'org.freedesktop.nm_dispatcher'
Oct 24 21:14:57 localhost dbus[1882]: [system] Successfully activated service 'org.freedesktop.nm_dispatcher'
Oct 24 21:14:57 localhost kernel: [31056.290306] ip[10835]: segfault at 7df2c01 ip 000000000042aad0 sp 00007fff07df1ba0 error 4 in ip[400000+3e000]
Oct 24 21:14:57 localhost avahi-daemon[1645]: Registering new address record for fe80::21e:c9ff:fe51:581f on eth0.*.
Oct 24 21:16:06 localhost kernel: [31125.448436] BUG: unable to handle kernel NULL pointer dereference at 00000000000007f0
Oct 24 21:16:06 localhost kernel: [31125.448490] IP: [<ffffffff81129b54>] unmap_vmas+0x1a4/0x8f0
Oct 24 21:16:06 localhost kernel: [31125.448522] PGD 1180d5067 PUD 1180d4067 PMD 0
Oct 24 21:16:06 localhost kernel: [31125.448553] Oops: 0000 [#1] SMP
Oct 24 21:16:06 localhost kernel: [31125.448577] CPU 2
Oct 24 21:16:06 localhost kernel: [31125.448597] Modules linked in: iptable_filter ip_tables x_tables dm_zero af_packet sg bnep bluetooth rfkill parport_pc ppdev parport binfmt_misc vboxnetadp(O) vboxnetflt(O) vboxdrv(O) joydev usbhid hid wacom xfs exportfs usb_storage uas dm_mirror dm_region_hash dm_log dm_mod fuse raid1 dcdbas snd_hda_codec_idt snd_hda_intel snd_hda_codec serio_raw snd_hwdep snd_pcm cpufreq_ondemand cpufreq_conservative iTCO_wdt cpufreq_powersave iTCO_vendor_support acpi_cpufreq evdev mperf freq_table i2c_i801 kvm_intel snd_page_alloc kvm snd_timer snd processor soundcore x38_edac edac_core ipv6 autofs4 ext4 crc16 jbd2 firewire_ohci sr_mod ehci_hcd sd_mod firewire_core crc_t10dif crc_itu_t uhci_hcd usbcore e1000e usb_common nouveau button video mxm_wmi wmi drm_kms_helper ttm drm i2c_core ahci libahci ata_piix libata scsi_mod [last unloaded: microcode]
Oct 24 21:16:06 localhost kernel: [31125.449002]
Oct 24 21:16:06 localhost kernel: [31125.449002] Pid: 4361, comm: akonadi_agent_l Tainted: G           O 3.3.8-desktop-2.mga2 #1 Dell Inc. Dell XPS420                  /0TP406
Oct 24 21:16:06 localhost kernel: [31125.449002] RIP: 0010:[<ffffffff81129b54>]  [<ffffffff81129b54>] unmap_vmas+0x1a4/0x8f0
Oct 24 21:16:06 localhost kernel: [31125.449002] RSP: 0018:ffff8801180b7c88  EFLAGS: 00010206
Oct 24 21:16:06 localhost kernel: [31125.449002] RAX: 00007f8000000000 RBX: 0000000000000000 RCX: 00007f22059bbfff
Oct 24 21:16:06 localhost kernel: [31125.449002] RDX: 00000000000007f0 RSI: 00007f8000000000 RDI: ffffff8000000000
Oct 24 21:16:06 localhost kernel: [31125.449002] RBP: ffff8801180b7d98 R08: ffff8801180f7d10 R09: ffff88017bbf9f58
Oct 24 21:16:06 localhost kernel: [31125.449002] R10: 0000000000000057 R11: 0000000000000000 R12: 00007f22057bc000
Oct 24 21:16:06 localhost kernel: [31125.449002] R13: ffffea00058da500 R14: ffff8801180d9160 R15: ffff8801180d9158
Oct 24 21:16:06 localhost kernel: [31125.449002] FS:  0000000000000000(0000) GS:ffff88017bc80000(0000) knlGS:0000000000000000
Oct 24 21:16:06 localhost kernel: [31125.449002] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Oct 24 21:16:06 localhost kernel: [31125.449002] CR2: 00000000000007f0 CR3: 00000001180d2000 CR4: 00000000000006e0
Oct 24 21:16:06 localhost kernel: [31125.449002] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct 24 21:16:06 localhost kernel: [31125.449002] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Oct 24 21:16:06 localhost kernel: [31125.449002] Process akonadi_agent_l (pid: 4361, threadinfo ffff8801180b6000, task ffff88011ccd5a40)
Oct 24 21:16:06 localhost kernel: [31125.449002] Stack:
Oct 24 21:16:06 localhost kernel: [31125.449002]  ffffea00047fd130 00007f21fcc2d140 ffff8801180b7d28 ffffffff811290e5
Oct 24 21:16:06 localhost kernel: [31125.449002]  000000013e80a025 ffff8801180b7e10 ffff88016c25f000 ffffffffffffffff
Oct 24 21:16:06 localhost kernel: [31125.449002]  0000000000000000 0000000000000000 00007f22059bbfff 00007f22059bc000
Oct 24 21:16:06 localhost kernel: [31125.449002] Call Trace:
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff811290e5>] ? do_wp_page+0x325/0x6e0
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff81131e17>] exit_mmap+0x97/0x140
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff8104b174>] mmput+0x64/0x140
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff8105035c>] exit_mm+0xfc/0x120
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff81051c44>] do_exit+0x164/0x870
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff8144f794>] ? __schedule+0x3c4/0x7b0
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff810526b4>] do_group_exit+0x44/0xa0
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff81052727>] sys_exit_group+0x17/0x20
Oct 24 21:16:06 localhost kernel: [31125.449002]  [<ffffffff81458e79>] system_call_fastpath+0x16/0x1b
Oct 24 21:16:06 localhost kernel: [31125.449002] Code: ff ff ff 0f 1f 40 00 48 be 00 00 00 00 80 00 00 00 48 bf 00 00 00 00 80 ff ff ff 48 8b 95 58 ff ff ff 4c 01 e6 48 21 fe 48 89 f0 <48> 8b 3a 48 83 e8 01 48 3b 85 40 ff ff ff 48 8b 85 48 ff ff ff
Oct 24 21:16:06 localhost kernel: [31125.449002] RIP  [<ffffffff81129b54>] unmap_vmas+0x1a4/0x8f0
Oct 24 21:16:06 localhost kernel: [31125.449002]  RSP <ffff8801180b7c88>
Oct 24 21:16:06 localhost kernel: [31125.449002] CR2: 00000000000007f0
Oct 24 21:16:06 localhost kernel: [31125.488604] ---[ end trace 4865b2c6df7fce82 ]---
Oct 24 21:16:06 localhost kernel: [31125.488614] Fixing recursive fault but reboot is needed!
Oct 24 21:16:07 localhost kernel: [31125.845632] akonadi_agent_l[4391]: segfault at 7f57e0001ce8 ip 00007f57f8cb0129 sp 00007f57f0702e10 error 6 in libglib-2.0.so.0.3200.4[7f57f8c50000+f2000]
Oct 24 21:16:07 localhost acpid: client 1960[0:0] has disconnected
Oct 24 21:16:07 localhost acpid: client connected from 12023[0:0]
Oct 24 21:16:07 localhost acpid: 1 client rule loaded
Oct 24 21:16:08 localhost kernel: [31126.814421] nepomukservices[29141]: segfault at 0 ip 00007f26be6e25ae sp 00007f26add922d0 error 4 in libQtCore.so.4.8.2[7f26be66c000+2c4000]
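
The user-space "segfault at ..." lines above can be decoded a little further. The sketch below is not part of the original report and its names (SEGFAULT, decode_segfault) are illustrative. It assumes the x86 page-fault error code layout (bit 0: protection violation rather than page not present, bit 1: write access, bit 2: user mode) and that the kernel prints the error value in hexadecimal. It also computes the offset of the faulting instruction inside the mapped object.

```python
import re

# Matches the kernel's user-space segfault report, for example:
#   ip[10731]: segfault at 0 ip 000000000042aad0 sp 00007fff9c3983d0 error 4 in ip[400000+3e000]
SEGFAULT = re.compile(
    r"(?P<comm>\S+)\[(?P<pid>\d+)\]: segfault at (?P<addr>[0-9a-f]+) "
    r"ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+)"
    r"(?: in (?P<obj>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\])?"
)


def decode_segfault(line):
    """Return a dict describing one segfault line, or None if it does not match."""
    m = SEGFAULT.search(line)
    if m is None:
        return None
    err = int(m.group("err"), 16)
    info = {
        "comm": m.group("comm"),
        "pid": int(m.group("pid")),
        "fault_addr": int(m.group("addr"), 16),
        "object": m.group("obj"),
        # Low three bits of the x86 page-fault error code.
        "cause": "protection violation" if err & 1 else "page not present",
        "access": "write" if err & 2 else "read",
        "mode": "user" if err & 4 else "kernel",
    }
    if m.group("base") is not None:
        # Offset of the faulting instruction within the mapped object.
        info["offset"] = int(m.group("ip"), 16) - int(m.group("base"), 16)
    return info
```

With this reading, "error 4" is a user-mode read of an unmapped page and "error 6" is a user-mode write to an unmapped page. The computed offset could be fed to a tool such as addr2line, provided matching debug symbols are installed.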

Summary: System crash after waking up, starting python cli => System crash after waking up

Comment 2 Manuel Hiebel 2012-10-26 22:53:41 CEST
The second one seems to be an akonadi/nepomuk bug, which is also strange. John, Nicolas, am I right or not?

CC: (none) => balcaen.john, nicolas.lecureuil
Source RPM: qemu-1.0-6.2.mga2 => qemu-1.0-6.2.mga2 akonadi

Comment 3 Richard Giroux 2012-10-28 03:04:39 CET
Created attachment 3001 [details]
var/log/messages from bad/good boot

"bad boot": from Oct 27 21:13:14 to  Oct 27 21:27:29
"good boot": from Oct 27 21:29:02
Comment 4 Richard Giroux 2012-10-28 03:06:41 CET
Is my computer strange or what?

I got another segfault after a cold boot, when starting X11. The console prompt was functional, but X refused to start from there.
I tried a "reboot" as root; the system crashed (it could not fully power off) and I had to shut it down manually.
Rebooting gave a fully functional system.

This is from /var/log/Xorg.0.log on the "bad boot":

[   725.624] (II) NOUVEAU(0): NVEnterVT is called.
[   725.689] (II) NOUVEAU(0): Setting screen physical size to 508 x 317
[   725.689] resize called 1920 1200
[   725.701] (II) XKB: generating xkmfile /usr/share/X11/xkb/compiled/server-8AA988DD479FAABEC4FC3CCCF4CC29B4948840B4.xkm
[   725.721]
Backtrace:
[   725.721] 0: /etc/X11/X (xorg_backtrace+0x26) [0x567e16]
[   725.721] 1: /etc/X11/X (0x400000+0x16ba59) [0x56ba59]
[   725.721] 2: /lib64/libpthread.so.0 (0x7f461ad2d000+0xef70) [0x7f461ad3bf70]
[   725.721] 3: /etc/X11/X (SrvXkbApplyCompatMapToKey+0x19) [0x535c69]
[   725.721] 4: /etc/X11/X (XkbUpdateDescActions+0x52) [0x51b642]
[   725.721] 5: /etc/X11/X (XkbUpdateActions+0x83) [0x51b863]
[   725.721] 6: /etc/X11/X (InitKeyboardDeviceStruct+0x539) [0x5257b9]
[   725.721] 7: /etc/X11/X (0x400000+0x2965e) [0x42965e]
[   725.721] 8: /etc/X11/X (ActivateDevice+0x43) [0x429a53]
[   725.721] 9: /etc/X11/X (0x400000+0x2d993) [0x42d993]
[   725.721] 10: /etc/X11/X (0x400000+0x22f7b) [0x422f7b]
[   725.721] 11: /lib64/libc.so.6 (__libc_start_main+0xed) [0x7f4619c6932d]
[   725.722] 12: /etc/X11/X (0x400000+0x232bd) [0x4232bd]
[   725.722] Segmentation fault at address (nil)
[   725.722]
Fatal server error:
[   725.722] Caught signal 11 (Segmentation fault). Server aborting
[   725.722]
[   725.722]
Please consult the The X.Org Foundation support
         at http://bugs.mageia.org
 for help.
[   725.722] Please also check the log file at "/var/log/Xorg.0.log" for additional information.
[   725.722]
[   725.722] (II) AIGLX: Suspending AIGLX clients for VT switch
[   725.722] (II) NOUVEAU(0): NVLeaveVT is called.
[   725.776] Server terminated with error (1). Closing log file.
Manuel Hiebel 2012-10-28 08:40:32 CET

CC: balcaen.john, nicolas.lecureuil => (none)
Source RPM: qemu-1.0-6.2.mga2 akonadi => (none)

Comment 5 Manuel Hiebel 2013-10-22 12:19:14 CEST
This message is a reminder that Mageia 2 is nearing its end of life.
Approximately one month from now Mageia will stop maintaining and issuing updates for Mageia 2. At that time this bug will be closed as WONTFIX (EOL) if it remains open with a Mageia 'version' of '2'.

Package Maintainer: If you wish for this bug to remain open because you plan to fix it in a currently maintained version, simply change the 'version' to a later Mageia version prior to Mageia 2's end of life.

Bug Reporter: Thank you for reporting this issue and we are sorry that we may not be able to fix it before Mageia 2 is end of life.  If you would still like to see this bug fixed and are able to reproduce it against a later version of Mageia, you are encouraged to click on "Version" and change it against that version of Mageia.

Although we aim to fix as many bugs as possible during every release's lifetime, sometimes those efforts are overtaken by events. Often a more recent Mageia release includes newer upstream software that fixes bugs or makes them obsolete.

-- 
The Mageia Bugsquad
Comment 6 Manuel Hiebel 2013-11-23 16:15:50 CET
Mageia 2 changed to end-of-life (EOL) status on 22 November. Mageia 2 is no
longer maintained, which means that it will not receive any further security or
bug fix updates. As a result we are closing this bug.

If you can reproduce this bug against a currently maintained version of Mageia,
please feel free to click on "Version", change it to that version of Mageia,
and reopen this bug.

Thank you for reporting this bug and we are sorry it could not be fixed.

--
The Mageia Bugsquad

Status: NEW => RESOLVED
Resolution: (none) => OLD

