Bug 23420

Summary: Plasma and XFCE live install unable to complete because of missing text and data entry fields
Product: Mageia Reporter: Ben McMonagle <westel>
Component: RPM PackagesAssignee: Kernel and Drivers maintainers <kernel>
Status: RESOLVED FIXED QA Contact:
Severity: normal    
Priority: Normal CC: mageia, marja11, tmb
Version: 6Keywords: 6.1
Target Milestone: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Source RPM: CVE:
Status comment:
Attachments: lspcidrake report
missing text during install
root and user data entry
lspcidrake(2) report from installed "Live " system
boot.log
install log
journal from the live xfce system (the real live one)
Xorg.0.log
Journal after X restart resulted in LightDM text corruption
Xorg.0.log after X restart resulted in LightDM text corruption

Description Ben McMonagle 2018-08-10 09:29:13 CEST
Description of problem:as per summary. text also not accepted into data fields.


Version-Release number of selected component (if applicable):


How reproducible:hardware dependent?


Steps to Reproduce:
1.burn USB or DVD Plasma live .iso and reboot
2.attempt to complete install but unable to proceed due to missing text / data fields 
3.
Comment 1 Ben McMonagle 2018-08-10 09:29:58 CEST
Created attachment 10306 [details]
lspcidrake report
Comment 2 Ben McMonagle 2018-08-10 09:36:58 CEST
Created attachment 10307 [details]
missing text during install
Comment 3 Ben McMonagle 2018-08-10 09:38:57 CEST
Created attachment 10308 [details]
root and user data entry

cannot proceed from here
Ben McMonagle 2018-08-10 09:39:15 CEST

Keywords: (none) => 6.1

Comment 4 Ben McMonagle 2018-08-10 09:47:16 CEST
Created attachment 10309 [details]
lspcidrake(2) report from installed "Live " system

lspcidrake report is from a different system partition
lspcidrake(2) report is from the problem install
Comment 5 Marja Van Waes 2018-08-10 10:53:47 CEST
(In reply to ben mcmonagle from comment #4)
> Created attachment 10309 [details]
> lspcidrake(2) report from installed "Live " system
> 
> lspcidrake report is from a different system partition
> lspcidrake(2) report is from the problem install

Both have:

Card:Intel 810 and later: Intel Corporation|Xeon E3-1200 v2/3rd Gen Core processor Graphics Controller [DISPLAY_VGA] (vendor:8086 device:0152 subv:1043 subd:84ca) (rev: 09)(In reply to ben mcmonagle from comment #3)
> Created attachment 10308 [details]
> root and user data entry
> 
> cannot proceed from here

But you can cancel and (hopefully) login as live user and then become root, can't you?

As root, please run 

  journalctl -ab > boot.log.txt

and also

  journalctl -ab1 > install.log.txt

@ isobuilders

Please reassign if needed :-)

Please attach both (first compress with xz if needed)

Keywords: (none) => NEEDINFO
Assignee: bugsquad => isobuild
CC: (none) => marja11

Ben McMonagle 2018-08-11 04:37:52 CEST

Summary: UEFI Plasma live install unable to complete because of missing text and data entry fields => UEFI Plasma and XFCE live install unable to complete because of missing text and data entry fields

Comment 6 Ben McMonagle 2018-08-11 04:44:05 CEST
more info:

desktop refresh is very slow, some elements take several minutes to re-draw. mouse is ok.

if the live session is corrupted, changing the driver via drakx11 to vesa, and logout presents a "sorry, something wrong with your graphics settings, press any key. 
tty2 to drakx11, reset intel810 driver, test -ok, exit.
tty1 now has normal greeter, desktop is now perfect.

so, suspect something amiss with intel810 driver when in UEFI as this problem does not occur in legacy boot or csm boot.
Comment 7 Martin Whitaker 2018-08-11 13:08:42 CEST
Ben, if you can again boot to a session where the screen corruption is occurring, can you run (as root) 'journalctl -ab > journal.log' and attach the resulting journal.log file and also any Xorg.*.log files you can find in /var/log.

CC: (none) => mageia

Comment 8 Ben McMonagle 2018-08-12 08:44:05 CEST
Created attachment 10312 [details]
boot.log
Comment 9 Ben McMonagle 2018-08-12 08:47:13 CEST
Created attachment 10313 [details]
install log
Comment 10 Martin Whitaker 2018-08-12 10:49:17 CEST
Thanks Ben. Sadly I can see no hint of anything wrong in those logs.

Do you still have a Xfce install that is showing the display corruption? If so, can you see if any error messages appear in ~/.xsession-errors or in /var/log/Xorg.0.log when you run Mageia Welcome.
Comment 11 Ben McMonagle 2018-08-13 10:20:14 CEST
Created attachment 10314 [details]
journal from the live xfce system (the real live one)
Comment 12 Ben McMonagle 2018-08-13 10:36:12 CEST
Created attachment 10315 [details]
Xorg.0.log
Comment 13 Ben McMonagle 2018-08-13 10:39:39 CEST
(In reply to Martin Whitaker from comment #10)
> Thanks Ben. Sadly I can see no hint of anything wrong in those logs.
ok,

  any error messages appear in ~/.xsession-errors 

file does not appear to exist :(
Comment 14 Martin Whitaker 2018-08-14 21:35:46 CEST
(In reply to ben mcmonagle from comment #13)
> (In reply to Martin Whitaker from comment #10)
>   any error messages appear in ~/.xsession-errors 
> 
> file does not appear to exist :(

That's odd, it should be there (unless you had switched to root user, in which case ~ would be pointing to /root, not /home/live).

There's nothing untoward in the latest logs. What baffles me is that the live install is just a standard Mageia 6 system - so why don't you also see this problem on your other Mageia 6 systems on that machine.

At the moment I'm out of ideas :-(
Comment 15 Ben McMonagle 2018-08-14 21:50:09 CEST
I will try a UEFI install on my Toshiba R930, uses the intel-810 driver too.
it advises that UEFI support is "experimental" though.

hopefully, that will help to point to a hardware  or driver issue if it works.
Comment 16 Martin Whitaker 2018-08-14 22:04:11 CEST
I have tested it repeatedly on my laptop, which has the same generation of Intel GPU as yours. So it's definitely hardware specific. The fact that you only see it with UEFI boot points at the BIOS.
Comment 17 Ben McMonagle 2018-08-15 09:09:41 CEST
tested both xfce and plasma in UEFI live mode.
no display issues, so may just try a different monitor on the problem system.
running out of ideas here
Ben McMonagle 2018-08-17 23:34:19 CEST

Summary: UEFI Plasma and XFCE live install unable to complete because of missing text and data entry fields => Plasma and XFCE live install unable to complete because of missing text and data entry fields

Comment 18 Ben McMonagle 2018-08-17 23:36:40 CEST
Xfce i586 exhibit this issue on Toshiba R930 i5 upon login to install system.
(missing text and icons in desktop)

logout and login corrects this
Comment 19 Martin Whitaker 2018-08-18 00:30:33 CEST
You'll be pleased to know you are no longer alone Ben. I've now reproduced this on my laptop with the 64-bit Live Xfce and legacy boot. I saw it on the initial dialogues when booting to the live desktop. The problem cleared when Xfce started, but I have found it will come and go if you restart the X server (using Ctrl-Alt-Backspace).

Keywords: NEEDINFO => (none)

Comment 20 Martin Whitaker 2018-08-18 00:48:45 CEST
Same behaviour with UEFI boot. But no problems on my desktop machine with Radeon graphics.
Comment 21 Ben McMonagle 2018-08-18 06:37:25 CEST
so, intel it seems is the common factor?

2 different display managers, different desktops.
Comment 22 Martin Whitaker 2018-08-18 09:46:29 CEST
Hard to be 100% sure, because it's so variable. I'd tested all the ISOs on my laptop before releasing them, and retested many times after you reported this bug, without seeing it. Last night it was occurring fairly often - every 3 or 4 times I restarted X. This morning it's taken maybe 50 restarts before it showed up.

I'll try rebuilding the ISO to pick up the latest kernel/drivers - see if that makes a difference.
Comment 23 Martin Whitaker 2018-08-18 16:26:58 CEST
I have now demonstrated that this fault also occurs in my main Mageia 6 system on my laptop. That is Cinnamon DE, installed from the 6-sta2 classical installer ISO, and fully updated.

Changing the bootloader to GRUB2 (EFI) and booting to the LightDM greeter, then repeatedly hitting Ctrl-Alt-Backspace twice to restart the X server, 10-20% of the restarts result in the same rendering faults in the greeter as Ben has observed in the finish-install screens. Moving the cursor will cause parts of the screen to be redrawn, and other parts to be lost. Logging in when this corruption occurs, the same problems can be seen in the desktop and in Mageia Welcome.

Further tests show:

 - if I change the commands in the GRUB2 menu from linuxefi/initrdefi to linux/initrd, the fault never occurs (but see below)

 - if I revert to my normal boot method (rEFInd + kernel EFI stub loader), the fault never occurs

 - if I revert to the 4.9.56 kernel, the fault never occurs (all 4.14 series kernels I tried, back to 4.14.10 did show the fault)

 - the 4.14.62-linus kernel does exhibit the fault

 - if I change to CSM/legacy boot (still with GRUB2 and the linux/initrd commands), the fault never occurs

 - no difference in behaviour was observed when running on battery power

However, tests with the 6.1-rc1 64-bit Xfce Live ISO showed:

 - the fault occasionally but rarely occurs on EFI boot (even though the Live ISO boot menu uses the linux/initrd commands)

 - the fault occasionally but rarely occurs on legacy (isolinux) boot

 - running on battery power caused both of the above boot methods to exhibit the fault very frequently

The laptop has a i7-3630QM CPU, with hybrid graphics:

Card:Intel 810 and later: Intel Corporation|3rd Gen Core processor Graphics Controller [DISPLAY_VGA] (rev: 09)
Card:NVIDIA GeForce 420 series and later: NVIDIA Corporation|GF108M [GeForce GT 635M] [DISPLAY_VGA] (rev: a1)

It uses the intel card as the primary display.

Tests on my desktop machine (i5-3550 CPU, Radeon HD 7850 graphics) have never shown this fault.

Assignee: isobuild => kernel
Source RPM: draklive-install => (none)

Comment 24 Martin Whitaker 2018-08-18 19:32:10 CEST
Forgot to mention I also tested with the two available older versions of grub2. They made no difference.
Comment 25 Ben McMonagle 2018-08-19 00:18:52 CEST
(Have to admit I do not remember testing the M-6 series of Lives, just the Classicals, due to time restraints)
Comment 26 Martin Whitaker 2018-08-20 21:53:26 CEST
Just to confirm, I built a version of the 64-bit Xfce Live ISO with kernel 4.9.56, but all other packages up to date. That did not show the fault in 100 restarts of the X server. Rebuilt with the latest 4.14.65 kernel, and the fault is back.
Comment 27 Ben McMonagle 2018-08-20 22:04:01 CEST
thanks for your hard work Martin.

If you are able to put that .iso onto the testing server  -under the "testing" directory, I could check it tonight on my desktop unit.
It is the one that all this started with, it seems to be the one that is most sensitive to the issue.

The Toshiba exhibited it once, the desktop, pretty much every time I tried an .iso
Comment 28 Ben McMonagle 2018-08-20 22:08:25 CEST
I also should probably use the test kernel with this system as well from "updates - testing".
Comment 29 Martin Whitaker 2018-08-20 23:46:43 CEST
OK, I've put a build with the 4.9.56 kernel in the

  testing/Mageia-6.1-rc-LiveDVD-Xfce-x86_64-DVD/

directory. This is of course not a viable fix - we can't go back to an old kernel - but if it also works for you, we will at least be sure we're both looking at the same bug.
Comment 30 Ben McMonagle 2018-08-21 10:12:32 CEST
sort of good news.

4.14.56-1 kernel Xfce UEFI install- problematic.
4.9.56    kernel Xfce Uefi install - ok.

have only tested each install 1 x each though.

If I have the time, I will try again tomorrow

will be trying the 4.14.56-1 kernel on this system as I have not yet updated an Mga6 on this machine lately.
Comment 31 Ben McMonagle 2018-08-21 10:59:07 CEST
ok,

so I added 4.9.56 kernel to the 4.14.56-1 system.

ctrl + alt + backspace with .14.56 kernel is definitely problematic
ctrl + alt + backspace with .9.56 kernel is definitely not problematic.
Comment 32 Thomas Backlund 2018-08-21 11:56:25 CEST
(In reply to ben mcmonagle from comment #31)
> ok,
> 
> so I added 4.9.56 kernel to the 4.14.56-1 system.
> 
> ctrl + alt + backspace with .14.56 kernel is definitely problematic
> ctrl + alt + backspace with .9.56 kernel is definitely not problematic.

Can you grab the journal logs and the Xorg log after the restart of xorg ?

Also what happends if you configure your hw for uxa acceleration:

Create a file: /etc/X11/xorg.conf.d/20-intel.conf

With the contents:

Section "Device"
    Identifier  "Intel Graphics"
    Driver      "intel"
    Option      "AccelMethod"  "uxa"
EndSection

and reboot…

Does it work then ?

CC: (none) => tmb

Comment 33 Martin Whitaker 2018-08-21 21:10:00 CEST
Created attachment 10326 [details]
Journal after X restart resulted in LightDM text corruption

The fourth restart triggered the fault. I then logged in on a tty and captured the journal and the Xorg.0.log
Comment 34 Martin Whitaker 2018-08-21 21:11:52 CEST
Created attachment 10327 [details]
Xorg.0.log after X restart resulted in LightDM text corruption

I compared this to a log from a restart that didn't result in text corruption and, apart from the timestamps, they were identical.
Comment 35 Martin Whitaker 2018-08-21 21:17:56 CEST
Configuring for uxa acceleration works round the bug, as does configuring for blt acceleration (I added these to the existing Device section in /etc/X11/xorg.conf).

The other suggestion I found when searching for similar bug reports, adding COGL_ATLAS_DEFAULT_BLIT_MODE=framebuffer to /etc/environment, had no effect.
Comment 36 Ben McMonagle 2018-08-22 07:55:14 CEST
(In reply to Thomas Backlund from comment #32)
Can you grab the journal logs and the Xorg log after the restart of xorg ?
> 
> Also what happens if you configure your hw for uxa acceleration:
> 
> Create a file: /etc/X11/xorg.conf.d/20-intel.conf
> 
> With the contents:
> 
> Section "Device"
>     Identifier  "Intel Graphics"
>     Driver      "intel"
>     Option      "AccelMethod"  "uxa"
> EndSection
> 
> and reboot…
> 
> Does it work then ?

before above configuration, ctrl + alt + backspace on kernel 4.14.56 results in text issues 8/10 times.
after above configuration, ctrl + alt + backspace on kernel 4.14.56 results in text issues 0/10 times.

looks like an acceptable workaround.

now, how to incorporate into the Lives?
Comment 37 Thomas Backlund 2018-08-22 08:21:37 CEST
(In reply to ben mcmonagle from comment #36)

> 
> looks like an acceptable workaround.
> 
> now, how to incorporate into the Lives?

That's the problem... 
there is no easy way as "sna" acceleration is default and works for most and is _needed_ for some…

So either way, it will break for some…


One other thing to test...


What happends if you boot with "xdriver=modesetting" ?
Comment 38 Ben McMonagle 2018-08-22 08:58:39 CEST
I commented out the file: /etc/X11/xorg.conf.d/20-intel.conf.

added to the kernel boot line "xdriver=modesetting".

unfortunately, at greeter login, the problem returns -missing text and data entry boxes
Comment 39 Martin Whitaker 2018-08-24 23:37:51 CEST
This may not help much, but I have isolated the significant difference between the various boot methods that exposes this bug on my machine. When I boot my installed system using GRUB2/linuxefi, or boot the live system using GRUB2/linux, the system allocates a 1024x768 EFI framebuffer, which exposes the bug. When I boot my installed system using GRUB2/linux or using rEFInd + kernel EFI stub loader, the system allocates a 1920x1080 EFI framebuffer (the native resolution of the display), and the bug never occurs.

To prove this was the significant difference, I configured rEFInd to set the display resolution to 1024x768, and that immediately exposed the bug.
Comment 40 Martin Whitaker 2018-08-25 00:50:37 CEST
Ben, on your installed system with the 4.14.56 kernel, if you type

  dmesg | grep efifb

what screen resolution does it report? If you reboot and edit the grub2 menu entry, changing the word linuxefi to linux and initrdefi to initrd, does it then report a different resolution? If so, does that help with this bug?
Comment 41 Ben McMonagle 2018-08-25 02:58:10 CEST
uname -r
4.14.62-desktop-2.mga6

dmesg | grep efifb
[    0.143419] pci 0000:00:02.0: BAR 2: assigned to efifb
[    0.381248] efifb: probing for efifb
[    0.381262] efifb: framebuffer at 0xe0000000, using 3072k, total 3072k
[    0.381263] efifb: mode is 1024x768x32, linelength=4096, pages=1
[    0.381263] efifb: scrolling: redraw
[    0.381264] efifb: Truecolor: size=8:8:8:8, shift=24:16:8:0


After edit the grub2 menu entry, changing the word linuxefi to linux and initrdefi to initrd

dmesg | grep efifb
[    0.143408] pci 0000:00:02.0: BAR 2: assigned to efifb
[    0.379279] efifb: probing for efifb
[    0.379291] efifb: framebuffer at 0xe0000000, using 1920k, total 1920k
[    0.379291] efifb: mode is 800x600x32, linelength=3200, pages=1
[    0.379292] efifb: scrolling: redraw
[    0.379293] efifb: Truecolor: size=8:8:8:8, shift=24:16:8:0

Does that help with this bug?

doesnt seem to.
8/9 attempts at login after ctrl-alt-backspace produce the problem.

will go looking at trying various previous kernels to see which ones are ok.
spectre has a lot to answer for
Comment 42 Ben McMonagle 2018-08-25 06:19:50 CEST
added all available kernels to the problem system:

uname -r
4.9.56-desktop-1.mga6
ctrl-alt-backspace 10/10 correct desktop

uname -r
4.14.10-desktop-1.mga6
ctrl-alt-backspace 10/10 correct desktop

uname -r
4.14.13-desktop-1.mga6
ctrl-alt-backspace 3/13 correct desktop

uname -r
4.14.16-desktop-1.mga6
ctrl-alt-backspace 5/10 correct desktop

uname -r
4.14.18-desktop-1.mga6
ctrl-alt-backspace 4/12 correct desktop

uname -r
4.14.20-desktop-1.mga6
ctrl-alt-backspace 5/10 correct desktop

uname -r
4.14.25-desktop-1.mga6
ctrl-alt-backspace 4/10 correct desktop

uname -r
4.14.30-desktop-3.mga6
ctrl-alt-backspace 5/12 correct desktop

uname -r
4.14.50-desktop-2.mga6
ctrl-alt-backspace 5/10 correct desktop

uname -r
4.14.44-desktop-2.mga6
ctrl-alt-backspace 5/10 correct desktop


uname -r
4.14.40-desktop-1.mga6
ctrl-alt-backspace 6/10 correct desktop
Comment 43 Martin Whitaker 2018-08-25 13:29:04 CEST
(In reply to ben mcmonagle from comment #41)

Thanks Ben. Seems like the display resolution thing is just a quirk of my hardware, and no good as a general workaround for this bug.

(In reply to ben mcmonagle from comment #42)

I saw this bug with kernel 4.14.10, so it might be you just got lucky on that test.

(In reply to Thomas Backlund from comment #37)
> One other thing to test...
> What happends if you boot with "xdriver=modesetting" ?

That only works when booting the live system; on an installed system service_harddrake sees the hardware hasn't changed and skips the video drive configuration step. Manually changing the Driver line in xorg.conf from "intel" to "modesetting" fixes this bug for me.

Unless Thomas has any more ideas, I guess this will just have to go in the errata.

BTW, I did some performance tests using gtkperf. On my machine the "blt" acceleration method is slightly faster than "sna". "uxa", and the modesetting driver, are significantly slower. 3D performance (tested using glxgears and the mesa "teapot" demo) is the same for all.
Comment 44 Ben McMonagle 2019-02-04 23:27:49 CET
last comment 6 month ago, no new comments from anybody so closing

Status: NEW => RESOLVED
Resolution: (none) => FIXED