[ltp] R52 woes

Torsten Wolf linux-thinkpad@linux-thinkpad.org
Mon, 21 Nov 2005 15:44:51 +0100


--liOOAslEiF7prFVr
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

Hi!

Recently, I purchased a Thinkpad R52 (1847-W62) (lspci output attached).
Besides some really nice features and hardware components, a few issues
constantly drive me crazy. Mainly, there are two problems.

* ATI driver
Well, this one is perhaps only a matter of time. However, I have tried
hard to get the recent driver (8.19.10) to work flawlessly. After bootup
I see really nice performance with DRI enabled. 3D rendering works fine,
and even moving and redrawing transparent aterms is much faster than
with the Xorg driver. After suspending to RAM the laptop resumes
successfully with DRI still available. Waking up from suspend to disk
also works, but invoking glxinfo afterwards results in a kernel oops:

[fglrx:create_buffer_queue] *ERROR* PCIe was not acquired
(complete oops attached)

X freezes, but the laptop is still accessible via network and can be
rebooted without problems. I posted this on http://ati.cchtml.com but up
to now got no response. On thinkwiki.org I found several reports about
successful suspend/resume with the latest ATI driver but no information
whether DRI is still working afterwards. So what is your experience on
this topic? While playing with the parameters in hibernate.conf I found
that any usage of vbetool would result in a complete freeze after
resume, so this would be no solution. In the meantime, the Xorg driver
allows for a safe suspend/resume.
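
A minimal sketch of how the kernel log could be checked after each resume
without having to trigger glxinfo by hand first. It assumes the log has
been saved to a text file (for example from dmesg) and that fglrx errors
always follow the "[fglrx:function] *ERROR* message" form seen above; the
function names find_fglrx_errors and has_oops are made up for this
illustration.

```python
import re

# Assumed pattern, modelled on the one line quoted in this mail:
#   [fglrx:create_buffer_queue] *ERROR* PCIe was not acquired
FGLRX_ERROR = re.compile(
    r"\[fglrx:(?P<func>\w+)\]\s+\*ERROR\*\s+(?P<msg>.*)"
)

# Oops header as printed by 2.6 kernels, e.g. "Oops: 0000 [#1]"
OOPS_HEADER = re.compile(r"^Oops: [0-9a-f]{4} \[#\d+\]", re.MULTILINE)


def find_fglrx_errors(log_text):
    """Return (function, message) pairs for every fglrx *ERROR* line."""
    return [
        (m.group("func"), m.group("msg").strip())
        for m in FGLRX_ERROR.finditer(log_text)
    ]


def has_oops(log_text):
    """Return True if the log contains at least one oops header."""
    return OOPS_HEADER.search(log_text) is not None
```

Running both checks on a log captured right after resume, and again after
starting a GL client, would show whether the PCIe error appears at resume
time or only once DRI is first used.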

* System instability
Currently, I stick to 2.6.12, with libata- & sata-pm-patches applied.
With 2.6.14 I had major problems using the dvd-writer. cdrecord listed
the device only once. Afterwards there were scsi timeouts and a garbled
output of cdrecord -scanbus. So the issues I have here might be kernel
related and vanish with a more recent version. However, every now and
then I see freezes at work (using the internal gigabit device with the
tg3 driver) and ugly crashes with lots of oopses and segfaults at home,
where I have a wireless network. The problems at home have (up to now)
vanished after applying the ipw2200 patch from
http://bughost.org/bugzilla/show_bug.cgi?id=821. But at work the laptop 
still hangs from time to time. Today I burned a DVD without errors, but
the box froze with 30% of the next CD-R still to be written. A few days
ago, I needed three tries to finish a backup of the filesystem across
the network. In such situations I cannot log in via the network, but I
can still reboot the laptop using the sysrq keys. I checked the memory
(2x512MB) with
memtest86+ for 12 hours but found no error. With respect to the
displayed temperatures, the laptop does not get that hot (CPU ~53, GPU
~51, HDD ~35/40 acpi/smartctl; all degC). I also considered a hardware
defect (e.g. a weak solder joint), but how would I track this down?
Would IBM replace the mainboard or just the wlan card or the whole
device? What should I check before contacting the support? What causes
me real headaches is that I do not see any regularity in these failures.
I can run numbercrunching apps with the CPU temperature rising up to
72degC, stress the wlan device or copy several GB across the network and
nothing would happen. So any hints what to check next are greatly
appreciated!
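
Since the freezes show no regularity, one way to narrow them down is to
keep a log that survives the hang. The following is only a sketch under
some assumptions: that ibm_acpi exposes the sensors in
/proc/acpi/ibm/thermal as a single "temperatures: ..." line, and that a
reading of -128 marks an unused sensor. The function names and the log
file name are hypothetical. Each sample is flushed and fsync'ed, so the
last line on disk shows the state shortly before the freeze.

```python
import os
import time

# Assumption: path provided by the ibm_acpi module on this machine.
THERMAL_PATH = "/proc/acpi/ibm/thermal"


def parse_thermal(text):
    """Parse 'temperatures: 53 51 35 -128' into [53, 51, 35, None].

    -128 is assumed to mean "no sensor" and is mapped to None so that
    the remaining values keep their sensor positions.
    """
    label, _, values = text.partition(":")
    if label.strip() != "temperatures":
        raise ValueError("unexpected thermal line: %r" % text)
    readings = [int(v) for v in values.split()]
    return [None if r == -128 else r for r in readings]


def format_sample(timestamp, temps):
    """Build one log line: local time followed by the readings."""
    stamp = time.strftime("%Y-%m-%d %H:%M:%S", time.localtime(timestamp))
    cells = ["-" if t is None else str(t) for t in temps]
    return stamp + " " + " ".join(cells) + "\n"


def append_sample(log_path, line):
    """Append a line and force it to disk before returning."""
    with open(log_path, "a") as log:
        log.write(line)
        log.flush()
        os.fsync(log.fileno())


def log_forever(log_path="thermal.log", interval=10):
    """Sample the sensors every `interval` seconds until interrupted."""
    while True:
        with open(THERMAL_PATH) as sensors:
            temps = parse_thermal(sensors.read())
        append_sample(log_path, format_sample(time.time(), temps))
        time.sleep(interval)
```

Calling log_forever() in a background shell before burning a disc or
starting a network backup, and reading thermal.log after the forced
reboot, would at least rule temperature in or out as a trigger.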

Thanks for reading so far. Best wishes,
Torsten

--liOOAslEiF7prFVr
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="r52_lspci.txt"

0000:00:00.0 Host bridge: Intel Corporation Mobile 915GM/PM/GMS/910GML Express Processor to DRAM Controller (rev 03)
	Subsystem: IBM: Unknown device 0575
	Flags: bus master, fast devsel, latency 0
	Capabilities: <available only to root>

0000:00:01.0 PCI bridge: Intel Corporation Mobile 915GM/PM Express PCI Express Root Port (rev 03) (prog-if 00 [Normal decode])
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
	I/O behind bridge: 00003000-00003fff
	Memory behind bridge: a8100000-a81fffff
	Prefetchable memory behind bridge: c0000000-c7ffffff
	Capabilities: <available only to root>

0000:00:1c.0 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 1 (rev 03) (prog-if 00 [Normal decode])
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=02, subordinate=02, sec-latency=0
	Memory behind bridge: a8200000-a82fffff
	Capabilities: <available only to root>

0000:00:1c.2 PCI bridge: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) PCI Express Port 3 (rev 03) (prog-if 00 [Normal decode])
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=03, subordinate=03, sec-latency=0
	I/O behind bridge: 00004000-00004fff
	Memory behind bridge: a8300000-a83fffff
	Prefetchable memory behind bridge: 00000000c8000000-00000000c8000000
	Capabilities: <available only to root>

0000:00:1d.0 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #1 (rev 03) (prog-if 00 [UHCI])
	Subsystem: IBM: Unknown device 0565
	Flags: bus master, medium devsel, latency 0, IRQ 169
	I/O ports at 1800 [size=32]

0000:00:1d.1 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #2 (rev 03) (prog-if 00 [UHCI])
	Subsystem: IBM: Unknown device 0565
	Flags: bus master, medium devsel, latency 0, IRQ 50
	I/O ports at 1820 [size=32]

0000:00:1d.2 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #3 (rev 03) (prog-if 00 [UHCI])
	Subsystem: IBM: Unknown device 0565
	Flags: bus master, medium devsel, latency 0, IRQ 58
	I/O ports at 1840 [size=32]

0000:00:1d.3 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB UHCI #4 (rev 03) (prog-if 00 [UHCI])
	Subsystem: IBM: Unknown device 0565
	Flags: bus master, medium devsel, latency 0, IRQ 233
	I/O ports at 1860 [size=32]

0000:00:1d.7 USB Controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) USB2 EHCI Controller (rev 03) (prog-if 20 [EHCI])
	Subsystem: IBM: Unknown device 0566
	Flags: bus master, medium devsel, latency 0, IRQ 233
	Memory at a8000000 (32-bit, non-prefetchable) [size=1K]
	Capabilities: <available only to root>

0000:00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev d3) (prog-if 01 [Subtractive decode])
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=04, subordinate=07, sec-latency=64
	I/O behind bridge: 00005000-00008fff
	Memory behind bridge: a8400000-b7ffffff
	Prefetchable memory behind bridge: 00000000d0000000-00000000d7f00000
	Capabilities: <available only to root>

0000:00:1e.2 Multimedia audio controller: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) AC'97 Audio Controller (rev 03)
	Subsystem: IBM: Unknown device 0567
	Flags: bus master, medium devsel, latency 0, IRQ 209
	I/O ports at 1c00 [size=256]
	I/O ports at 1880 [size=64]
	Memory at a8000800 (32-bit, non-prefetchable) [size=512]
	Memory at a8000400 (32-bit, non-prefetchable) [size=256]
	Capabilities: <available only to root>

0000:00:1e.3 Modem: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) AC'97 Modem Controller (rev 03) (prog-if 00 [Generic])
	Subsystem: IBM: Unknown device 0574
	Flags: medium devsel, IRQ 225
	I/O ports at 2400 [size=256]
	I/O ports at 2000 [size=128]
	Capabilities: <available only to root>

0000:00:1f.0 ISA bridge: Intel Corporation 82801FBM (ICH6M) LPC Interface Bridge (rev 03)
	Subsystem: IBM: Unknown device 0568
	Flags: bus master, medium devsel, latency 0

0000:00:1f.2 IDE interface: Intel Corporation 82801FBM (ICH6M) SATA Controller (rev 03) (prog-if 80 [Master])
	Subsystem: IBM: Unknown device 056a
	Flags: bus master, 66MHz, medium devsel, latency 0
	I/O ports at <unassigned>
	I/O ports at <unassigned>
	I/O ports at <unassigned>
	I/O ports at <unassigned>
	I/O ports at 18c0 [size=16]
	Capabilities: <available only to root>

0000:00:1f.3 SMBus: Intel Corporation 82801FB/FBM/FR/FW/FRW (ICH6 Family) SMBus Controller (rev 03)
	Subsystem: IBM: Unknown device 056b
	Flags: medium devsel, IRQ 11
	I/O ports at 18e0 [size=32]

0000:01:00.0 VGA compatible controller: ATI Technologies Inc M22 [Radeon Mobility M300] (prog-if 00 [VGA])
	Subsystem: IBM: Unknown device 056e
	Flags: bus master, fast devsel, latency 0, IRQ 11
	Memory at c0000000 (32-bit, prefetchable) [size=128M]
	I/O ports at 3000 [size=256]
	Memory at a8100000 (32-bit, non-prefetchable) [size=64K]
	Capabilities: <available only to root>

0000:02:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5751M Gigabit Ethernet PCI Express (rev 11)
	Subsystem: IBM: Unknown device 0577
	Flags: bus master, fast devsel, latency 0, IRQ 169
	Memory at a8200000 (64-bit, non-prefetchable) [size=64K]
	Capabilities: <available only to root>

0000:04:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev b3)
	Subsystem: IBM: Unknown device 0532
	Flags: bus master, medium devsel, latency 168, IRQ 169
	Memory at a8400000 (32-bit, non-prefetchable) [size=4K]
	Bus: primary=04, secondary=05, subordinate=08, sec-latency=176
	Memory window 0: 40000000-403ff000 (prefetchable)
	Memory window 1: 40400000-407ff000
	I/O window 0: 00005000-000050ff
	I/O window 1: 00005400-000054ff
	16-bit legacy interface ports at 0001

0000:04:00.1 FireWire (IEEE 1394): Ricoh Co Ltd R5C552 IEEE 1394 Controller (rev 08) (prog-if 10 [OHCI])
	Subsystem: IBM: Unknown device 01cf
	Flags: bus master, medium devsel, latency 64, IRQ 50
	Memory at b1000000 (32-bit, non-prefetchable) [size=2K]
	Capabilities: <available only to root>

0000:04:02.0 Network controller: Intel Corporation PRO/Wireless 2915ABG MiniPCI Adapter (rev 05)
	Subsystem: Intel Corporation: Unknown device 1011
	Flags: bus master, medium devsel, latency 64, IRQ 66
	Memory at a8401000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <available only to root>


--liOOAslEiF7prFVr
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="fglrx_oops.txt"

[fglrx:create_buffer_queue] *ERROR* PCIe was not acquired
Unable to handle kernel NULL pointer dereference at virtual address 00000008
 printing eip:
f978984b
*pde = 00000000
Oops: 0000 [#1]
PREEMPT 
Modules linked in: nsc_ircc ipw2200 ieee80211 ieee80211_crypt ehci_hcd uhci_hcd
psmouse fglrx binfmt_misc rfcomm l2cap lp autofs4 pcmcia nls_cp850 vfat fat
ircomm_tty ircomm ibm_acpi smapi rtcmosram thinkpad nvram acpi_cpufreq eth1394
irtty_sir hci_usb sir_dev bluetooth snd_intel8x0 snd_ac97_codec usbhid snd_pcm
snd_timer snd irda soundcore tg3 firmware_class ohci1394 ieee1394 yenta_socket
rsrc_nonstatic pcmcia_core crc_ccitt snd_page_alloc parport_pc parport i2c_i801
usbcore pcspkr intel_agp agpgart floppy rtc sg sr_mod joydev evdev
CPU:    0
EIP:    0060:[<f978984b>]    Tainted: PF     VLI
EFLAGS: 00010286   (2.6.12) 
EIP is at firegl_get_pciemem_from_addr+0x4b/0xc0 [fglrx]
eax: f7028390   ebx: f52bfb74   ecx: 00000000   edx: 00000000
esi: f97e01e0   edi: 00000000   ebp: f52bfb74   esp: f526fe38
ds: 007b   es: 007b   ss: 0068
Process glxinfo (pid: 9290, threadinfo=f526e000 task=f52a6a20)
Stack: f97e01e0 10000000 00000001 f97897af f52bfb74 b3208000 f52bfb74 f9785188 
       f52bfb74 b3208000 f526e000 f526e000 00000000 00000000 c014f65a f52bfb74 
       b3208000 f526feac f5c4f080 00000001 081e0000 c01dc066 00000000 f687d600 
Call Trace:
 [<f97897af>] firegl_get_dev_from_vm+0xf/0x20 [fglrx]
 [<f9785188>] vm_pcie_nopage+0x38/0x120 [fglrx]
 [<c014f65a>] do_no_page+0x8a/0x400
 [<c01dc066>] prio_tree_insert+0x156/0x1c0
 [<c014d4de>] pte_alloc_map+0x3e/0xf0
 [<c014fcaa>] handle_mm_fault+0x1aa/0x200
 [<c0115892>] do_page_fault+0x2a2/0x6be
 [<c0173c3a>] do_ioctl+0x9a/0xa0
 [<c0173e0b>] vfs_ioctl+0x6b/0x230
 [<c01155f0>] do_page_fault+0x0/0x6be
 [<c0103d4b>] error_code+0x4f/0x54
Code: 5c a4 ff ff 89 1c 24 e8 74 ba ff ff 89 44 24 04 b9 01 00 00 00 89 4c 24 08
89 34 24 e8 5f 19 01 00 85 c0 74 51 8b 8e 80 17 00 00 <8b> 59 
08 85 db 74 2c 8b 08 8d b6 00 00 00 00 8d bf 00 00 00 00 

This is what fglrx logs during startup:
fglrx: module license 'Proprietary. (C) 2002 - ATI Technologies, Starnberg,
GERMANY' taints kernel.
[fglrx] Maximum main memory to use for locked dma buffers: 928 MBytes.
ACPI: PCI Interrupt 0000:01:00.0[A] -> GSI 16 (level, low) -> IRQ 16
[fglrx] module loaded - fglrx 8.19.10 [Nov  9 2005] on minor 0
[fglrx] ACPI power management is initialized.
[fglrx] free  PCIe = 54804480
[fglrx] max   PCIe = 54804480
[fglrx] free  LFB = 47099904
[fglrx] max   LFB = 47099904
[fglrx] free  Inv = 0
[fglrx] max   Inv = 0
[fglrx] total Inv = 0
[fglrx] total TIM = 0
[fglrx] total FB  = 0
[fglrx] total PCIe = 16384

--liOOAslEiF7prFVr--