[ltp] Strange intermittent problem with T23, RH 8.0--best way to sleep?

Richard Neill linux-thinkpad@linux-thinkpad.org
Sun, 16 Mar 2003 03:36:40 +0000


julian underwood wrote:
>>My investigations so far lead to:
>>
>>1)The problem is that, somehow, the HDD doesn't wake up. On unsuccessful 
>>wakes, you don't hear the start up 'kerchunk', and the system blocks on 
>>HDD i/o. The consequences are:
> 
> 
> Yes, I forgot to mention that the HDD activity light is solid when this
> happens.  Although it doesn't sound like there is any activity.  The
> drive is of course spinning however.

Odd. So, your HDD is responding? (I'm assuming that the HDD light is 
controlled in hardware, not by any software, so can't be lying).

> 
> I've noticed for some reason, if I put my unit to sleep while running
> the latest version of VMWare, it seems to happen _much_ more
> frequently--about 1 out of 3 times.  Which is weird.
> 
> 
>>	*Programs still in RAM appear to work - that's why X seems
>>	 partly normal at first.
>>
>>	*Debugging is impossible - something like
>>          "apm -s >apmerror.txt 2>&1" never works, because if there is an
>>	error, the system can't write to disk.
>>
>>	*You can do "Alt-SysRq-B" to reboot. But none of the other
>>	sysrq things work properly (it may try to sync, but can't)
>>	
>>2)A good test is this script
>>
>>	#!/bin/bash
>>
>>	for i in `seq 1 100` ; do
>>         	echo -e "This is attempt $i"
>>		touch foo	#timestamp of last successful write
>>         	sleep 1
>>         	apm -s
>>	done
> 
> 
> Not sure I fully understand what that does, but I'll

Just sleeps 100 times, printing a log to the screen, and attempting to 
write the HDD. With a fail rate of about 1 in 10 to 1 in 20, I'd expect 
to need 100 successes to be statistically convinced the apm problem is 
fixed.

> 
> 
>>You'll have to keep pressing 'Fn' every 20 secs. But I would be very 
>>surprised if the script gets to the end.
>>
>>
>>3)The only fix I can imagine is to add an SysRq key combination for 
>>"wake the hard disk, yes really". But I don't know how to do that.
>>(I tried to contact the sysrq maintainers, their email is bouncing - I 
>>think that code has been abandoned!)
>>I think it would have to be done in the kernel, since there's no 
>>filesystem to work with if the hdd isn't alive.
>>
>>4)Does anyone know how to do this? Could it actually just be a config 
>>error? Is it a BIOS bug after all? Would the use of ACPI (supposedly 
>>planned for Mandrake 9.1) be better  ?
>>
>>
>>Best wishes
>>
>>Richard
>>
>>
> 
>