HDD DMA error and system hangs - FC3.
Shaffer Paul
pshaffer at spaceimaging.com
Sat Nov 27 18:14:16 UTC 2004
VJ, this is not the first time similar inquiries have appeared here, and
I encountered the same problem with a completely different hardware
setup. I saw this happening on a SCSI raid w/caching controller. I
discovered a work around by disabling cache writes to filesystem in the
mount. Some of the newer ide/sata drives are coming with large caches
(~8meg), so it makes me wonder about a possible correlation there.
Maybe completely off target - just FYI...
Paul
________________________________
From: fedora-list-bounces at redhat.com
[mailto:fedora-list-bounces at redhat.com] On Behalf Of VJ
Sent: Saturday, November 27, 2004 7:35 AM
To: For users of Fedora Core releases
Subject: Re: HDD DMA error and system hangs - FC3.
----- Original Message -----
From: VJ <mailto:vj at vijaygill.homelinux.net>
To: Fedora List <mailto:fedora-list at redhat.com>
Sent: Saturday, November 27, 2004 2:33 PM
Subject: HDD DMA error and system hangs - FC3.
Hi,
My PC suffers from HDD dma problem almost everyday.
Motherboard is Gigabyte GA7DXR, HDD is Seagate ST3160023A.
Output of lspci is:
00:00.0 Host bridge: Advanced Micro Devices [AMD] AMD-760
[IGD4-1P] System Controller (rev 13)
00:01.0 PCI bridge: Advanced Micro Devices [AMD] AMD-760
[IGD4-1P] AGP Bridge
00:07.0 ISA bridge: VIA Technologies, Inc. VT82C686 [Apollo
Super South] (rev 40)
00:07.1 IDE interface: VIA Technologies, Inc.
VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus Master IDE (rev 06)
00:07.4 SMBus: VIA Technologies, Inc. VT82C686 [Apollo Super
ACPI] (rev 40)
00:09.0 Multimedia video controller: Internext Compression Inc
iTVC15 MPEG-2 Encoder (rev 01)
00:0d.0 PCI bridge: Digital Equipment Corporation DECchip 21152
(rev 03)
00:0e.0 Multimedia audio controller: Ensoniq 5880 AudioPCI (rev
02)
00:10.0 Unknown mass storage controller: Promise Technology,
Inc. PDC20265 (FastTrak100 Lite/Ultra100) (rev 02)
01:05.0 VGA compatible controller: ATI Technologies Inc Rage 128
RF/SG AGP
02:04.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro
100] (rev 05)
02:05.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro
100] (rev 05)
This is a portion of my logwatch mail message (i can provied the
/var/log/messages as well if needed).
Buffer I/O error on device hdh4, ...: 10 Time(s)
RPC: error 5 connecting to ...: 1 Time(s)
end_request: I/O error, dev hdh, sector...: 468 Time(s)
hdh: DMA timeout error...: 2 Time(s)
hdh: dma timeout error: status=0x00 { }...: 1 Time(s)
hdh: dma timeout error: status=0xd0 { B...: 1 Time(s)
hdh: read_intr: error=0x04 { DriveStat...: 4 Time(s)
hdh: read_intr: status=0x51 { DriveReady SeekComplete Error
}...: 4 Time(s)
ide3: reset: master: error (0x00?)...: 2 Time(s)
lost page write due to I/O error on hdh4...: 10 Time(s)
And a lot of following messages
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071628
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071636
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071644
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071652
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071660
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071668
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071676
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071684
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071692
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071700
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071708
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071716
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071724
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071732
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071740
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071748
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071756
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071764
I am running kernel "2.6.9-1.681_FC3" with "noapic nolapic
acpi=off".
I ran Segate's own Seagate Tools for surface scan - No problem
found.
I ran smartctl -t long /dev/hdh for SMART extensive test. No
problems there either.
Any ideas???
Regards from
VJ
________________________________
--
fedora-list mailing list
fedora-list at redhat.com
To unsubscribe:
http://www.redhat.com/mailman/listinfo/fedora-list
Apologies. I sent MIME mail by mistake.
VJ
***************************************************************************
This message is intended only for the use of the Addressee and may
contain information that is PRIVILEGED and CONFIDENTIAL.
If you are not the intended recipient, you are hereby notified that any
dissemination of this communication is strictly prohibited. If you have
received this communication in error, please erase all copies of the
message and its attachments and notify Space Imaging immediately.
***************************************************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.fedoraproject.org/pipermail/users/attachments/20041127/8a30d431/attachment-0002.html
More information about the users
mailing list