<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
Gilbert Sebenste wrote:
<blockquote
cite="mid:Pine.LNX.4.64.0710072358090.20857@weather3.admin.niu.edu"
type="cite">Hello all,
<br>
<br>
I am having an absolutely vexing problem that maybe somebody might shed
some light on.
<br>
<br>
I just got 2 new computers, both running F7. They each have one Seagate
750 GB SATA 3 Gb/s, 7200 RPM, 16 MB drive. Each machine has 4 GB of
RAM, Core 2 quad 6700 motherboard from ASUS.
<br>
<br>
OK. I run the computers pretty hard. But I have two Pentium 4's who
work just as hard, all getting a 20 MB/sec peak (1 MB/sec avg) weather
feed from the National Weather Service, flawlessly for months until I
install new kernels on it and reboot.
<br>
<br>
OK, within 12 hours after startup of the new machine running identical
software that the other slower machines are running with the exact same
data feed, I get
<br>
<br>
kernel: journal commit I/O error
<br>
<br>
I can log in, but can't do commands. A manual power-down (shutdown -r
now won't work) and reboot clears it fine.
<br>
<br>
First I suspected a hard drive error on both machines. But then
replacement hard drives came in. It seemed to stop the problem for a
few days, so I closed a bugzilla I had. Nope, this weekend, it went
back to crashing every 4-18 hours.
<br>
<br>
I tried to cut the read-writes in half, to no effect, by reducing the
amount of data/files coming in.
<br>
<br>
I have:
<br>
<br>
Replaced the hard drive 3 times with new ones (to no avail)
<br>
<br>
Reduced the read/writes by around half
<br>
<br>
Turned off legacy USB support, which also caused my keyboard and mouse
to stop working with errors (that's been cleared and is OK)
<br>
<br>
Filed a bugzilla: <a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/show_bug.cgi?id=318661">https://bugzilla.redhat.com/show_bug.cgi?id=318661</a>
<br>
<br>
Tonight, I tried using the original kernel that came with F7
(2.6.21-1.3194.fc7) instead of the latest (2.6.22.9-91.fc-7).
<br>
As of two hours into this, so far so good, but I'm not confident.
<br>
<br>
Two other machines, Pentium 4's at 3 GHZ with ASUS motherboards, purr
like a kitten.
<br>
<br>
Has anyone seen anything like this, or know what could be the problem?
<br>
<br>
As always, grateful for any help, and thanks for reading this!
<br>
<br>
Gilbert
<br>
<br>
*******************************************************************************
<br>
Gilbert Sebenste
********
<br>
(My opinions only!)
******
<br>
*******************************************************************************
<br>
<br>
</blockquote>
I would suspect a hardware issue with the motherboards as my first port
of call. I had a similar problem with a new Pentium 4 board
recently, where the ATA disc interface went offline every 18 hours or so, but
having replaced the drive with a SATA one the system purrs for weeks.<br>
<br>
Secondly, the kernel version may be important: Core 2 Quad processors
are newish, so a later kernel SHOULD have better support. Maybe try a
development kernel on one of the machines, e.g. 2.6.23.-----<br>
<br>
Thirdly, have you run a full fsck on the drives after they fail?
Reboot into single-user mode and run fsck -f. You may find that the problem
is a disc structure corruption ... then you have to find out why.<br>
<br>
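To start finding out why, it may help to look at what the kernel logged
just before each failure. Below is a rough sketch in Python, assuming the
kernel messages end up in a syslog-style text file; the function name and
the amount of context are my own choices, not anything from your setup.<br>
<br>
<pre>
```python
MARKER = "journal commit I/O error"  # message quoted in the original report


def context_before_marker(lines, marker=MARKER, before=5):
    """For each line containing marker, collect up to 'before' preceding
    lines plus the matching line itself, with trailing newlines removed."""
    hits = []
    for i, line in enumerate(lines):
        if marker in line:
            start = max(0, i - before)  # do not run off the top of the log
            hits.append([l.rstrip("\n") for l in lines[start:i + 1]])
    return hits
```
</pre>
Feed it the lines of the log file (for example the result of readlines()
on /var/log/messages, if that is where your syslog writes) and print each
returned block. Bear in mind that if the failing disc is the one holding
the log, the last messages may never have been written, so logging to
another machine may be needed to catch them.<br>
<br>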
You do not say which journalling file system you are using: is this
ext3, jfs, reiserfs, ...?<br>
<br>
Finally, have you run memtest86+ on these machines? There may be a memory
dropout going unnoticed (especially if they do not have ECC memory).<br>
<br>
Not sure if this will help, but I hope it is not just noise...<br>
<br>
<div class="moz-signature">-- <br>
<div class="Section1">
<table class="MsoNormalTable" style="width: 100%;" border="0"
cellpadding="0" width="100%">
<tbody>
<tr style="">
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">Howard Wilkinson</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">Phone:</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">+44(20)76907075</p>
</td>
</tr>
<tr style="">
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">Coherent Technology Limited</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">Fax:</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal"> </p>
</td>
</tr>
<tr style="">
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">23 Northampton Square,</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">Mobile:</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">+44(7980)639379</p>
</td>
</tr>
<tr style="">
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">United Kingdom, EC1V 0HL</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal">Email:</p>
</td>
<td style="padding: 1.5pt;" valign="top">
<p class="MsoNormal"><a name="howardcohtech.com"></a><a class="moz-txt-link-abbreviated" href="mailto:howard@cohtech.com">howard@cohtech.com</a></p>
</td>
</tr>
</tbody>
</table>
<p class="MsoNormal"> </p>
</div>
</div>
</body>
</html>