<html>
<head>
<meta content="text/html; charset=utf-8" http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 10/22/2014 10:58 AM, Shilen Patel
wrote:<br>
</div>
<blockquote cite="mid:D06D5127.A8996%25shilen@duke.edu" type="cite">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<div>1.2.11.15 is a couple of years old?</div>
</blockquote>
<br>
Yes and no. 1.2.11.15 was the starting point for EL6. However,
many, many features and fixes have been backported from later
versions into 1.2.11.15-47 in EL 6.6.<br>
<br>
<blockquote cite="mid:D06D5127.A8996%25shilen@duke.edu" type="cite">
<div>I had to upgrade to the latest in copr because of another
issue that I think was fixed in 1.2.11.30.</div>
</blockquote>
<br>
Has that issue been fixed in 1.2.11.15-47 in EL 6.6? I know a lot
of 389 community members running on EL6 were using fedorapeople/copr
repos because they could not wait until those fixes/features were
available in EL 6.6. Now that EL 6.6 is out, I encourage you (and
anyone else in this situation) to stop using fedorapeople/copr
builds and instead use 1.2.11.15-47 in EL 6.6.<br>
<br>
<blockquote cite="mid:D06D5127.A8996%25shilen@duke.edu" type="cite">
<div>If I’m misunderstanding version numbers in EL vs copr, please
let me know.</div>
</blockquote>
<br>
See above.<br>
<br>
<blockquote cite="mid:D06D5127.A8996%25shilen@duke.edu" type="cite">
<div>But my main question is the second question regarding best
practices for detecting replication failures and I think that
applies to all versions?</div>
</blockquote>
<br>
<span id="OLK_SRC_BODY_SECTION"><span id="OLK_SRC_BODY_SECTION"
style="color: rgb(0, 0, 0); font-size: 14px; font-family:
Calibri, sans-serif;">nsds5replicaLastUpdateStatus is the
documented way to get replication status. The fact that this
error is not being reported that way seems like a bug.<br>
You can also monitor the errors logs.<br>
<br>
As for this particular problem, see
<a class="moz-txt-link-freetext" href="https://fedorahosted.org/389/ticket/47409">https://fedorahosted.org/389/ticket/47409</a><br>
<br>
</span></span>
<blockquote cite="mid:D06D5127.A8996%25shilen@duke.edu" type="cite">
<div><br>
</div>
<div>Thanks!</div>
<div><br>
</div>
<div>— Shilen</div>
<div><br>
</div>
<span id="OLK_SRC_BODY_SECTION">
<div style="font-family:Calibri; font-size:11pt;
text-align:left; color:black; BORDER-BOTTOM: medium none;
BORDER-LEFT: medium none; PADDING-BOTTOM: 0in; PADDING-LEFT:
0in; PADDING-RIGHT: 0in; BORDER-TOP: #b5c4df 1pt solid;
BORDER-RIGHT: medium none; PADDING-TOP: 3pt">
<span style="font-weight:bold">From: </span>Rich Megginson
<<a moz-do-not-send="true"
href="mailto:rmeggins@redhat.com">rmeggins@redhat.com</a>><br>
<span style="font-weight:bold">Reply-To: </span>"<a
moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>"
<<a moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>><br>
<span style="font-weight:bold">Date: </span>Wednesday,
October 22, 2014 at 12:14 PM<br>
<span style="font-weight:bold">To: </span>"<a
moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>"
<<a moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>><br>
<span style="font-weight:bold">Subject: </span>Re:
[389-users] Error code 51 and replication errors<br>
</div>
<div><br>
</div>
<blockquote id="MAC_OUTLOOK_ATTRIBUTION_BLOCKQUOTE"
style="BORDER-LEFT: #b5c4df 5 solid; PADDING:0 0 0 5; MARGIN:0
0 0 5;">
<div>
<div bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 10/22/2014 10:10 AM,
Shilen Patel wrote:<br>
</div>
<blockquote cite="mid:D06D4F13.A898A%25shilen@duke.edu"
type="cite">
<div>
<p style="margin: 0px;">389-ds-base-1.2.11.32-1.el6.x86_64</p>
</div>
</blockquote>
<br>
I would strongly encourage you to use the version provided
with EL 6.6, which is 389-ds-base-1.2.11.15-47. It looks
like you are using a build from the old rmeggins repo or
the newer copr repo. These are really only for those
users who needed critical fixes or features not yet in the
"supported" EL6.6 version. I don't know if that will fix
your problem, but it will make it a lot easier to support.<br>
<br>
<br>
<blockquote cite="mid:D06D4F13.A898A%25shilen@duke.edu"
type="cite">
<div><br>
</div>
<div>Thanks!</div>
<div style="font-size: 14px; font-family: Calibri,
sans-serif;"><br>
</div>
<div style="font-size: 14px; font-family: Calibri,
sans-serif;">— Shilen</div>
<div style="color: rgb(0, 0, 0); font-size: 14px;
font-family: Calibri, sans-serif;">
<br>
</div>
<span id="OLK_SRC_BODY_SECTION" style="color: rgb(0, 0,
0); font-size: 14px; font-family: Calibri,
sans-serif;">
<div style="font-family:Calibri; font-size:11pt;
text-align:left; color:black; BORDER-BOTTOM: medium
none; BORDER-LEFT: medium none; PADDING-BOTTOM: 0in;
PADDING-LEFT: 0in; PADDING-RIGHT: 0in; BORDER-TOP:
#b5c4df 1pt solid; BORDER-RIGHT: medium none;
PADDING-TOP: 3pt">
<span style="font-weight:bold">From: </span>Rich
Megginson <<a moz-do-not-send="true"
href="mailto:rmeggins@redhat.com">rmeggins@redhat.com</a>><br>
<span style="font-weight:bold">Reply-To: </span>"<a
moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>"
<<a moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>><br>
<span style="font-weight:bold">Date: </span>Wednesday,
October 22, 2014 at 12:07 PM<br>
<span style="font-weight:bold">To: </span>"<a
moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>"
<<a moz-do-not-send="true"
href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>><br>
<span style="font-weight:bold">Subject: </span>Re:
[389-users] Error code 51 and replication errors<br>
</div>
<div><br>
</div>
<blockquote id="MAC_OUTLOOK_ATTRIBUTION_BLOCKQUOTE"
style="BORDER-LEFT: #b5c4df 5 solid; PADDING:0 0 0
5; MARGIN:0 0 0 5;">
<div>
<div bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">On 10/22/2014 09:54
AM, Shilen Patel wrote:<br>
</div>
<blockquote
cite="mid:D06D4B78.A897B%25shilen@duke.edu"
type="cite">
<div>Hi,</div>
<div><br>
</div>
<div>I’m running 1.2.11.32.</div>
</blockquote>
<br>
What is output of rpm -q 389-ds-base?<br>
<br>
<blockquote
cite="mid:D06D4B78.A897B%25shilen@duke.edu"
type="cite">
<div>I have 6 replicas (two of which are
read-only). I ran into an issue where a
DELETE operation failed on a server with
error code 51 (ldap busy).</div>
<div><br>
</div>
<div>
<p style="margin: 0px;">[21/Oct/2014:23:44:44
-0400] conn=78160 op=39510 RESULT err=51
tag=107 nentries=0 etime=3
csn=5447282c000300050000</p>
</div>
<div><br>
</div>
<div>The application retried the delete
several times for a couple of hours (while
the server wasn’t getting any other
requests) and the result was always the same
(err=51). Each time that happened, the
error log had the following:</div>
<div><br>
</div>
<div>
<p style="margin: 0px;">[21/Oct/2014:23:44:44
-0400] - Retry count exceeded in delete</p>
</div>
<div><br>
</div>
<div>My first question is, what would cause a
problem like this?</div>
<div><br>
</div>
<div>I simply restarted that directory and
then the update succeeded. However, when
the update went to the other 5 servers, they
failed in the same way and the same error
was logged in their log files. But the
update wasn’t retried. It was just skipped
and future updates via replication succeeded
on those 5 servers.</div>
<div><br>
</div>
<div>My second question is, what’s the best
way to monitor for these types of
replication errors? In this
case, nsds5replicaLastUpdateStatus did not
indicate a problem. If I had not been
looking at the error file on those 5 hosts,
I’m wondering how I would have known that a
delete failed to replicate to them. If the
answer is to just have something monitoring
the error log files, are there specific
search strings to look for to separate out
updates that have failed and won’t be
retried from other errors (e.g. temporary
connection issues)? Just curious if there
is a best practice here.</div>
<div><br>
</div>
<div>Thanks!</div>
<div><br>
</div>
<div>— Shilen</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">--
389 users mailing list
<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a><a moz-do-not-send="true" class="moz-txt-link-freetext" href="https://admin.fedoraproject.org/mailman/listinfo/389-users">https://admin.fedoraproject.org/mailman/listinfo/389-users</a></pre>
</blockquote>
<br>
</div>
</div>
</blockquote>
</span><br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">--
389 users mailing list
<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a><a moz-do-not-send="true" class="moz-txt-link-freetext" href="https://admin.fedoraproject.org/mailman/listinfo/389-users">https://admin.fedoraproject.org/mailman/listinfo/389-users</a></pre>
</blockquote>
<br>
</div>
</div>
</blockquote>
</span>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">--
389 users mailing list
<a class="moz-txt-link-abbreviated" href="mailto:389-users@lists.fedoraproject.org">389-users@lists.fedoraproject.org</a>
<a class="moz-txt-link-freetext" href="https://admin.fedoraproject.org/mailman/listinfo/389-users">https://admin.fedoraproject.org/mailman/listinfo/389-users</a></pre>
</blockquote>
<br>
</body>
</html>