[389-users] db import failure, when setting replication up
Noriko Hosoi
nhosoi at redhat.com
Wed May 25 18:12:54 UTC 2011
On 05/25/2011 10:57 AM, Reinhard Nappert wrote:
> So, you are saying that the server would start up when I just replace
> the Berkeley Database libraries.
> I doubt it, but I will try ......
Ah, sorry. You were updating from 4.2.x to what version? (I was
somehow thinking you were applying the patch...) In such a case, yes,
there are some data changes. Most likely, the directory server's
backend detects the change and takes care of it. Please let us know how
it goes.
> I think I have to rebuild the source with the new lib. When I upgrade
> ds-base with the newly build package, does the server still understand
> the db files?
Yes, it's supposed to do so.
--noriko
> -Reinhard
>
> ------------------------------------------------------------------------
> *From:* 389-users-bounces at lists.fedoraproject.org
> [mailto:389-users-bounces at lists.fedoraproject.org] *On Behalf Of
> *Noriko Hosoi
> *Sent:* Wednesday, May 25, 2011 1:50 PM
> *To:* General discussion list for the 389 Directory server project.
> *Subject:* Re: [389-users] db import failure, when setting replication up
>
> On 05/24/2011 06:27 AM, Reinhard Nappert wrote:
>> I do that.
>> Now, I have two questions:
>> So, what db version do you recommend?
> Hi Reinhard,
>
> Which OS you are running?
>
> If it's RHEL5 (BDB4.3.29) or RHEL6 (BDB4.7.25), they are patched. But
> RHEL4 (BDB4.2.52) was rejected.
>
>> More importantly, is there a migration path or do I have to reload
>> the existing data? I could see issues migrating replicated environments.
> There's no data change needed. The bug was just in the data
> verification code.
>
> This bug has more detailed info.
> *Bug 472131* <https://bugzilla.redhat.com/show_bug.cgi?id=472131>
> -dbverify: when a duplicate is large enough to have internal page(s),
> dbverify issues bogus out-of-order key errors
>
> Thanks,
> --noriko
>> Thanks,
>> -Reinhard
>>
>> ------------------------------------------------------------------------
>> *From:* 389-users-bounces at lists.fedoraproject.org
>> [mailto:389-users-bounces at lists.fedoraproject.org] *On Behalf Of
>> *Noriko Hosoi
>> *Sent:* Monday, May 23, 2011 1:42 PM
>> *To:* 389-users at lists.fedoraproject.org
>> *Subject:* Re: [389-users] db import failure, when setting replication up
>>
>> Hi Reinhard,
>>
>> That was an unfortunate... I was hoping you were using a newer
>> version. :) You hit this bug.
>>
>> *Bug 472131* <https://bugzilla.redhat.com/show_bug.cgi?id=472131>
>> -dbverify: when a duplicate is large enough to have internal page(s),
>> dbverify issues bogus out-of-order key errors
>>
>> The bug was fixed by Sleepycat on db4.8. And we ported the fix back
>> to 4.3, but no chance to do so to 4.2. So, we cannot use dbverify to
>> check if the index file is healthy or not... Could it be possible to
>> reindex the ancestorid index and see if the error goes away? (Or you
>> could reinitialize the consumer? That would be the cleanest)
>>
>> Thanks,
>> --noriko
>>
>> Reinhard Nappert wrote:
>>> Hi Noriko,
>>> I run it on a CentOS 4.4 box (Linux 2.6.24). I use the db 4.2 libs
>>> with all the patches.
>>> Oh, yes dbverify does complain a lot. I see for all of the db files
>>> messages like:
>>> [20/May/2011:11:03:05 -0400] DB verify - verify failed(-30976):
>>> /var/lib/dirsrv/slapd-ID/db/userRoot/cn.db4
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 2
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 5
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 8
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 10
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 13
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 16
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 19
>>> [20/May/2011:11:03:06 -0400] - libdb: Page 5: out-of-order key at
>>> entry 21
>>> [20/May/2011:11:03:07 -0400] DB verify - verify failed(-30976):
>>> /var/lib/dirsrv/slapd-ID/db/userRoot/parentid.db4
>>> DB verify: Passed
>>> This said, I guess I should re-index the entire db. Any idea, why this is happening?
>>> Right now, I have a 2 MMR setup, where both masters also have a
>>> replication agreement to a third box, which is a dedicated consumer.
>>> I do run tests, where I perform simultaneously adds and deletes (not
>>> on the same object) on all three boxes. I just want to verify how
>>> replication behaves in 1.2.8.
>>> -Reinhard
>>>
>>> ------------------------------------------------------------------------
>>> *From:* 389-users-bounces at lists.fedoraproject.org
>>> [mailto:389-users-bounces at lists.fedoraproject.org] *On Behalf Of
>>> *Noriko Hosoi
>>> *Sent:* Thursday, May 19, 2011 5:33 PM
>>> *To:* 389-users at lists.fedoraproject.org
>>> *Subject:* Re: [389-users] db import failure, when setting
>>> replication up
>>>
>>> Hi Reinhard,
>>>
>>> Could you tell me the OS version and Berkeley DB version (rpm -q db4)?
>>>
>>> Could you run "/usr/lib[64]/dirsrv/slapd-ID/dbverify"? Does it
>>> complain anything? Especially, the ancestorid index? If it does,
>>> you may want to re-create the corrupted index...
>>> --noriko
>>>
>>> Reinhard Nappert wrote:
>>>> Noriko,
>>>> I observed one more item, which does not bother me right now, but
>>>> you may want to see:
>>>> I am not sure why and how it happened, but I see the following
>>>> message on the supplier:
>>>> [18/May/2011:13:59:50 -0400] NSMMReplicationPlugin -
>>>> agmt="cn=supplier2consumer" (consumer:389): Consumer failed to
>>>> replay change (uniqueid aea3731d-808711e0-83d5fdc8-f32b8f3c, CSN
>>>> 4dd4085b004800040000): Operations error. Will retry later.
>>>> And I see the following on the consumer:
>>>> [18/May/2011:13:59:29 -0400] - idl_new.c BAD 22, err=-30988
>>>> DB_PAGE_NOTFOUND: Requested page not found
>>>> [18/May/2011:13:59:29 -0400] - ancestorid BAD 13120, err=-30988
>>>> DB_PAGE_NOTFOUND: Requested page not found
>>>> Any idea, what happened there....
>>>> Thanks,
>>>> -Reinhard
>>>>
>>>>
>>>>
>>>> ------------------------------------------------------------------------
>>>> *From:* 389-users-bounces at lists.fedoraproject.org
>>>> [mailto:389-users-bounces at lists.fedoraproject.org] *On Behalf Of
>>>> *Noriko Hosoi
>>>> *Sent:* Tuesday, May 17, 2011 4:02 PM
>>>> *To:* General discussion list for the 389 Directory server project.
>>>> *Subject:* Re: [389-users] db import failure, when setting
>>>> replication up
>>>>
>>>> Hi Reinhard,
>>>>
>>>> Reinhard Nappert wrote:
>>>>> Hi Noriko,
>>>>> I have to correct myself. The box which had the import issue was
>>>>> on a 1.2.7.5 system. The other box was running 1.2.8.2.
>>>>> So, it looks like you have fixed the issue with 1.2.8.2.
>>>> *relieved* Thanks for testing it on 1.2.8.2!
>>>> --noriko
>>>>> Thanks,
>>>>> -Reinhard
>>>>>
>>>>> ------------------------------------------------------------------------
>>>>> *From:* 389-users-bounces at lists.fedoraproject.org
>>>>> [mailto:389-users-bounces at lists.fedoraproject.org] *On Behalf Of
>>>>> *Reinhard Nappert
>>>>> *Sent:* Tuesday, May 17, 2011 3:21 PM
>>>>> *To:* General discussion list for the 389 Directory server project.
>>>>> *Subject:* Re: [389-users] db import failure, when setting
>>>>> replication up
>>>>>
>>>>> 1.2.8.2
>>>>> -Reinhard
>>>>>
>>>>> ------------------------------------------------------------------------
>>>>> *From:* 389-users-bounces at lists.fedoraproject.org
>>>>> [mailto:389-users-bounces at lists.fedoraproject.org] *On Behalf Of
>>>>> *Noriko Hosoi
>>>>> *Sent:* Tuesday, May 17, 2011 2:16 PM
>>>>> *To:* General discussion list for the 389 Directory server project.
>>>>> *Subject:* Re: [389-users] db import failure, when setting
>>>>> replication up
>>>>>
>>>>> It looks to me you have hit this bug... Which version of
>>>>> 389-ds-base you are running?
>>>>>
>>>>> *Bug 684996*
>>>>> <https://bugzilla.redhat.com/show_bug.cgi?id=684996> -Exported
>>>>> tombstone cannot be imported correctly.
>>>>>
>>>>> The patch should be in the version 1.2.8.2.
>>>>> Thanks,
>>>>> --noriko
>>>>>
>>>>> On 05/17/2011 11:03 AM, Reinhard Nappert wrote:
>>>>>> Hi,
>>>>>> I have seen the following:
>>>>>> I set 2 systems up in MMR. Replication worked. For some reason, I
>>>>>> needed to take one of the boxes out of the replication and
>>>>>> disabled replication. Later on, I enabled it again and created
>>>>>> the shadowing agreement to the other box. Now, I saw the
>>>>>> following errors during the import of the db:
>>>>>> [17/May/2011:11:46:04 -0400] NSMMReplicationPlugin -
>>>>>> multimaster_be_state_change
>>>>>> : replica o=base is going offline; disabling replication
>>>>>> [17/May/2011:11:46:07 -0400] - WARNING: Import is running with
>>>>>> nsslapd-db-privat
>>>>>> e-import-mem on; No other process is allowed to access the database
>>>>>> [17/May/2011:11:46:08 -0400] - import userRoot: WARNING: Skipping
>>>>>> entry "nsuniqu
>>>>>> eid=06869502-7fe011e0-8f589300-7e7b2163,ou=sample,o=base" which
>>>>>> has no parent,
>>>>>> ending at line 0 of file "(bulk import)"
>>>>>> [17/May/2011:11:46:08 -0400] - import userRoot: WARNING: bad
>>>>>> entry: ID 453
>>>>>> .....
>>>>>> Any idea, what is going on there?
>>>>>> Thanks,
>>>>>> -Reinhard
>>>>>>
>>>>>>
>>>>>> --
>>>>>> 389 users mailing list
>>>>>> 389-users at lists.fedoraproject.org
>>>>>> https://admin.fedoraproject.org/mailman/listinfo/389-users
>>>>>
>>>>>
>>>>> --
>>>>> 389 users mailing list
>>>>> 389-users at lists.fedoraproject.org
>>>>> https://admin.fedoraproject.org/mailman/listinfo/389-users
>>>>
>>>>
>>>> --
>>>> 389 users mailing list
>>>> 389-users at lists.fedoraproject.org
>>>> https://admin.fedoraproject.org/mailman/listinfo/389-users
>>>
>>>
>>> --
>>> 389 users mailing list
>>> 389-users at lists.fedoraproject.org
>>> https://admin.fedoraproject.org/mailman/listinfo/389-users
>>
>>
>> --
>> 389 users mailing list
>> 389-users at lists.fedoraproject.org
>> https://admin.fedoraproject.org/mailman/listinfo/389-users
>
>
> --
> 389 users mailing list
> 389-users at lists.fedoraproject.org
> https://admin.fedoraproject.org/mailman/listinfo/389-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.fedoraproject.org/pipermail/389-users/attachments/20110525/dd0fd81d/attachment.html>
More information about the 389-users
mailing list