[389-users] Replication hell - picking apart info/warning/error messages

Justin Edmands shockwavecs at gmail.com
Sun May 4 14:00:47 UTC 2014


What versions of all packages are your ABC running?
Has this all worked in the past? If so, any changes to structure?
Have you been able to initialize consumers from C (and have data appear)?
Random thought about C...what happens if you ;
- initialize C from B or A (blowing away and rebuilding all data on C)
- allow C to complete
- setup a new host as D
- initialize D from C
- after above completes, initialize C from D

Are you attempting to restore the data from backup to C and then run throw C back into the replication loop? The error below make me think that you are leaving your replication agreements intact while you attempt to restore to C. This would potentially make A or B attempt to sync updates to C while C is restoring from backup. 

"WARNING: Import is running with nsslapd-db-private-import-mem on; No other process is allowed to access the database"

Are you deleting all replication agreements from A B and C before starting to work with A --> C again?
--

Justin Edmands

> On May 4, 2014, at 9:16 AM, Graham Leggett <minfrin at sharp.fm> wrote:
> 
> Hi all,
> 
> I am into my second day of hell trying to get server C to replicate to servers B and A in a multimaster replication setup.
> 
> For reasons unknown it appears getting servers A and B to be blown away completely by the contents of (initially empty) server C is trivial in the extreme, cue two three hour outages and restores from backup. I cannot for the life of me get this to work the other way around.
> 
> After running the scripts to deploy server C from scratch, and adding the replication agreement to server A, I get the following in server C's log below.
> 
> I have a number of questions:
> 
> - "Replica has a different generation ID than the local data." - what does this mean? Is it simply information to be ignored, a warning to be heeded (if so, how?), or an error (if so, what action must be taken?).
> 
> - "ERROR bulk import abandoned". I am assuming this means that the replication failed, but it gives no indication as to *why* it failed. Anyone know of a way to coax a reason out of 389ds?
> 
> - "NSMMReplicationPlugin - replica_replace_ruv_tombstone: failed to update replication update vector for replica". Again, is this purely for information, is this a warning, is this an error, what should I do in response to this?
> 
> All I want to do is make serverc have the same data as servera. Does anyone have a clear and concise set of instructions for how this is to be achieved safely?
> 
> [04/May/2014:15:50:17 +0200] NSMMReplicationPlugin - agmt="cn=Agreement servera.example.com" (servera:636): Replica has a different generation ID than the local data.
> [04/May/2014:15:50:21 +0200] NSMMReplicationPlugin - agmt="cn=Agreement serverb.example.com" (serverb:636): Replica has a different generation ID than the local data.
> [04/May/2014:15:51:36 +0200] NSMMReplicationPlugin - multimaster_be_state_change: replica o=Foo,c=za is going offline; disabling replication
> [04/May/2014:15:51:36 +0200] - WARNING: Import is running with nsslapd-db-private-import-mem on; No other process is allowed to access the database
> [04/May/2014:15:51:40 +0200] - ERROR bulk import abandoned
> [04/May/2014:15:51:40 +0200] - import userRoot: Aborting all Import threads...
> [04/May/2014:15:51:45 +0200] - import userRoot: Import threads aborted.
> [04/May/2014:15:51:45 +0200] - import userRoot: Closing files...
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/aci.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/cn.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/parentid.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/mailAlternateAddress.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/objectclass.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/entryrdn.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/givenName.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/uniquemember.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/nsuniqueid.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/mailHost.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/mail.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/id2entry.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/sn.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - libdb: userRoot/uid.db4: unable to flush: No such file or directory
> [04/May/2014:15:51:45 +0200] - import userRoot: Import failed.
> [04/May/2014:15:51:45 +0200] - process_bulk_import_op: NULL target sdn
> [04/May/2014:15:51:49 +0200] NSMMReplicationPlugin - replica_replace_ruv_tombstone: failed to update replication update vector for replica o=Foo,c=ZA: LDAP error - 1
> [04/May/2014:15:51:53 +0200] NSMMReplicationPlugin - replica_replace_ruv_tombstone: failed to update replication update vector for replica o=Foo,c=ZA: LDAP error - 1
> [04/May/2014:15:51:57 +0200] NSMMReplicationPlugin - replica_replace_ruv_tombstone: failed to update replication update vector for replica o=Foo,c=ZA: LDAP error - 1
> 
> Regards,
> Graham
> --
> 
> --
> 389 users mailing list
> 389-users at lists.fedoraproject.org
> https://admin.fedoraproject.org/mailman/listinfo/389-users



More information about the 389-users mailing list