[389-users] 389 crash libdb: PANIC: fatal region error detected

Ludwig Krispenz lkrispen at redhat.com
Fri Aug 2 07:39:48 UTC 2013


If the database panics, it has detected an inconsistent state that it 
cannot resolve, and it sets a panic flag in the database environment to 
prevent further use. Simply restarting DS will detect the flag and abort.
You can try removing the __db.00* files and then starting DS again; if 
this fails, you need to rebuild the database, either from a bak backup or 
by an LDIF import.
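
The first step above (clearing the __db.00* region files) could be 
scripted roughly as below. This is only a sketch under assumptions: the 
function name and the backup directory are made up for illustration, the 
db path is the one shown in the startup log, and the files are moved 
aside rather than deleted so they can be put back. DS must be stopped 
before running it.

```python
import shutil
from pathlib import Path


def set_aside_region_files(db_dir, backup_dir):
    """Move Berkeley DB region files (__db.00*) out of db_dir.

    Hypothetical helper, not part of 389 DS. Returns the sorted names
    of the files that were moved. Run only while the server is stopped.
    """
    db_path = Path(db_dir)
    backup_path = Path(backup_dir)
    backup_path.mkdir(parents=True, exist_ok=True)

    moved = []
    for region_file in sorted(db_path.glob("__db.00*")):
        if region_file.is_file():
            # Move instead of delete, so the step can be undone.
            shutil.move(str(region_file), str(backup_path / region_file.name))
            moved.append(region_file.name)
    return moved


# Example call (paths are assumptions; the db path is taken from the log):
# set_aside_region_files("/var/lib/dirsrv/slapd-ldap_kolab/db",
#                        "/root/db-region-files-aside")
```

Only the region files are touched; the transaction logs and the 
database files themselves stay in place, so DS can attempt its normal 
recovery on the next start.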

As for why it panicked, I don't think we can find that out any more.

Ludwig

On 08/02/2013 09:30 AM, Manel Gimeno Zaragozá wrote:
> Hello,
>
> Yesterday afternoon my LDAP server crashed while no modifications were 
> being made, only queries.
>
> My environment is an openvz container:
>
> # cat /etc/issue
> CentOS release 6.4 (Final)
> Kernel \r on an \m
>
> # uname -a
> Linux newldap.test.es 2.6.32-042stab053.5 #1 SMP Tue Mar 27 11:42:17 
> MSD 2012 x86_64 x86_64 x86_64 GNU/Linux
>
> # rpm -qa | grep 389
> 389-console-1.1.7-1.el6.noarch
> 389-admin-1.1.29-1.el6.x86_64
> 389-admin-console-1.1.8-1.el6.noarch
> 389-ds-base-1.2.11.15-14.el6_4.x86_64
> 389-ds-console-1.2.6-1.el6.noarch
> 389-ds-base-libs-1.2.11.15-14.el6_4.x86_64
> 389-ds-base-devel-1.2.11.15-14.el6_4.x86_64
> 389-adminutil-1.1.15-1.el6.x86_64
>
>
> Please find below the log when it crashes:
>
> errors:10:[01/Aug/2013:14:09:44 +0200] configure_pr_socket - Unable to 
> move socket file descriptor 62 above 64: OS error 24 (Too many open files)
> errors:11:[01/Aug/2013:14:09:44 +0200] configure_pr_socket - Unable to 
> move socket file descriptor 63 above 64: OS error 24 (Too many open files)
> errors:12:[01/Aug/2013:14:09:44 +0200] - PR_Accept() failed, Netscape 
> Portable Runtime error -5971 (Process open FD table is full.)
> errors:13:[01/Aug/2013:14:09:44 +0200] - PR_Accept() failed, Netscape 
> Portable Runtime error -5971 (Process open FD table is full.)
> errors:14:[01/Aug/2013:14:09:44 +0200] - PR_Accept() failed, Netscape 
> Portable Runtime error -5971 (Process open FD table is full.)
> ...
> [01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable 
> Runtime error -5971 (Process open FD table is full.)
> [01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable 
> Runtime error -5971 (Process open FD table is full.)
> [01/Aug/2013:14:10:01 +0200] - libdb: PANIC: fatal region error 
> detected; run recovery
> [01/Aug/2013:14:10:01 +0200] - Serious Error---Failed in deadlock 
> detect (aborted at 0x0), err=-30974 (DB_RUNRECOVERY: Fatal error, run 
> database recovery)
> [01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable 
> Runtime error -5971 (Process open FD table is full.)
> [01/Aug/2013:14:10:01 +0200] - PR_Accept() failed, Netscape Portable 
> Runtime error -5971 (Process open FD table is full.)
> ...
> [01/Aug/2013:14:12:27 +0200] - libdb: PANIC: fatal region error 
> detected; run recovery
> [01/Aug/2013:14:12:27 +0200] - Serious Error---Failed in deadlock 
> detect (aborted at 0x0), err=-30974 (DB_RUNRECOVERY: Fatal error, run 
> database recovery)
> [01/Aug/2013:14:12:28 +0200] - libdb: PANIC: fatal region error 
> detected; run recovery
> [01/Aug/2013:14:12:28 +0200] - Serious Error---Failed in deadlock 
> detect (aborted at 0x0), err=-30974 (DB_RUNRECOVERY: Fatal error, run 
> database recovery)
> [01/Aug/2013:14:12:28 +0200] - libdb: PANIC: fatal region error 
> detected; run recovery
> [01/Aug/2013:14:12:28 +0200] - Serious Error---Failed to checkpoint 
> database, err=-30974 (DB_RUNRECOVERY: Fatal error, run database recovery)
>
> When I noticed that the LDAP server had crashed, I tried to restart the 
> process and the following errors showed up:
>
> [01/Aug/2013:14:20:15 +0200] - 389-Directory/1.2.11.15 B2013.105.2259 
> starting up
> [01/Aug/2013:14:20:15 +0200] - WARNING: userRoot: entry cache size 
> 10485760B is less than db size 14729216B; We recommend to increase the 
> entry cache size nsslapd-cachememsize.
> [01/Aug/2013:14:20:15 +0200] - libdb: PANIC: fatal region error 
> detected; run recovery
> [01/Aug/2013:14:20:15 +0200] - Opening database environment 
> (/var/lib/dirsrv/slapd-ldap_kolab/db) failed. err=-30974: 
> DB_RUNRECOVERY: Fatal error, run database recovery
> [01/Aug/2013:14:20:15 +0200] - start: Failed to init database, 
> err=-30974 DB_RUNRECOVERY: Fatal error, run database recovery
> [01/Aug/2013:14:20:15 +0200] - Failed to start database plugin ldbm 
> database
> [01/Aug/2013:14:20:15 +0200] - WARNING: ldbm instance userRoot already 
> exists
> [01/Aug/2013:14:20:15 +0200] - ldbm_config_read_instance_entries: 
> failed to add instance entry cn=userRoot,cn=ldbm 
> database,cn=plugins,cn=config
> [01/Aug/2013:14:20:15 +0200] - ldbm_config_load_dse_info: failed to 
> read instance entries
> [01/Aug/2013:14:20:15 +0200] - start: Loading database configuration 
> failed
> [01/Aug/2013:14:20:15 +0200] - Failed to start database plugin ldbm 
> database
> [01/Aug/2013:14:20:15 +0200] - Error: Failed to resolve plugin 
> dependencies
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin 7-bit check 
> is not started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Account 
> Usability Plugin is not started
> [01/Aug/2013:14:20:15 +0200] - Error: accesscontrol plugin ACL Plugin 
> is not started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin ACL 
> preoperation is not started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Auto 
> Membership Plugin is not started
> [01/Aug/2013:14:20:15 +0200] - Error: object plugin Class of Service 
> is not started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin deref is not 
> started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin HTTP Client 
> is not started
> [01/Aug/2013:14:20:15 +0200] - Error: database plugin ldbm database is 
> not started
> [01/Aug/2013:14:20:15 +0200] - Error: object plugin Legacy Replication 
> Plugin is not started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Linked 
> Attributes is not started
> [01/Aug/2013:14:20:15 +0200] - Error: preoperation plugin Managed 
> Entries is not started
> [01/Aug/2013:14:20:15 +0200] - Error: object plugin Multimaster 
> Replication Plugin is not started
> [01/Aug/2013:14:20:15 +0200] - Error: object plugin Roles Plugin is 
> not started
> [01/Aug/2013:14:20:15 +0200] - Error: object plugin Views is not started
>
> Can anyone help me?
>
> Thanks for your help
>
> Manel
