Fault Recovery is a feature of Domino which enables a server that has crashed to shut itself down and then restart automatically, without any administrator intervention. When Domino encounters a situation which is considered a fatal error, each Domino process is terminated and all associated resources are freed. The startup script (/opt/lotus/bin/server) detects the situation and restarts the server.
When Fault Recovery is enabled, the server password, if any, is securely stored in the system kernel memory area where it can be retrieved by a restarting server.
Fault Recovery supports Domino partitions. When a situation arises in a partition, only that partition is terminated and restarted. Other Domino partitions on the server are unaffected.
An optional program can be executed by the Fault Recovery system before any other cleanup takes place by using the CleanupScriptPath notes.ini variable. For example, to automatically invoke nsd add the following line to notes.ini:
CleanupScriptPath=/opt/lotus/bin/nsd
(NOTE: The cleanup script feature is available even if Fault Recovery is disabled)
To enable Fault Recovery, add the following line to NOTES.INI:
FaultRecovery=1
Fault Recovery is currently available only on Domino for UNIX platforms.
If you are using the Fault Recovery feature on Solaris 2.6 or greater, you must add the following line to /etc/system and reboot:
set msgsys:msginfo_msgtql=1024
This needs to be set on Solaris because Fault Recovery uses message queues. The default for Solaris is 40 messages per queue, and Fault Recovery requires more than 40 messages in this queue. This setting increases it from 40 to 1024.