diff --git a/doc/source/afs.rst b/doc/source/afs.rst index 26b5455054..63bfcf24e7 100644 --- a/doc/source/afs.rst +++ b/doc/source/afs.rst @@ -311,6 +311,25 @@ Then remove the server with :: Finally run the ``bos create`` command above with any modified parameters to restart the server. +Recovering a Failed Fileserver +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +If a fileserver crashes, take the following steps to ensure it's +usable after recovery: + +* Pause mirror updates and volume release cron jobs + +* Reboot the server; fix any filesystem errors and check the salvager + logs + +* Check for any stuck volume transactions; remedy as appropriate + +* Perform a manual release of every volume from a terminal on a server + using "-localauth" in case OpenAFS decides it can't do an + incremental update. + +* Re-enable cron jobs + Mirrors ~~~~~~~