|
LCA: Andrew Tanenbaum on creating reliable systemsLCA: Andrew Tanenbaum on creating reliable systemsPosted Jan 31, 2007 22:50 UTC (Wed) by tjc (subscriber, #137)In reply to: LCA: Andrew Tanenbaum on creating reliable systems by pascal.martin Parent article: LCA: Andrew Tanenbaum on creating reliable systems I just found this bit of information in the paper "Reorganizing UNIX for Reliability"
If crashes reoccur, a binary exponential backoff protocol could be used to prevent bogging down the system with repeated recoveries. Unfortunately, no specifics are given. It sounds like something from Star Trek TNG. Data: "Captain, I could use an binary exponential backoff protocol to restart the warp engines." Picard: "Very good Mr. Data -- make it so!" http://www.minix3.org/doc/ACSAC-2006.pdf
(Log in to post comments)
exponential backoff Posted Feb 1, 2007 12:54 UTC (Thu) by robbe (guest, #16131) [Link] Exponential backoff is a standard technique used, for example by mailservers, in the face of transient failures: after the n-th consequitve error, wait f * k^n seconds, then retry. Suitable values for f and k depend on the application -- k is often 2 -> binary exponential backoff.
Example with f = 300, i.e. 5 minutes (a viable value for SMTP):
* First try ... fails
It would work the same for OS-component restart, of course with values
|
Copyright © 2008, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds
Powered by Rackspace Managed Hosting.