TY - GEN
T1 - R-Kernel
T2 - IEEE INFOCOM 2011
AU - Cao, Qing
AU - Wang, Xiaorui
AU - Qi, Hairong
AU - He, Tian
PY - 2011
Y1 - 2011
AB - In this paper, we present r-kernel, an operating system kernel foundation specifically designed to improve software reliability in networked embedded systems. The key novelty of r-kernel lies in that it exploits the time dimension of software execution to improve robustness. Specifically, r-kernel keeps track of the execution of applications through checkpoints. If one application has been determined to have failed, r-kernel performs rollback operations to restore its state to one of those checkpoints created earlier. For the second round of operation, r-kernel provides a safe mode environment to avoid triggering the same bugs. Finally, if the whole system has crashed, r-kernel relies on watchdog timers to reset the node, and develops a technique called past-run trace reconstruction to locate and report the thread that had caused the system failure. We have implemented r-kernel based on the LiteOS operating system kernel running on the popular MicaZ platform. We demonstrate that it achieves the desired goals above with acceptable overhead.
UR - http://www.scopus.com/inward/record.url?scp=79960873051&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960873051&partnerID=8YFLogxK
U2 - 10.1109/INFCOM.2011.5935074
DO - 10.1109/INFCOM.2011.5935074
M3 - Conference contribution
AN - SCOPUS:79960873051
SN - 9781424499212
T3 - Proceedings - IEEE INFOCOM
SP - 2507
EP - 2515
BT - 2011 Proceedings IEEE INFOCOM
Y2 - 10 April 2011 through 15 April 2011
ER -