mirror of
https://github.com/apache/cloudstack.git
synced 2025-10-26 08:42:29 +01:00
The watchdog timer adds functionality where the Hypervisor can detect if an instance has crashed or stopped functioning. The watchdog timer adds functionality where the Hypervisor can detect if an instance has crashed or stopped functioning. When the Instance has the 'watchdog' daemon running it will send heartbeats to the /dev/watchdog device. If these heartbeats are no longer received by the HV it will reset the Instance. If the Instance never sends the heartbeats the HV does not take action. It only takes action if it stops sending heartbeats. This is supported since Libvirt 0.7.3 and can be defined in the XML format as described in the docs: https://libvirt.org/formatdomain.html#elementsWatchdog To the 'devices' section this will be added: In the agent.properties the action to be taken can be defined: vm.watchdog.action=reset The same goes for the model. The Intel i6300esb is however the most commonly used. vm.watchdog.model=i6300esb When the Instance has the 'watchdog' daemon running it will send heartbeats to the /dev/watchdog device. If these heartbeats are no longer received by the HV it will reset the Instance. If the Instance never sends the heartbeats the HV does not take action. It only takes action if it stops sending heartbeats. This is supported since Libvirt 0.7.3 and can be defined in the XML format as described in the docs: https://libvirt.org/formatdomain.html#elementsWatchdog To the 'devices' section this will be added: <watchdog model='i6300esb' action='reset'/> In the agent.properties the action to be taken can be defined: vm.watchdog.action=reset The same goes for the model. The Intel i6300esb is however the most commonly used. vm.watchdog.model=i6300esb Signed-off-by: Wido den Hollander <wido@widodh.nl>