This article contains information specific to the Dell EMC PowerEdge server hardware. The procedures have been tested on a PowerEdge R240.
The hardware watchdog is a very efficient recovery method in the unlikely event the computer experience interference that makes the system instable and freezes. Follow these steps to enable the watchdog handled from Linux systemd.
From the terminal as the pixi-admin user enter:
sudo nano /etc/systemd/system.conf
This opens up the systemd config text file. Find the entry "#RuntimeWatchdogSec=60" uncomment by remove the leading "#" to enable the watchdog. The time value can be set to any time value in seconds that fits your environment. Type ctrl+o then enter to save and ctrl+x to close the text editor.
Next open up modules.conf in the Nano editor by enter the following line in the terminal.
sudo nano /etc/modules-load.d/modules.conf
Add the following line in the config file:
ipmi_watchdog
Type ctrl+o then enter to save and ctrl+x to close the text editor. Restart the server. To test, it is possible to freeze the kernel using the following method.
This method can cause issues, make sure there is backups available before trying this. Don’t do this in a production environment
In the terminal as pixi-admin user, type:
sudo -s
Sets user as root.
echo "1" > /proc/sys/kernel/sysrq
Enables all sysrq.
echo c > /proc/sysrq-trigger
Creates a “kernel-panic” that halts the system. A secondary recover action counter starts at 255 seconds and when countdown = 0 the hardware should restart.