PacketFence - BTS - PacketFence | ||||||||||
View Issue Details | ||||||||||
ID | Project | Category | View Status | Date Submitted | Last Update | |||||
0001053 | PacketFence | error-handling | public | 2010-08-27 12:14 | 2012-06-14 12:16 | |||||
Reporter | obilodeau | |||||||||
Assigned To | ||||||||||
Priority | high | Severity | major | Reproducibility | random | |||||
Status | closed | Resolution | duplicate | |||||||
Platform | OS | OS Version | ||||||||
Product Version | ||||||||||
Target Version | 3.4.0 | Fixed in Version | 3.4.0 | |||||||
fixed in git revision | ||||||||||
fixed in mtn revision | ||||||||||
Summary | 0001053: restart code needs to be more aggressive if a daemon doesn't kill quickly enough | |||||||||
Description | Here's a situation that happened at a client:
- a restart is issued in a high-load situation and, for some reason, one daemon stays stuck and doesn't finish stopping (in our case one of the pfdhcplistener processes)
- the rest of the system restarts, but because that daemon is hung, the restart of the daemons of the same type (pfdhcplistener) is delayed indefinitely.
Another `pfcmd service pfdhcplistener restart` fixed it, but it took days to realize that (IP to MAC stopped working). Our restart code should handle that situation and become more aggressive if it has waited longer than 1 minute (for example). Logs provided below. | |||||||||
Steps To Reproduce | ||||||||||
Additional Information |
Aug 26 16:45:12 pfcmd(0) INFO: packetfence restart ... executing stop followed by start (main::service)
...
Aug 26 16:45:12 pfcmd(0) INFO: Stopping pfdhcplistener with 'pkill pfdhcplistener' (pf::services::service_ctl)
Aug 26 16:45:12 pfdhcplistener(0) FATAL: pfdhcplistener: caught SIGTERM - terminating (main::normal_sighandler)
Aug 26 16:45:12 pfdhcplistener(0) FATAL: pfdhcplistener: caught SIGTERM - terminating (main::normal_sighandler)
Aug 26 16:45:12 pfcmd(0) INFO: /usr/local/pf/sbin/pfdhcplistener status (pf::services::service_ctl)
Aug 26 16:45:12 pfdhcplistener(0) INFO: stopping pfdhcplistener for interface eth0 (main::END)
Aug 26 16:45:12 pfdhcplistener(0) INFO: stopping pfdhcplistener for interface eth1 (main::END)
but then one didn't stop:
Aug 26 16:45:14 pfcmd(0) INFO: pidof -x pfdhcplistener returned 21523 (pf::services::service_ctl)
Aug 26 16:45:14 pfcmd(0) INFO: Waiting for pfdhcplistener to stop (pf::services::service_ctl)
...
Aug 26 17:58:38 pfcmd(0) INFO: pidof -x pfdhcplistener returned 21523 (pf::services::service_ctl)
Aug 26 17:58:38 pfcmd(0) INFO: Waiting for pfdhcplistener to stop (pf::services::service_ctl)
...
Aug 27 11:25:43 pfcmd(0) INFO: pidof -x pfdhcplistener returned 21523 (pf::services::service_ctl)
Aug 27 11:25:43 pfcmd(0) INFO: Waiting for pfdhcplistener to stop (pf::services::service_ctl)
| |||||||||
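The behaviour requested in the description (wait a bounded time after SIGTERM, then get more aggressive) could be sketched roughly as follows. This is an illustration in Python only, not PacketFence's actual restart code, which is Perl and locates daemons by name with `pkill`/`pidof`. The function names `stop_with_escalation` and `spawn_stubborn_child` are hypothetical, the sketch assumes a POSIX system and a process handle rather than a name lookup, and the 60-second default simply mirrors the "1 minute (for ex.)" suggested above.

```python
import signal
import subprocess
import sys
import time


def stop_with_escalation(proc, grace_seconds=60.0, poll_interval=0.5):
    """Send SIGTERM, wait up to grace_seconds, then fall back to SIGKILL.

    Returns "SIGTERM" if the process exited within the grace period,
    or "SIGKILL" if the wait timed out and the process had to be killed.
    """
    proc.send_signal(signal.SIGTERM)
    deadline = time.monotonic() + grace_seconds
    # Poll instead of blocking forever, so the wait is always bounded.
    while proc.poll() is None:
        if time.monotonic() >= deadline:
            proc.kill()   # SIGKILL cannot be caught or ignored
            proc.wait()   # reap the child so no zombie is left behind
            return "SIGKILL"
        time.sleep(poll_interval)
    return "SIGTERM"


def spawn_stubborn_child():
    """Hypothetical stand-in for a hung daemon: a child that ignores SIGTERM."""
    code = (
        "import signal, time; "
        "signal.signal(signal.SIGTERM, signal.SIG_IGN); "
        "print('ready', flush=True); "
        "time.sleep(600)"
    )
    child = subprocess.Popen([sys.executable, "-c", code], stdout=subprocess.PIPE)
    child.stdout.readline()  # block until the child has installed its handler
    return child
```

Calling `stop_with_escalation(spawn_stubborn_child(), grace_seconds=1.0)` would escalate after one second instead of looping on "Waiting for pfdhcplistener to stop" for hours as in the logs above. Whether SIGKILL is an acceptable fallback for a given daemon is a design decision this sketch does not settle.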
Tags | No tags attached. | |||||||||
Relationships |
| |||||||||
Attached Files | ||||||||||
Issue History | ||||||||||
Date Modified | Username | Field | Change | |||||||
2010-08-27 12:14 | obilodeau | New Issue | ||||||||
2010-09-15 10:54 | obilodeau | Target Version | 1.9.1 => 1.9.2 | |||||||
2010-09-22 16:02 | obilodeau | Target Version | 1.9.2 => 1.9.3 | |||||||
2012-04-30 09:32 | fgaudreault | Note Added: 0002691 | ||||||||
2012-06-11 15:27 | obilodeau | Relationship added | duplicate of 0001453 | |||||||
2012-06-11 15:27 | obilodeau | Note Added: 0002755 | ||||||||
2012-06-11 15:28 | obilodeau | Status | new => closed | |||||||
2012-06-11 15:28 | obilodeau | Resolution | open => duplicate | |||||||
2012-06-11 15:28 | obilodeau | Fixed in Version | => +1 | |||||||
2012-06-14 12:15 | obilodeau | Target Version | 1.9.3 => 3.4.0 | |||||||
2012-06-14 12:15 | obilodeau | Fixed in Version | +1 => 3.4.0 | |||||||
2012-06-14 12:16 | obilodeau | Note Added: 0002780 |
Notes | |||||