Fixed
Details
Assignee
Chandra GorantlaChandra GorantlaReporter
Will KeaneyWill KeaneyComponents
Sprint
NoneFix versions
Affects versions
Priority
Critical
Details
Details
Assignee
Chandra Gorantla
Chandra GorantlaReporter
Will Keaney
Will KeaneyComponents
Sprint
None
Fix versions
Affects versions
Priority
PagerDuty
PagerDuty
PagerDuty
Created September 20, 2018 at 8:54 PM
Updated December 6, 2018 at 4:55 PM
Resolved November 13, 2018 at 2:26 AM
Some exceptions can cause a Drools engine to stop working entirely, while the Correlator module remains "running". In this state, OpenNMS will not stop cleanly, and must be killed.
One such exception is
java.util.ConcurrentModificationException
:Please improve the Correlator to be able to detect when an engine has failed, and attempt to restart it.
If the engine cannot be started, there should be a notification mechanism, and it should be possible to stop OpenNMS without resorting to "
kill $(cat ${OPENNMS_HOME}/logs/opennms.pid)
".The current state also prevents cluster management software from identifying that part of the application has failed - "service opennms status" still says it's Running.
There should be some way to signal a clustering tool that part of the application has failed, and it should be restarted.