An error in threshd-configuration.xml breaks Collectd when reloading the thresholds configuration

Description

Threshold packages have to be modified manually when you want to add a new package or tune the parameters of an existing one, because there is no WebUI to manage changes to this file.

If the operator introduces an error in this file and sends the reload event for it, Collectd stops gathering data until the error is fixed.

If there is an error in the file, the configuration currently in memory should remain in use until the error is fixed, and the error should not block the operation of Collectd.
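The expected behavior can be illustrated with a minimal sketch. It is not the OpenNMS implementation (OpenNMS is written in Java, and the actual fix is in the commit referenced in the comments); the `ConfigHolder` class, its `reload` method, and the XML element names in the usage note are hypothetical and only show the "keep the last known-good configuration" idea:

```python
import xml.etree.ElementTree as ET


class ConfigHolder:
    """Hypothetical holder that keeps the last successfully parsed config."""

    def __init__(self):
        self.current = None      # last known-good parsed tree
        self.last_error = None   # message from the most recent failed reload

    def reload(self, xml_text):
        """Parse xml_text and swap it in only if parsing succeeds."""
        try:
            candidate = ET.fromstring(xml_text)
        except ET.ParseError as exc:
            # Keep serving the previous configuration and record the problem
            # instead of letting the failure stop the consumer.
            self.last_error = str(exc)
            return False
        self.current = candidate
        self.last_error = None
        return True
```

With this pattern, a call such as `holder.reload("<threshd-configuration><package>")` returns `False` and leaves `holder.current` untouched, so whatever consumes the configuration keeps working with the previous version.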

Even after fixing the file, you have to restart OpenNMS to recover.

I verified this behavior in my test environment running the latest foundation-2016.

Also, this kind of error could be detected if the files could be verified: .
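As a rough illustration of verifying a file before sending the reload event, the sketch below only checks that the file is well-formed XML. It does not validate against the OpenNMS schema and is not the verification mechanism referred to above; the function name `check_well_formed` is an assumption made for this example:

```python
import xml.etree.ElementTree as ET


def check_well_formed(path):
    """Return None if the file parses as XML, otherwise an error message."""
    try:
        ET.parse(path)
    except ET.ParseError as exc:
        # Malformed XML: report the parser's message, which includes the position.
        return f"{path}: {exc}"
    except OSError as exc:
        # Missing or unreadable file.
        return f"{path}: {exc}"
    return None
```

An operator could call `check_well_formed` on the edited file and send the reload event only when it returns `None`. A schema-level mistake (for example, a misspelled attribute) would still pass this check.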

Acceptance / Success Criteria

None

Activity

Alejandro Galue April 13, 2016 at 12:05 PM

Thanks!

I cherry-picked 160f48205344fab2a088a6f49ca06fd71ec7f65b into foundation.

Jesse White April 13, 2016 at 10:33 AM

I removed Meridian-2015.1.2 from the fix version since the PR was targeted to foundation-2016.

Please cherry-pick this to foundation if you want the fix in Meridian-2015.1.2 as well.

Alejandro Galue April 13, 2016 at 9:37 AM

Fixed

Details

Created April 7, 2016 at 11:26 AM
Updated April 13, 2016 at 4:04 PM
Resolved April 13, 2016 at 10:32 AM