Kafka Producer: Sync timing issues cause erroneous deletes
Description
Activity
Jesse White November 26, 2018 at 8:07 PM
PR for foundation-2018: https://github.com/OpenNMS/opennms/pull/2256
PR for release-23.0.2: https://github.com/OpenNMS/opennms/pull/2257
Fixed
Created November 26, 2018 at 2:55 PM
Updated November 28, 2018 at 12:32 AM
Resolved November 28, 2018 at 12:09 AM
If an alarm is created immediately after the synchronization process is started, then the alarm can be erroneously deleted from the ktable.
While the state will be eventually consistent (it is rectified on the next sync), these deletes can cause problems for other systems that integrate with the topics. In the case of OCE, the problem manifested itself as duplicate situations being created.
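The race can be illustrated with a minimal sketch. This is not the code from the PRs above: the class, method names, and alarm keys are hypothetical, and the timestamp guard is just one assumed way to avoid the erroneous delete. The sketch assumes the sync takes a snapshot of the known alarm keys at some time and then deletes every ktable entry that is missing from that snapshot.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration of the sync race; not OpenNMS code.
public class SyncRaceSketch {

    // Naive sync: every key in the table view that is absent from the
    // snapshot is treated as stale and scheduled for deletion.
    static Set<String> naiveDeletes(Set<String> snapshotKeys, Map<String, Long> tableView) {
        Set<String> toDelete = new TreeSet<>(tableView.keySet());
        toDelete.removeAll(snapshotKeys);
        return toDelete;
    }

    // Guarded sync (an assumed mitigation): only delete keys that are missing
    // from the snapshot AND were last updated before the snapshot was taken.
    static Set<String> guardedDeletes(Set<String> snapshotKeys,
                                      Map<String, Long> tableView,
                                      long snapshotTime) {
        Set<String> toDelete = new TreeSet<>();
        for (Map.Entry<String, Long> e : tableView.entrySet()) {
            boolean missingFromSnapshot = !snapshotKeys.contains(e.getKey());
            boolean olderThanSnapshot = e.getValue() < snapshotTime;
            if (missingFromSnapshot && olderThanSnapshot) {
                toDelete.add(e.getKey());
            }
        }
        return toDelete;
    }

    public static void main(String[] args) {
        long snapshotTime = 100L;
        Set<String> snapshot = new HashSet<>(Arrays.asList("alarm-1"));

        // Table view maps alarm key -> last update time.
        Map<String, Long> table = new HashMap<>();
        table.put("alarm-1", 50L);   // present in the snapshot
        table.put("alarm-2", 101L);  // created right after the sync started
        table.put("alarm-3", 10L);   // genuinely stale entry

        System.out.println("naive:   " + naiveDeletes(snapshot, table));
        System.out.println("guarded: " + guardedDeletes(snapshot, table, snapshotTime));
    }
}
```

In this sketch the naive comparison schedules both alarm-2 and alarm-3 for deletion, while the guarded variant only removes alarm-3, leaving the freshly created alarm in place.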
The following log snippet from karaf.log shows this in action: