DetectDuplicate 2.0.0

Bundle
org.apache.nifi | nifi-standard-nar
Description
Caches a value, computed from FlowFile attributes, for each incoming FlowFile and determines whether the cached value has already been seen. If so, routes the FlowFile to 'duplicate' with an attribute named 'original.flowfile.description' that holds the original FlowFile's description, as specified by the FlowFile Description property. Otherwise, the Processor routes the FlowFile to 'non-duplicate'.
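The routing behavior described here can be illustrated with a minimal Python sketch. It is not the Processor's implementation: the `route` function, the plain dict standing in for the cache, and the use of `hash.value` and `filename` as the identifier and description attributes are all assumptions made for illustration. Cache entry expiry is left out.

```python
def route(attributes, cache, id_attr="hash.value", desc_attr="filename"):
    """Return (relationship, attributes) for one FlowFile.

    `cache` is a hypothetical stand-in for the cache service: any mapping
    that supports `in`, item lookup, and item assignment.
    """
    identifier = attributes[id_attr]             # assumed cache entry identifier
    description = attributes.get(desc_attr, "")  # assumed FlowFile description
    try:
        if identifier in cache:
            original = cache[identifier]
            seen = True
        else:
            cache[identifier] = description      # first sighting: remember it
            original = None
            seen = False
    except ConnectionError:
        # Cache unreachable: the FlowFile goes to 'failure'.
        return "failure", attributes

    if seen:
        # Duplicate: attach the description stored for the original FlowFile.
        tagged = dict(attributes)
        tagged["original.flowfile.description"] = original
        return "duplicate", tagged
    return "non-duplicate", attributes
```

Calling `route` twice with the same `hash.value` returns 'non-duplicate' for the first FlowFile and 'duplicate' for the second, with the first FlowFile's description attached to the second.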
Tags
dedupe, dupe, duplicate, hash
Input Requirement
REQUIRED
Supports Sensitive Dynamic Properties
false
Properties
Relationships
Name | Description
duplicate | If a FlowFile has been detected to be a duplicate, it will be routed to this relationship
non-duplicate | If a FlowFile's Cache Entry Identifier was not found in the cache, it will be routed to this relationship
failure | If unable to communicate with the cache, the FlowFile will be penalized and routed to this relationship
Writes Attributes
Name | Description
original.flowfile.description | All FlowFiles routed to the duplicate relationship will have an attribute added named original.flowfile.description. The value of this attribute is determined by the attributes of the original copy of the data and by the FlowFile Description property.
See Also