Crash Deduplication
The process of grouping distinct crash inputs that trigger the same underlying bug into a single unique report.
A fuzzing campaign against a target with one real bug may produce thousands of inputs that all trigger the same crash. Without deduplication, engineers drown in duplicate reports rather than unique bugs. Crash deduplication groups crashes by their root cause, typically by comparing sanitized stack traces: two crashes are considered duplicates if their top N stack frames match. More sophisticated approaches compare the crash address, fault type (read vs. write, stack vs. heap), or use sanitizer output like AddressSanitizer's allocation and deallocation traces to distinguish overlapping bugs. Deduplication is not a solved problem — imperfect heuristics both merge distinct bugs (undercounting) and split variants of the same bug (overcounting). A practical system errs toward splitting (showing more unique groups) and lets humans merge obvious duplicates.