From b936429958f3e682545abf4e9e27211558fb3865 Mon Sep 17 00:00:00 2001 From: Xavier Brochard Date: Fri, 29 Apr 2022 22:03:28 +0200 Subject: [PATCH] short technical explanation about fuzzy hashes Useful to learn more. Comes from SSDEEP documentation https://ssdeep-project.github.io/ssdeep/index.html --- doc/fuzzy_storage.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/doc/fuzzy_storage.md b/doc/fuzzy_storage.md index dc4f2af9c..e11327b7f 100644 --- a/doc/fuzzy_storage.md +++ b/doc/fuzzy_storage.md @@ -13,6 +13,8 @@ Fuzzy hashes are used to search for similar messages – i.e. you can find messa Furthermore, fuzzy hashes are used not merely for textual data but also for images and other attachments types in email messages. However, in this case, rspamd looks for the exact matches to find similar objects. +_Fuzzy hashes is a short term for context triggered piecewise hashes (CTPH). Fuzzy hashes can match inputs that have homologues. Such inputs have sequences of identical bytes in the same order, although bytes in between these sequences may be different in both content and length._ + This article is intended for mail system administrators who wish to create and maintain their own hash storage. ## Step 1: Hash sources selection