How do you differ from simple hashing (anonymization, depersonalization, ID assignment)?
Hashing is a technique for anonymizing personal data in text information. However, hashed data cannot be directly used in machine learning and serves only as a linking entity. Our product, in contrast, protects sensitive data that is actively utilized in machine learning processes.
Classic hashing involves applying a cryptographic hash function to arbitrary data, generating a fixed-length hash value. This transformation is irreversible, akin to passing data through a meat grinder. Currently, there are no known approaches to leverage hashing in machine learning. It does not have the properties we need, unlike our proposed defense methods.
In contrast to hashed data's inapplicability for ML model training, encryption using Fully Homomorphic Encryption (FHE) enables operations on fully protected data throughout all stages: transmission, training, storage, quality evaluation, and result retrieval.