Releases: thoughtworks/pii-anonymizer

Fix - Spark Performance

04 Jan 04:53

Fixed a bug that caused the Spark code path to run indefinitely as the size of the data grew.

Thai Citizen Id Detector

09 Dec 10:14

Added a detector for 13-digit Thai Citizen IDs.
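
As a rough illustration of what such a detector could look like, the sketch below matches 13-digit runs and then applies the standard Thai national ID check-digit rule. The names `THAI_ID_PATTERN`, `has_valid_checksum`, and `find_thai_citizen_ids` are hypothetical, and the release note does not say whether the shipped detector validates the checksum or only matches 13 digits.

```python
import re
from typing import List

# Assumption: an ID is 13 consecutive digits that are not part of a longer digit run.
THAI_ID_PATTERN = re.compile(r"(?<!\d)\d{13}(?!\d)")


def has_valid_checksum(candidate: str) -> bool:
    """Check the 13th digit against the weighted sum of the first 12 digits."""
    digits = [int(ch) for ch in candidate]
    # Weights run from 13 down to 2 across the first 12 digits.
    weighted_sum = sum(d * (13 - i) for i, d in enumerate(digits[:12]))
    return (11 - weighted_sum % 11) % 10 == digits[12]


def find_thai_citizen_ids(text: str) -> List[str]:
    """Return every 13-digit run in the text that passes the checksum."""
    return [m for m in THAI_ID_PATTERN.findall(text) if has_valid_checksum(m)]
```

Checking the digit sum filters out most arbitrary 13-digit numbers, such as order numbers, that a bare regex would flag.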

Multiple inputs for both CSV and Parquet

01 Dec 04:07

Input files are now collected automatically and parsed according to their format.
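
One way to picture format-based dispatch is to group the input paths by file extension before handing each group to the matching reader. This is only a sketch: `SUPPORTED_FORMATS` and `group_by_format` are hypothetical names, and inferring the format from the extension is an assumption rather than something the release note states.

```python
from pathlib import Path
from typing import Dict, Iterable, List

# Assumption: the format is inferred from the file extension.
SUPPORTED_FORMATS = {".csv": "csv", ".parquet": "parquet"}


def group_by_format(paths: Iterable[str]) -> Dict[str, List[str]]:
    """Group input paths by detected format so each group can use its own reader."""
    grouped: Dict[str, List[str]] = {}
    for path in paths:
        fmt = SUPPORTED_FORMATS.get(Path(path).suffix.lower())
        if fmt is None:
            raise ValueError(f"unsupported input format: {path}")
        grouped.setdefault(fmt, []).append(path)
    return grouped
```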

Multiple CSV input

30 Nov 09:23

Added the option to specify a wildcard in file_path.
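
Wildcard expansion of this kind can be sketched with Python's standard `glob` module. The helper name `expand_file_path` is hypothetical, and whether pii-anonymizer uses `glob` internally or delegates the pattern to Spark is not stated in the release note.

```python
import glob
from typing import List


def expand_file_path(file_path: str) -> List[str]:
    """Expand a file_path that may contain wildcards into a sorted list of files."""
    matches = sorted(glob.glob(file_path))
    if not matches:
        # Fail early instead of silently processing nothing.
        raise FileNotFoundError(f"no input files match {file_path!r}")
    return matches
```

With this sketch, a pattern such as `data/*.csv` would resolve to every CSV file directly under `data/`.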

Parquet or CSV as output format

28 Nov 07:33

It's now possible to select the output file format from pii-anonymizer.

Set output_file_format under the anonymize section to "csv" (default) or "parquet".
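
The lookup described above might be resolved roughly as follows, assuming the configuration has already been loaded into a dictionary. `resolve_output_format` and `ALLOWED_OUTPUT_FORMATS` are hypothetical names; only the `anonymize` section, the `output_file_format` key, and its two values come from the release note.

```python
from typing import Any, Dict

ALLOWED_OUTPUT_FORMATS = ("csv", "parquet")


def resolve_output_format(config: Dict[str, Any]) -> str:
    """Read output_file_format from the anonymize section, defaulting to csv."""
    anonymize_section = config.get("anonymize", {})
    output_format = anonymize_section.get("output_file_format", "csv")
    if output_format not in ALLOWED_OUTPUT_FORMATS:
        raise ValueError(f"unsupported output_file_format: {output_format!r}")
    return output_format
```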

Merge drop and redact into replace

22 Nov 09:04

Replaced the drop and redact modes with a single replace mode. When no value is given under the anonymize section, the detected value is dropped by default. Otherwise, the detected value is replaced with the contents of the value field.
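
The merged behavior can be sketched as a single function with an optional replacement value. The email regex here is only a stand-in for the project's real detectors, and `replace_detected` is a hypothetical name.

```python
import re
from typing import Optional

# Stand-in detector for illustration only; the real detectors live in pii-anonymizer.
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def replace_detected(text: str, value: Optional[str] = None) -> str:
    """Drop detected values when no value is given, otherwise substitute it."""
    replacement = "" if value is None else value
    # A function replacement avoids backslash interpretation in the value.
    return EMAIL_PATTERN.sub(lambda match: replacement, text)
```

Calling `replace_detected(text)` reproduces the old drop behavior, while `replace_detected(text, "[Redacted]")` reproduces the old redact behavior.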

Anonymizer mode

22 Nov 07:09

Introduced a mode option for the Anonymizer:

  • Redact (Replace with [Redacted])
  • Drop (Remove PII)