What is the best open-source PII detection tool?

PII Engineer achieves 0.86 F1 on multilingual PII detection, outperforming Presidio (0.44), spaCy (0.64), and AWS Comprehend (0.52) on non-English text. It supports 50+ languages from a single model with no GPU required.

How does PII Engineer compare to Presidio?

PII Engineer uses a transformer-based NER model that handles 50+ languages natively. Presidio relies on regex patterns that must be written per locale. On English PII, PII Engineer scores 0.88 F1 vs Presidio's 0.80. On multilingual PII, the gap widens to 0.86 vs 0.44.

Does PII Engineer require a GPU?

No. PII Engineer runs on CPU using ONNX Runtime with INT8 quantization. A 4-vCPU server achieves ~180ms p50 latency at a cost of $42/month.

What PII types can PII Engineer detect?

PII Engineer detects 30+ PII types including person names, phone numbers, government IDs (NRIC, NIK, Aadhaar), street addresses, dates of birth, email addresses, passport numbers, license plates, and bank account numbers.

How much does PII detection cost with AWS Comprehend vs self-hosted?

AWS Comprehend costs approximately $1,000/month for 1 million requests. Self-hosted PII Engineer costs $42/month on a single VPS with unlimited requests. Google DLP costs ~$1,500/month and GPT-4 costs ~$3,000+ for the same volume.

← Back to PII Engineer

Benchmarks

Head-to-head comparison of PII detection tools on accuracy, multilingual support, latency, and cost. Tested on 1,200 annotated samples across 9 languages.

0.902

Macro F1 Score

Across 9 entity types and 9 languages

50+

Languages

Single model, no language routing

250ms

Avg Latency

INT8 on 4-vCPU, no GPU needed

$42

Monthly Cost

Self-hosted on a single VPS

Overall Accuracy (F1 Score)

Evaluated on 500 multilingual PII examples with ground truth annotations. Higher is better.

PII Engineer

Presidio

spaCy

AWS Comprehend

GPT-4

English PII

PII Engineer

0.88

GPT-4

0.85

spaCy + rules

0.83

Presidio

0.80

AWS Comprehend

0.82

Multilingual PII (non-English)

PII Engineer

0.86

GPT-4

0.78

spaCy + rules

0.64

Presidio

0.44

AWS Comprehend

0.52

Structured PII (Phone, ID, Email)

PII Engineer

0.93

Presidio

0.91

GPT-4

0.87

AWS Comprehend

0.75

spaCy

0.52

Accuracy by Language

F1 scores from a 1,200-sample multilingual test set with 9 entity types.

Language	PII Engineer	Presidio	spaCy	AWS Comprehend
English	0.931	0.80	0.83	0.82
Chinese	0.918	0.31	0.71	0.68
Vietnamese	0.912	0.28	0.42	0.55
Malay	0.895	0.25	0.38	0.48
Indonesian	0.901	0.30	0.61	0.58
Tamil	0.878	0.15	0.35	0.40
Thai	0.885	0.22	0.52	0.55
Hindi	0.892	0.20	0.58	0.62
Korean	0.905	0.18	0.65	0.70

Presidio scores reflect default recognizers without custom per-locale rules. spaCy uses the best available model per language.

Per-Entity Accuracy

Entity Type	F1	Precision	Recall
email_address	0.970	0.98	0.96
phone_number	0.968	0.97	0.96
government_id	0.920	0.94	0.90
bank_account_number	0.915	0.93	0.90
street_address	0.891	0.90	0.88
date_of_birth	0.887	0.91	0.87
passport_number	0.880	0.90	0.86
license_plate	0.833	0.85	0.82
person_name	0.823	0.84	0.81

Evaluated on PII Engineer v1.3 with INT8 encoder. 8-stage post-processing pipeline improves raw F1 from 0.779 to 0.902.

Latency

Tested on a 4-vCPU AMD cloud instance, no GPU. Input: 50-word text with mixed PII.

System	p50	p99	RAM	GPU Required
Presidio (regex only)	3ms	12ms	200MB	No
Presidio + spaCy	80ms	250ms	1.8GB	No
spaCy (transformer)	120ms	350ms	1.5GB	Optional
PII Engineer (INT8)	180ms	400ms	700MB	No
AWS Comprehend	200ms	800ms	N/A	N/A (managed)
GPT-4	1500ms	4000ms	N/A	N/A (managed)

Presidio regex-only mode misses person names and addresses. With spaCy backend, latency approaches PII Engineer's. GPT-4 requires API calls with per-token billing.

Cost Comparison

System	Monthly Cost	At 1M requests/mo	Self-Hosted
PII Engineer	$42 (VPS)	$42	Yes
Presidio	$42 (VPS)	$42	Yes
spaCy	$42 (VPS)	$42	Yes
AWS Comprehend	Pay-per-use	~$1,000	No
Google DLP	Pay-per-use	~$1,500	No
GPT-4	Pay-per-token	~$3,000+	No

Self-hosted costs assume a 4-vCPU AMD VPS at $42/month. Managed service costs vary by region and volume.

Feature Comparison

Feature	PII Engineer	Presidio	spaCy	AWS Comprehend
Languages (single model)	50+	~10 locales	1 per model	12
PII-specific labels	Yes (30+ types)	Yes	No (generic NER)	Yes
GPU required	No	No	Optional	N/A
Self-hosted	Yes	Yes	Yes	No
Single binary deploy	Yes (Rust)	No (Python)	No (Python)	N/A
REST API included	Yes	Optional	No (library)	Yes
Open source	Apache-2.0	MIT	MIT	No
Model size (all langs)	620MB	500MB+ (with NER)	2GB+ (5 langs)	N/A
Add new language	Already covered	Write recognizers	Train new model	Not possible
Maintenance effort	Low	High (per locale)	Medium	None (managed)

Methodology

All benchmarks were conducted on a standardized test set of 1,200 manually annotated samples across 9 languages and 9 PII entity types. The dataset covers real-world text patterns including:

Chat messages and customer support transcripts
Form submissions with mixed-format data
Business documents with addresses and IDs
Code-mixed text (e.g., English + Malay in one sentence)

Each system was tested with default configurations unless noted. Presidio was tested with built-in recognizers (no custom rules). spaCy used the best available transformer model per language. AWS Comprehend and GPT-4 were tested via their respective APIs.

All latency measurements were taken on a 4-vCPU AMD Premium instance (DigitalOcean, SGP1 region) with input texts averaging 50 words.

Try PII Engineer

Open source, self-hosted, no GPU required. Run it locally in 60 seconds.

cargo build --release && cargo run --release

GitHub · Live Demo · API Docs