Blog

Apr 22, 2026

OpenAI Open-Sources Privacy Filter, a Tiny Model That Scrubs PII Without an API Call

OpenAI released Privacy Filter under Apache 2.0 — a 1.5B-parameter (50M active) bidirectional token-classification model that detects and masks PII in text locally, in a single forward pass. It runs on a laptop, supports 128K context, hits 96% F1 out of the box, and is fine-tunable with minimal data. Eight entity categories: names, addresses, emails, phones, URLs, dates, account numbers, and secrets. It's context-aware (not regex), ships with a CLI and eval tooling, and slots into the same open-weight ecosystem as gpt-oss. The catch: multilingual support is thin, adversarial formatting breaks it, and the benchmark validation used OpenAI's own models to grade OpenAI's model.

Source: HackerNoon →


Share

BTCBTC
$80,641.00
1.21%
ETHETH
$2,282.14
2.32%
USDTUSDT
$1.000
0%
BNBBNB
$665.43
0.49%
XRPXRP
$1.44
2.32%
USDCUSDC
$1.000
0%
SOLSOL
$94.65
2.58%
TRXTRX
$0.349
0.53%
FIGR_HELOCFIGR_HELOC
$1.04
0.73%
DOGEDOGE
$0.110
0.8%
WBTWBT
$59.15
1.49%
USDSUSDS
$1.000
0.01%
ADAADA
$0.272
2.79%
HYPEHYPE
$40.46
3.55%
ZECZEC
$570.96
2.76%
LEOLEO
$9.99
2.8%
BCHBCH
$439.86
2.26%
XMRXMR
$411.65
1.65%
LINKLINK
$10.31
2.29%
TONTON
$2.31
4.65%