OpenAI Open-Sources Privacy Filter, a Tiny Model That Scrubs PII Without an API Call

OpenAI released Privacy Filter under Apache 2.0 — a 1.5B-parameter (50M active) bidirectional token-classification model that detects and masks PII in text locally, in a single forward pass. It runs on a laptop, supports 128K context, hits 96% F1 out of the box, and is fine-tunable with minimal data. Eight entity categories: names, addresses, emails, phones, URLs, dates, account numbers, and secrets. It's context-aware (not regex), ships with a CLI and eval tooling, and slots into the same open-weight ecosystem as gpt-oss. The catch: multilingual support is thin, adversarial formatting breaks it, and the benchmark validation used OpenAI's own models to grade OpenAI's model.

Source: HackerNoon →

Blog

OpenAI Open-Sources Privacy Filter, a Tiny Model That Scrubs PII Without an API Call

Category

Related News

Will Ghostwriters Be Replaced by AI?

$NXT Launches on OKX Boost, KuCoin, MEXC, and LBank Bringing AI-Powered Global E...

The Machine Shows the Victims, But Hides Who Caused the Suffering

AI Isn’t “Inspired” by Human Writing. It Is Built on Unpaid Intellectual Labor.

AI Is Making Crypto Wallet Deanonymization Much Cheaper

Top Category

Blog

OpenAI Open-Sources Privacy Filter, a Tiny Model That Scrubs PII Without an API Call

Category

Share

Related News

Will Ghostwriters Be Replaced by AI?

$NXT Launches on OKX Boost, KuCoin, MEXC, and LBank Bringing AI-Powered Global E...

The Machine Shows the Victims, But Hides Who Caused the Suffering

AI Isn’t “Inspired” by Human Writing. It Is Built on Unpaid Intellectual Labor.

AI Is Making Crypto Wallet Deanonymization Much Cheaper

Top Category