PrivaCI
One command. Sanitized staging data. No data leaves your VPC.
PrivaCI is a stateless batch engine that reads from a PostgreSQL source (typically a production replica), masks PII with a three-tier pipeline, and writes realistic synthetic data to a staging database with referential integrity preserved.
Start here
| Goal | Page |
|---|---|
| Run your first mask in under five minutes | Quickstart |
| Every CLI flag and subcommand | CLI reference (generated) |
| Author mask-rules.yaml | Configuration guide |
| Deploy with Docker or Helm | Deployment |
| Extend the engine with plugins | Building a plugin |
How it works
- Introspect the source schema (tables, FKs, partitions, implied keys).
- Replicate DDL to an empty target database.
- Stream rows through COPY-binary, mask in memory, and load in FK order.
- Checkpoint every batch so a crashed run can resume.
- Audit every decision in _privaci tables on the target.
See the architecture overview for design rationale.
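Loading "in FK order" means that every referenced (parent) table is loaded before the tables that point at it. The sketch below illustrates that idea with a topological sort from Python's standard library. It is not PrivaCI's actual implementation: the `load_order` function, the shape of its input, and the example table names are all hypothetical.

```python
from graphlib import TopologicalSorter


def load_order(foreign_keys: dict[str, set[str]]) -> list[str]:
    """Order tables so each referenced (parent) table comes before its children.

    `foreign_keys` is a hypothetical mapping: table name -> set of tables it
    references through foreign keys.
    """
    # Drop self-references (e.g. employees.manager_id -> employees): a table
    # cannot be loaded before itself, so they carry no ordering information.
    graph = {
        table: {parent for parent in parents if parent != table}
        for table, parents in foreign_keys.items()
    }
    # static_order() yields predecessors first and raises graphlib.CycleError
    # if two or more tables reference each other in a loop.
    return list(TopologicalSorter(graph).static_order())


# Hypothetical schema, used only for illustration.
schema = {
    "customers": set(),
    "products": set(),
    "orders": {"customers"},
    "order_items": {"orders", "products"},
    "employees": {"employees"},
}

order = load_order(schema)
print(order)
```

The exact order among unrelated tables is not fixed; only the parent-before-child guarantee matters. A real engine also has to decide what to do with genuine FK cycles between tables (for example, deferring constraints), which this sketch simply reports as an error.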
License
The engine is licensed under the Elastic License 2.0. Optional paid features ship as a separate plugin layer.