What is Redaction?
Redaction is a permanent privacy protection method that completely removes sensitive data from text. The detected sensitive information is deleted and cannot be restored. Example:How It Works
- Detection: Blindfold identifies sensitive entities in your text
- Complete Removal: Each detected entity is completely removed from the text
- Permanent: Original values are discarded and cannot be recovered
- Clean Output: Text flows naturally with sensitive data removed
When to Use Redaction
Redaction is ideal when you need to:1. Permanent Data Anonymization
Remove PII from logs, support tickets, or archives that will be stored long-term.- Compliant long-term storage
- No risk of data breach exposing PII
- Meets “right to be forgotten” requirements
2. Third-Party Analytics
Share data with analytics platforms without exposing sensitive information.- Google Analytics
- Mixpanel, Amplitude
- Custom analytics platforms
- Business intelligence tools
3. Public Disclosure
Prepare data for public release or legal disclosure.4. Log Sanitization
Remove sensitive data from application logs.5. GDPR Compliance
Implement “right to be forgotten” by permanently removing user data.When NOT to Use Redaction
Redaction is not suitable when:1. You Need to Restore Data Later
Redaction is permanent. Use Tokenization instead.2. Users Need to Identify the Data
If users need to recognize their own data, use Masking.3. You Need Consistent Identifiers
For analytics with user tracking, use Hashing.Key Features
Permanent Removal
Data is completely removed and cannot be recovered
Complete Deletion
Sensitive text is deleted, not replaced
GDPR Compliant
Meets data minimization requirements
50+ Entity Types
Removes all detected PII types
Quick Start
- Python
- JavaScript
- cURL
Configuration Options
Filter Specific Entity Types
Only redact specific types of sensitive data:Adjust Confidence Threshold
Control detection sensitivity:Common Patterns
Log Sanitization
Automatically redact logs before storage:Support Ticket Archival
Redact tickets before long-term storage:Analytics Event Tracking
Send events to analytics without PII:Common Use Cases
Compliance Logs
Compliance Logs
Maintain audit logs without storing PII:Benefits: Audit trail maintained, no PII storage, GDPR compliant
Customer Feedback
Customer Feedback
Collect feedback without storing customer PII:Benefits: Feedback preserved, PII removed, safe for analysis
Error Reports
Error Reports
Share error reports without exposing user data:Benefits: Developers get context, user privacy protected
Public Dataset Creation
Public Dataset Creation
Create shareable datasets from sensitive data:Benefits: Data useful for research, no privacy violations
Best Practices
1. Redact Early
Redact sensitive data as early as possible in your pipeline:2. Log What Was Redacted
Keep audit trails of redaction events:3. Review Redaction Policies
Regularly review what gets redacted:4. Combine with Other Methods
Use redaction alongside other privacy methods:Security Considerations
Learn More
Python SDK
Full Python SDK documentation
JavaScript SDK
Complete JavaScript guide
REST API
HTTP API reference for /redact
Examples
Practical integration examples