Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
36 changes: 31 additions & 5 deletions chapters/limits-and-specifications/data-retention.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
description: "Data retention policies and zero data behavior"
---

To provide transcription services, Gladia processes several types of data:

Check warning on line 6 in chapters/limits-and-specifications/data-retention.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/limits-and-specifications/data-retention.mdx#L6

Did you really mean 'Gladia'?
- **Audio input**: Audio files or audio streams provided for transcription
- **Transcription output**: Text, timestamps, words, utterances
- **API Metadata**: Request IDs, timestamps, processing status
Expand All @@ -18,15 +18,41 @@
Only Enterprise users are eligible for custom data retention and the zero data retention option.
</Info>

To enable usage tracking, Gladia retains essential API metadata: request ID, timestamp, processing status and audio duration. Immutable logs are also maintained, for a limited period, to ensure service quality and reliability.

Check warning on line 21 in chapters/limits-and-specifications/data-retention.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/limits-and-specifications/data-retention.mdx#L21

Did you really mean 'Gladia'?

## Zero Data Retention behavior

When Zero Data Retention is enabled, Gladia processes data ephemerally; no data is stored at rest.

Check warning on line 25 in chapters/limits-and-specifications/data-retention.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/limits-and-specifications/data-retention.mdx#L25

Did you really mean 'Gladia'?

- **No audio files are stored**: Files cannot be retrieved through the API or in the [Gladia's playground](https://app.gladia.io/?utm_campaign=197609689-Website&utm_source=documentation&utm_content=sign-up). File upload is also disabled; the asynchronous API must use an external audio file url, such as S3 presigned url.
- **No transcripts are stored**: Transcription results cannot be retrieved through the API and are not visible in the [Gladia's playground](https://app.gladia.io/?utm_campaign=197609689-Website&utm_source=documentation&utm_content=sign-up).
- **No metadata retrieval**: Transcription API calls, audio duration, and other metadata cannot be retrieved through the API or in the [Gladia's playground](https://app.gladia.io/?utm_campaign=197609689-Website&utm_source=documentation&utm_content=sign-up).
- **Transcription results delivered only via callbacks**: The only way to receive transcription results is through callbacks; they cannot be retrieved by any other means.
<Warning>
Enabling Zero Data Retention is a **breaking change** if your integration relies on file upload ([`/v2/upload`](/api-reference/v2/upload/audio-file)) or result retrieval via polling ([`GET /v2/pre-recorded/:id`](/api-reference/v2/pre-recorded/get)). Review the restrictions below and update your integration **before** enabling ZDR.
</Warning>

Once the result is delivered, the audio, transcript, and metadata cannot be accessed.
### Disabled endpoints

- **File upload is disabled**: The [`/v2/upload`](/api-reference/v2/upload/audio-file) endpoint is fully disabled. Any request to this endpoint will fail. The asynchronous API must use an external audio file URL (e.g. an S3 presigned URL or any publicly accessible URL) passed directly as `audio_url` to [`POST /v2/pre-recorded`](/api-reference/v2/pre-recorded/init).

Check warning on line 33 in chapters/limits-and-specifications/data-retention.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/limits-and-specifications/data-retention.mdx#L33

Did you really mean 'presigned'?
- **Result polling is disabled**: Transcription results cannot be retrieved via [`GET /v2/pre-recorded/:id`](/api-reference/v2/pre-recorded/get). The only way to receive results is through [callbacks](/api-reference/v2/pre-recorded/callback/success).

### No data stored

- **No audio files are stored**: Files cannot be retrieved through the API or in [Gladia's playground](https://app.gladia.io/?utm_campaign=197609689-Website&utm_source=documentation&utm_content=sign-up).
- **No transcripts are stored**: Transcription results are not visible in the API or in [Gladia's playground](https://app.gladia.io/?utm_campaign=197609689-Website&utm_source=documentation&utm_content=sign-up).
- **No metadata retrieval**: Transcription API calls, audio duration, and other metadata cannot be retrieved through the API or in [Gladia's playground](https://app.gladia.io/?utm_campaign=197609689-Website&utm_source=documentation&utm_content=sign-up).

Once the result is delivered via callback, the audio, transcript, and metadata cannot be accessed.

### Migration checklist

Before enabling Zero Data Retention, make sure your integration meets the following requirements:

<Steps>
<Step title="Host audio files externally">
Replace any usage of `/v2/upload` with an external storage provider. Pass a publicly accessible or signed URL (e.g. AWS S3 presigned URL, GCS signed URL) as `audio_url` when creating a transcription job.

Check warning on line 50 in chapters/limits-and-specifications/data-retention.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/limits-and-specifications/data-retention.mdx#L50

Did you really mean 'presigned'?
</Step>
<Step title="Set up a callback endpoint">
Configure a callback URL in your transcription requests to receive results, since polling and the playground will not be available. See [callback configuration](/api-reference/v2/pre-recorded/init) for setup details.
</Step>
<Step title="Remove polling and GET calls">
Remove any code that retrieves transcription results via `GET /v2/pre-recorded/:id` or relies on the Gladia playground for debugging. Results are delivered exclusively through callbacks.

Check warning on line 56 in chapters/limits-and-specifications/data-retention.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/limits-and-specifications/data-retention.mdx#L56

Did you really mean 'Gladia'?
</Step>
</Steps>