Data Locker for marketers

Premium

For:

Marketers Marketers Developers

Last update: July 14, 2026 11:12

At a glance: Data Locker sends your report data to cloud storage for loading into your BI systems. You can select between different storage destinations: An AppsFlyer-owned bucket in AWS, or storage owned by you in AWS, GCS, Yandex, BigQuery, and Snowflake. Data Locker supports multiple destinations. That means you can send all data to multiple destinations, segregate data by destination, or a combination of both.

Overview

In Data Locker select your apps, media sources, events, and reports to include in the data AppsFlyer delivers to your selected cloud storage options. Then, load data programmatically from the storage into your systems.

Data Locker—features

Feature	Description
Storage options (cloud)	Data Locker can send your data to any of the following cloud service providers: Storage owned by AppsFlyer on AWS Storage owned by you on: AWS GCS Azure Blob [Beta] Yandex BigQuery Snowflake You can set more than 1 destination in Data Locker. This means that you can send all or some of your data to multiple destinations. Examples Segregate data by report type. Send raw data to GCS and aggregate data to Snowflake. Segregate data by app and send the data per app group to different buckets.
Multi-app	Send data of 1, more, or all apps in your account. When you add apps to the account, they can be automatically included.
Availability window	14 days
Data segregation	Available data segregation options (relevant for bucket cloud storage): [Default] Unified: Data of all apps combined. The row-level app ID field is used to identify the app in data files. Segregated by app: Data of each app is in a separate folder. The folder name consists of the app ID.
Data format options	For bucket cloud storage: CSV Parquet Adobe Data warehouse
Data freshness	Freshness depends on the report type Hourly: Data generated continuously; for example, installs and in-app event data are written within hours of the event arriving in AppsFlyer. Daily: Reports like uninstalls, are generated daily and are ready on the following day. Versioned: If the same report is generated multiple times for the same time period a versioning mechanism is in place.

Reports available via Data Locker

Set Data Locker report settings

To configure Data Locker, follow these steps to connect your cloud service, define export settings, and customize report content:

1. Set up your cloud service

You can connect your Data Locker to one or more cloud service providers. See the following for instructions on how to configure them to work with Data Locker:

AWS bucket
GCS bucket
Azure Blob
[Beta] Yandex bucket
BigQuery data warehouse
Snowflake data warehouse

Note! If you don't have a Data Locker subscription and you access Cohorts analytics or SKAN data, you must still complete a marketer-owned cloud storage service procedure.

2. Add a connection to your cloud service

After configuring your cloud service account to work with Data Locker (see "Set up your cloud service" above), create a connection in Data Locker using the credentials from your account. You can create up to two connections.

Note

By default, each account can open up to 2 Data Locker connections. If you need additional connections, contact your Customer Success Manager (CSM).

To create a connection for your cloud provider perform the following steps:

In AppsFlyer, from the sidebar, go to Exports > Data Locker.
On the right-hand side, click New connection.
In Connection name enter the name for your connection. Use only lowercase letters, digits, and hyphens.
Click the icon of the cloud service to which you want to connect.
Depending on the service you selected, enter the following connection information.
AWS cloud bucket connection
Before setting the AWS connection, create an AWS bucket. To learn, how see here.

To set the connection:
1. Enter your AWS S3 bucket name. af- prefix is mandatory, and should be entered manually.
2. Click Test connection.
3. Verify that an error message indicating that the bucket path is invalid isn't displayed.
4. Select whether to Make this connection compatible with Adobe Experience Platform. If selected, click Save and continue to select global-level filters.
5. Click Save.
GCS cloud bucket connection
Before setting the GCS connection, create a bucket on GCS. To learn how see here.

To set the connection:
1. Enter your GCS bucket name.
2. Click Test connection.
3. Verify that an error message indicating that the bucket path is invalid isn't displayed.
4. Select whether to Make this connection compatible with Adobe Experience Platform. If selected, click Save and continue to select global-level filters.
5. Click Save.
Azure cloud bucket connection
Before setting the Azure connection, open a storage account in Azure. To learn how, see here.

To set the connection:
1. Enter your Connection name, Storage account name, and Key.
2. Verify that an error message indicating that the bucket path is invalid isn't displayed.
3. Select whether to Make this connection compatible with Adobe Experience Platform. If selected, click Save and continue to select global-level filters.
4. Click Save.
Yandex Cloud bucket connection
Before setting the AWS connection, create a service account in Yandex. To learn how, see here.

To set the connection:
1. Enter your Bucket name, Access key, and Secret key.
2. Verify that an error message indicating that the bucket path is invalid isn't displayed.
3. Select whether to Make this connection compatible with Adobe Experience Platform. If selected, click Save and continue to select global-level filters.
4. Click Save.
BigQuery data warehouse connection
Before setting the BigQuery connection, create a dataset in BigQuery. To learn how, see here.

To set the connection:
1. Enter your BigQuery project ID and dataset name.
2. Click Test connection.
3. Verify that an error message indicating that the bucket path is invalid isn't displayed.
4. Click Save and continue to select global-level filters.
Snowflake data warehouse connection
Before setting the Snowflake connection, open an account in Snowflake. To learn how, see here.

To set the connection:
1. Enter your Snowflake region and account ID.
2. Click Test connection.
3. Verify that an error message indicating that the bucket path is invalid isn't displayed.
4. Click Save and continue to select global-level filters.
Click Save. The Report output settings section is displayed.

Note

You can use the audit log to confirm whether a connection was created, updated, or deleted, and who made the change.

3. Set the report output settings

After setting the connection with the cloud service, you can continue to set the general settings of your Data Locker reporting outputs. If your cloud service is BigQuery or Snowflake, you can skip this step.

Under the Report output settings section, select the folder structure (data segregation):
- Unified (default): The report files include records from all the apps.
- Segregated by app: Each report file is dedicated to one app.
Select the reports file format: Parquet (default) or CSV.
Select the report's file compression type:
- Snappy (only available for Parquet files)
- GZIP
Select the maximum row number you want in your file: Either 10k, 25k, 50, 100k, 200k, or 500k. More rows in the file mean fewer files but a larger file size.

Note

Under Expected path, view the path patterns for your reports. Note: The real path may be different than what is displayed.

4. Select global-level filters

The global-level filters allow you to filter your reports by apps, geo, or media sources. These filters apply to most of the reports in your Data Locker account, but you can also set them at the report level (see 7-select-the-reportlevel-filters below). If the same filter is applied on both levels, the report-level filter takes precedence.

To apply a filter, perform the following:

In the Reports section, click the filter and select the items to include in the report. For example, click the Apps filter and select the apps to include in the reports.
Then click the Enter (⏎) button.

5. Select the report group

Select the reports that you want to get in your cloud service. The reports are listed in groups. Clicking on the report group name expands or collapses the groups.

To select a report, click to expand the report group. For each report in the group, the following information is presented:
- Report Name: The title of the report.
- Dataset Name: The name of the dataset that contains the report's records.
- Data Freshness: How often the report is updated with new records (e.g., hourly, daily, or versioned).
- Fields: the number of fields (or columns) that you selected for the report compared to the total number of fields available for selection.

6. Customize or duplicate the report

After selecting one or more reports from a report group, you can choose to either customize the original report or create a separate customized copy by duplicating it. This allows you to tailor the report’s fields and filters to your specific needs without affecting the original version.

Option A: Customize the original report

Click the Customize button next to the report name.
The report editor will open, allowing you to select fields and apply filters.
Changes will be saved to the original report configuration.

Note: This modifies how the report is delivered to your cloud storage.

Option B: Duplicate the report

Click next to the report name.
Select Duplicate from the dropdown menu.
A copy of the report will be created, named with the prefix copy_of_.
The duplicated report opens in edit mode for further customization.

Tip: Duplicating is ideal for creating variations of a report for different use cases.

After choosing whether to customize or duplicate, proceed to configure the report fields as described in the next step.

7. Select the report fields

Once you’ve chosen to customize or duplicate a report, the next step is to define which data fields should be included. Each report provides a complete set of available fields, and you can customize your selection to include only what’s relevant for your analysis or integration. By default, all fields are selected, but you can refine the report by manually choosing specific ones.

To select the fields to include in the report:

In the selected report dialog, under the Fields tab, hover over any field to view its description.
Check the fields you want to include in the report, or uncheck the fields you wish to exclude from the report.
Click Apply to save your settings.

Copy field selection from another report

You can copy the field selection from another report as a starting point and then continue to select or deselect fields to fine-tune the report.

In the Fields tab, deselect any random field.
Click Pull schema from report.
Select the report you want to copy the field selection from.
Continue to select or deselect fields.
To restore the report's original field selection, click Refresh.

8. Select the report-level filters

The report-level filters enable you to filter a single report by apps, media sources, or other dimensions. You can also set filters that apply to all the reports in your account; see select global-level filters. By default, the report-level filters are set to the global-level filter settings, but you can update them to custom settings that apply only to the selected report.

To select the filters to apply to a specified report:

Hover on the specific report that you want to customize.
Click to open the actions menu, and select Edit report.
Open the Filters tab. The filters are set to the global-level filter settings.
Click the filter and select the items to include in the report. For example, click the Apps filter and select the apps to include in the reports.
Click the Enter (⏎) button. Your selection overrides the global-level settings.
(Optional) For the Inapps report, you can set the In-app event filter. Enter their names exactly to select them.
Click Apply to save your settings.

9. Remove legacy fields

Legacy fields are those that were previously included in the report schema but are now excluded. We recommend removing these fields to ensure your report contains only relevant information. Before making any changes, make sure that your workflows and integrations do not depend on them.

To remove specific legacy fields

Open the Legacy fields tab.
Turn on: Include legacy fields in the report.
Deselect the fields you want to exclude.
Click Apply.
Save the connection settings.

To remove all legacy fields:

Open the legacy fields tab.
Turn off: Include legacy fields in the report.

Note

If you want to include legacy fields in the report but can't because the legacy fields list is grayed out and locked, contact your Customer Success Manager.

Non-empty legacy fields

Most legacy fields are empty or null. However, a few of them contain values but are still considered legacy because either:

They appear in the report under a different name (renamed).
They were excluded from the report schema (deprecated).

Download the non-empty legacy fields list (CSV).

10. Save the connection

Click Save, and the first data dump will be written to your cloud service within 3 hours. Subsequent data update schedules are specific to each report.

Important!

Any changes to Data Locker settings take up to 3 hours to take effect.

Set user permissions

Both admins and team members, with the correct permissions, can access Data Locker.

Admins can access the Data Locker page, create and manage all connections, add editors, and assign owners to existing connections.
Team members can access the Data Locker page, edit existing connections that they own, or create new connections. They cannot manage ownership (set the owner and editors), even if they are the account owners.

Manage Data Locker access and ownership (Admins)

Give Data Locker access

To provide a team member permission to access Data Locker, assign them a role with Data Locker set to Manage.

Manage ownership and editors

To transfer ownership or add a team member as an editor on an existing connection:

Click the three-dot options menu within the connection.
Select Manage ownership.
Change the connection owner or add an editor.

Review filters after changing ownership

If you change the owner of a Data Locker connection to a user with data access restrictions (for example, limited by app, media source, agency, or country), existing global-level or report-level filters may be reset to match the new owner’s permissions.

After transferring ownership, review the filters and confirm the connection reflects the intended data scope.

For more information about data access restrictions, see User management > Data access.

Note

In AF AWS connections, there is no Manage ownership option even for Admins, since only Admins can edit, delete, or create new AF AWS connections. This precaution is taken to prevent the exposure of the bucket credentials.

Data storage architecture

Overview

The structure of your data in storage depends on whether the data is sent to cloud storage or a data warehouse. The folder structure described here applies to storage (buckets). For data warehouse storage, consider that the reference to folders applies to views.

Data is written to your selected storage option. In the case of cloud storage, the storage is owned by AppsFlyer on AWS or owned by you on AWS, GCS, or Yandex. You can switch storage options at any time or send some or all of your data to multiple storage options.

Data in the cloud bucket storage is organized in a hierarchical folder structure, according to report type, date, and time. The following figure contains an example of this structure:

Data of a given report is contained in the hour (h) folders associated with that report:

The number of hour folders depends on the report data freshness (hourly, daily or versioned).
Data is provided in Snappy or GZIP compressed files, or uncompressed files, having Parquet or CSV format.
Data files consist of columns (fields).
The schema (field) structure of the user journey reports is identical to each other and depends on the fields selected by you. Other reports each have their own explicit fields, AKA schemaless reports. See Data Locker marketer reports for the reports available and links to the report specifications.

Folder structure

App segregation

For bucket cloud storage, data is provided in unified data files containing the data of all apps selected or segregated into folders by app. The segregation is within the h folder as described in the table that follows.

Data files

Data files depend on segregation type.

Content	Details
Completion flag	The last file (completion) flag is set when all the data for a given h folder has been written. Don't read data in a folder before verifying that the _SUCCESS flag exists. The _SUCCESS flag is set even in cases where there is no data to write to a given folder and the folder is empty. Note! In the segregation by app option, the flag is set in the h folder and not the individual app folders. See the figures in the previous section.
File types	Data is provided in Snappy or GZIP compressed files, or uncompressed files, having Parquet or CSV format. After unzipping, the data files are in Parquet or CSV format according to your settings.
Column sequence (CSV files)	In the case of CSV files, the sequence of fields in reports is always the same. When we add new fields these are added to the right of the existing fields. In this regard: The column structure of user journey reports is identical. This means you can have similar data-loading procedures for different report types. You select the fields contained in the reports. The field meaning is detailed in the raw data dictionary. Reports having an FF notation in the report availability section don't adhere to the common column structure.
Field population considerations	Blank or empty fields: Some fields are populated with null or are empty. This means that in the context of a given report there is no data to report. Typically null means this field is not populated in the context of a given report and app type. Blank "" means the field is relevant in its context but no data was found to populate it with. In the case of the restricted media source, the content of restricted fields is set to null. Overall regard null and blank as one and the same thing; there is no data available. Time zone and currency App-specific time zone and currency settings have no effect on data written to Data Locker. The following apply: Time zone: Date and hour data are in UTC. Currency: The field event_revenue_usd is in USD. Values with commas: These commas are contained between double quotes `"`, for example, `"iPhone6,1"`.

Storage options

Caution!

If you are using the marketer-owned storage option:

Verify that you comply with data privacy regulations like GDPR and ad network/SRN data retention policies.
Don't use the marketer-owned storage solution to send data to third parties.

Data is written to a storage owner of your choice as follows:
- AppsFlyer storage
- Customer storage—AWS, GCS, Azure, Yandex, BigQuery, and Snowflake
You can change the storage selection at any time.
If you change the storage, the following happens:
- We start writing to the newly selected storage within one hour.
- We continue writing to the existing storage during a transition period of 7 days. The transition period expiry time displays in the user interface. Use the transition period to update your data loading processes. You can restart the transition period or revert to the AppsFlyer bucket if needed.
- Changing storage: You can migrate from one storage option to another by using the multi-storage option and sending data to multiple destinations simultaneously. Once you have completed the migration and testing, delete the storage option you no longer need.

	AppsFlyer-owned storage (AWS)	Marketer-owned storage (GCS, AWS, Azure, Yandex, BigQuery, Snowflake)
Bucket name	Set by AppsFlyer	GCS: No restriction AWS: Set by you. Must have the prefix af-. Example: `af-datalocker-your-bucket-name`
Storage ownership	AppsFlyer	Marketer
Storage platform	AWS	AWS, GCS, Azure, Yandex, BigQuery, Snowflake
Credentials to access data by you	Available in the Data Locker user interface to your AppsFlyer account admins	Not known to AppsFlyer. Use credentials provided by the cloud provider.
Data retention	Data is deleted after 14 days	Marketer responsibility
Data deletion requests	AppsFlyer responsibility	Marketer responsibility
Security	AppsFlyer controls the storage. The customer has read access.	The marketer controls the storage. AWS: AppsFlyer requires GetObject, ListBucket, DeleteObject, PutObject permission to the bucket. The bucket should be dedicated to AppsFlyer use. Don't use it for other purposes. GCS: See GCS configuration article.
Storage capacity	Managed by AppsFlyer	Managed by the marketer
Access control using VPC endpoints with bucket policies	Not Applicable	[Optional] In AWS, if you implement VPC endpoint security at the bucket level, you must allowlist AppsFlyer servers.

Notice to security officers in the case of customer-controlled storage

Consider:

The bucket or destination is for the sole use of AppsFlyer. There should be no other entity writing to a given destination.
You can delete data in the destination 25 hours after we write the data.
Data written to the destination is a copy of data already in our servers. The data continues to be in our servers in accordance with our retention policy.
For technical reasons, we sometimes delete and rewrite the data. For this reason, we need delete and list permissions. Neither permissions are a security risk for you. In the case of list, we are the sole entity writing to the bucket. In the case of delete, we are able to regenerate the data.
For additional information, you can contact our security team via hello@appsflyer.com or your CSM.

Multiple-connections principles (more than one destination)

In Data Locker you can send some or all of your data to 2 destinations (defined in the connection settings). For example, you can send App A data to AWS, and App B data to GCS.

Each connection consists of a complete set of Data Locker settings, including a destination. Connection settings are independent of one another.

In managing your connections, consider:

In Data Locker settings, connections are shown in tabs. Each connection has its own settings tab from which you can manage the connection. The icon of each tab represents the storage type.
To see connection details, duplicate a connection, or delete a connection, click ⋮ (options).

Additional information

Track connection changes in the audit log

You can view Data Locker connection changes in the Audit log, available from the Security center in the AppsFlyer dashboard. Use the audit log to confirm whether a change was made, when it occurred, and who made it. This can help resolve issues like missing data or unexpected connection changes without needing to contact support.

The following connection lifecycle events are tracked:

New connection created
Connection updated
Connection disabled
Connection deleted

To access the audit log:

In the top navigation bar, open the account menu.
Select Security center.
In the Audit log section, click View audit log.
Filter by Service: Datalocker to view related entries.

For more information see: Audit log.

Traits and Limitations

Trait	Remarks
Ad networks	Not for use by ad networks
Agencies	Not for use by agencies
App-specific time zone	Not Applicable. Data Locker folders are divided into hours using UTC. The actual events contain times in UTC. Convert the times to any other time zone as needed. Irrespective of your app time-zone the delay from event occurrence until it is recorded in Data Locker remains the same.
App-specific currency	Not supported
Size limitations	Not applicable
Data freshness	Data is updated according to the specific report data freshness detailed in this article.
Historical data	Not supported. If you need historical data, some reports, but not all, are available via Pull API.
Restricted data	Fields in some reports are restricted due to privacy limitations. Learn more
User access	Only account users with required permissions can configure Data Locker.
Single app/multiple app	Multi-app support. Data Locker is at the account level
Maximum connections	By default, each account can open up to 2 Data Locker connections. To request additional connections, contact your CSM.

Troubleshooting

Symptom: Unable to retrieve data using AWS CLI
Error message: An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied
Cause: The AWS credentials being used are not the correct credentials for the AppsFlyer bucket. This can be caused by having multiple or invalid credentials on your machine.
Solution:
1. Use a different method, like Cyberduck to access the bucket, meaning not the CLI. Do this to verify that the credentials you are using are working. If you are able to connect using Cyberduck, this indicates an issue with the credentials cache.
2. Refresh the AWS credentials cache.
  Screenshot from AWS

AWS data retrieval

Use your preferred AWS data retrieval tool, AWS CLI, or one of the tools described in the sections that follow. Note! The exact instructions are suitable for AppsFlyer owned buckets. Adjust the instructions as needed if you are connecting to your bucket.

AWS CLI

Before you begin:

Install the AWS CLI on your computer.
In AppsFlyer, go to Data Locker, and retrieve the information contained in the credentials panel.

To use AWS CLI:

Open the terminal. To do so in Windows, <Windows>+<R>, click OK.
The command line window opens.
Enter aws configure.
Enter the AWS Access Key as it appears in the credentials panel.
Enter your AWS Secret Key as it appears in the credentials panel.
Enter eu-west-1.
Press Enter (None).

Use the CLI commands that follow as needed.

In the following commands, the value of {home-folder} can be found

To list folders in your bucket:

aws s3 ls s3://af-ext-reports/{home-folder}/data-locker-hourly/

Listing files and folders

There are three types of folders in your Data Locker bucket:

Report Type t=
Date dt=
Hour h=

To list all the reports of a specific report type:

aws s3 ls s3://af-ext-reports/{home-folder}/data-locker-hourly/t=installs/

To list all the reports of a specific report type for a specific day:

aws s3 ls s3://af-ext-reports/{home-folder}/data-locker-hourly/t=installs/dt=2019-01-17

To list all the reports of a specific report, in a specific hour of a specific day:

aws s3 ls s3://af-ext-reports/{home-folder}/data-locker-hourly/t=installs/dt=2019-01-17/h=23

To download files for a specific date:


aws s3 cp s3://af-ext-reports/<home-folder>/data-locker-hourly/t=installs/dt=2020-08-01/h=9/part-00000.gz ~/Downloads/

Cyberduck

Before you begin:

Install the Cyberduck client.
In AppsFlyer, go to Data Locker and retrieve the information contained in the credentials panel.

To configure Cyberduck:

In Cyberduck, click Action.
Select New Bookmark. The window opens.
In the first field (marked [1] in the screenshot below) select Amazon S3.
Complete the fields as follows:
- Nickname: Free text
- Server: s3.amazonaws.com
- Access Key ID: Copy the AWS Access Key as it appears in the credentials panel in AppsFlyer
- Secret Access Key: Copy the Bucket Secret key as it appears in the credentials panel in AppsFlyer.
- Path: {Bucket Name}/{Home Folder} For example: af-ext-reports/1234-abc-ffffffff
Close the window. To do so, click the X in the upper-right corner of the window.
Select the connection.
The data directories are displayed.

Amazon S3 browser

Before you begin:

Install the Amazon S3 Browser.
In AppsFlyer, go to Data Locker and retrieve the information contained in the credentials panel.

To configure the Amazon S3 Browser:

In the S3 browser, Click Accounts > Add New Account.
The Add New Account window opens.
Complete the fields as follows:
- Account Name: free text.
- Access Key ID: copy the AWS Access Key as it appears in the credentials panel.
- Secret Access Key: copy the Bucket Secret key as it appears in the credentials panel.
- Select Encrypt Access Keys with a password and enter a password. Make a note of this password.
- Select Use secure transfer.
Click Save changes.
Click Buckets > Add External Bucket.
The Add External Bucket window opens.
Enter the Bucket name. The Bucket name has the following format: {Bucket Name}/{Home Folder}. The values needed for bucket name and home folder appear in the credentials window.
Click Add External bucket.The bucket is created and displays in the left panel of the window.
You can now access the Data Locker files.

Folder	Description
Subscription ID	The top-level folder in the bucket depends on the storage owner and provider. In general, the top-level folder is your Subscription ID but in some cases, for example, if you use Cyberduck the ID is set in the bookmark and doesn't display in the folder structure. The data-locker-hourly folder contains the report topics. Folders above this level depend on bucket ownership and cloud service provider. Examples of folder structure based on bucket owner and cloud provider AppsFlyer bucket: <af-ext-reports>/<unique_identifier>/<data-locker-hourly> Your AWS bucket: <af-datalocker-your bucket prefix>/<generated-home-folder><subscription-id> Your GCS bucket: <your bucket name>/<generated-home-folder>/<subscription-id>
Topic (t)	Report type relates to the subject matter of the report.
Date (dt)	This is the related data date. In the case of raw data, it means the date the event occurred. In the case of aggregated data, the reporting date itself.
Time (h or version)	Date folders are divided into hourly (h) or version folders depending on the report type. Hourly folders The h folders relate to the time the data was received by AppsFlyer. For example, install events received between 14:00-15:00 UTC are written to the h=14 file. Note! There is a delay, of about 1-3 hours, between the time the data arrives in AppsFlyer until the h folder is written to Data Locker. For example, the h=14 folder is written 1 hour later at 15:00 UTC. Hourly folder characteristics: There are 24 h folders numbered 0–23. For example, h=0, h=1, and so on. A late folder, h=late, contains events of the preceding day arriving after midnight. Meaning events arriving during 00:00–02:00 UTC of the following day. For example, if a user installs an app on Monday 08:00 UTC and the event arrives on Tuesday 01:00 UTC, the event is written to Monday's late folder. Data arriving after 02:00 UTC is written to the folder of the actual arrival date and time. Ensure that data in the h=late folder is consumed. It isn't contained in any other folder. _temporary folder: In some cases, we generate a temporary folder within an h folder. Disregard temporary folders and subfolders. Example: /t=impressions/dt=2021-04-11/h=18/_temporary. Daily reports Raw data reports having a daily data freshness are stored in the h=23 folder. The uninstall report is usually in the h=2 folder but can be in any folder. Cohort and Incrementality reports are stored directly in the dt folder. Versioned reports adhere to a different convention described in this section. Hourly report considerations for apps that don't use UTC time. To make sure that you get all the data for a given calendar day you must consume the folders according to the day defined by the app timezone as detailed: Eastern hemisphere timezone: To get all the data of a given calendar date you must consume folders according to UTC time and date. Example: Your app timezone is UTC+10 (Sydney, Australia). To get all the hourly data related to Tuesday (Sydney) you must consume the following folders: Monday h=14–23 and late, Tuesday h=0–13 and 14-15 Why must you consume Tuesday h=14-15? Some data can arrive late. So the h=14–15 folders can contain late-arriving events. You must filter event_time to align with the app calendar day relative to UTC. Western hemisphere timezone: To get all the data of a given calendar date you must consume folders according to UTC time and date. Example: Your app timezone is UTC- 7 (Los Angeles). To get all the hourly data related to Tuesday (Los Angeles) you must consume the following folders: Tuesday h=7–23 and late, Wednesday h=0–6 and 7-8. Why must you consume Wednesday h=7-8? Some data can arrive late. So the h=7–8 folders can contain late-arriving events. You must filter event_time to align with the app calendar day relative to UTC. Version folders Some reports have a versioned option. This means that the most updated data for a given day is provided multiple times. Because data can continue to update due to late-arriving data or more accurate data the same report has multiple versions where the most recent version is the most accurate. The reports for a given day are contained in the versions folder of that day. Each version is contained in a separate folder whose name is set using an Epoch timestamp that uniquely identifies the report. Your data import processes must consider that data can be written retroactively. For example, on January 14, data can be written to the Jan 1 folder. If the bucket is owned by you, consider using cloud service notification to trigger your import process (AWS \| GCS)

Overview

Data Locker—features

Reports available via Data Locker

Set Data Locker report settings

1. Set up your cloud service

2. Add a connection to your cloud service

Note

AWS cloud bucket connection

GCS cloud bucket connection

Azure cloud bucket connection

Yandex Cloud bucket connection

BigQuery data warehouse connection

Snowflake data warehouse connection

Note

3. Set the report output settings

Note

4. Select global-level filters

5. Select the report group

6. Customize or duplicate the report

7. Select the report fields

8. Select the report-level filters

9. Remove legacy fields

Note

Non-empty legacy fields

10. Save the connection

Important!

Set user permissions

Manage Data Locker access and ownership (Admins)

Give Data Locker access

Manage ownership and editors

Review filters after changing ownership

Note

Data storage architecture

Overview

Folder structure

Hourly folders

Version folders

App segregation

Data files

Storage options

Caution!

Notice to security officers in the case of customer-controlled storage

Multiple-connections principles (more than one destination)

Additional information

Track connection changes in the audit log

Traits and Limitations

Troubleshooting

AWS data retrieval

AWS CLI

Listing files and folders

Cyberduck

Amazon S3 browser

See also