At a glance: Cost ETL, part of AppsFlyer ROI360, provides advertisers with campaign cost data having the best granularity available per media source. Data, updated up to 4 times daily, is delivered to your cloud service ready for loading into advertiser BI systems.
About Cost ETL
Cost ETL, part of AppsFlyer ROI360, provides advertisers with campaign cost data with the best granularity available per media source. Up to four times daily, the data is updated and delivered to your Data Locker cloud service, ready for loading into advertiser BI systems.
Advertisers can effortlessly route reports to multiple cloud destinations at once. This empowers teams when transitioning from one cloud service to another, as data is accessible in multiple services simultaneously.
Campaign cost data is written:
- To your cloud service or bucket for viewing, transferring, and loading cost data into your systems.
- For all apps you choose to include in Cost ETL, per day, per version.
- Up to 4 times per day (data freshness: intraday).
- For the current day and the previous 6 days (referred to as 7 days in this article), as well as days 14, 29, and 88. Take this into consideration in your data loading process.
- Example: For the date October 14, 2024, the file contains data for October 14 (the current day), October 13 (1 day back), October 12 (2 days back), October 11 (3 days back), October 10 (4 days back), October 9 (5 days back), October 8 (6 days back), September 30 (14 days back), September 15 (29 days back), and July 18 (88 days back).
- The retroactive data allows for updates and corrections in the cost data reporting.
- For cost matched with an attribution, click, or impression.
Note:- Data for the last 7 days is pulled anew from the media sources. Data for days 14, 29, and 88 is re-processed.
- Only cost data is updated retroactively; not attribution data.
Note:
- The sample provided is an Excel file with the data broken down by channel. Cost ETL files are sent to your bucket as parquet files.
- Report dimensions and metrics are detailed in the file fields table. You should use the dimensions and metrics that best fit your business needs. This may differ depending on the media source.
Report details
Directory and filename structure
Data written to the cloud service or bucket has the directory and file structure described:
- When Cost ETL completes writing to a directory, a flag is set by creating a success file. This is always the most recent timestamp in the directory.
- The number of folders/files is as follows:
- Summary: Up to 4 version folders per day.
- Each version folder contains parquet files with 7 days of data.
- Geo or Channel report: Each report contains up to 4 version folders per day.
- Each version folder contains parquet files containing data with numbering starting from 1.
- [Closed beta] All cost: Based on the geo dimension, contains up to 4 files per day.
- Summary: Up to 4 version folders per day.
Data Locker connection directory structure
<client-bucket>/<DL-generated-partition>/<connection-name>/t=<report-name>/dt=<YYYY-MM-DD>/v=<1/2/3/4>/<parquet file number>
Variable | Content |
---|---|
client-bucket | Name of the bucket as configured in the cloud service |
DL-generated-partition | Generated partition, unique per customer |
connection-name | Connection name as defined during Data Locker setup |
t | The name of the specific report. Either:
|
dt |
|
v |
|
Parquet file number |
|
Cost ETL connection directory structure
<advertiser bucket name>/cost_etl/version/dt=<yyyy-mm-dd>/b=<n>/<dimension>/<file name/<parquet file number>
Variable | Content |
---|---|
advertiser_bucket_name | As defined in the Cost ETL configuration: af-xpend-cost-etl-<af-account-id>-[your bucket name suffix] |
cost_etl | Always cost_etl |
version | Cost ETL version |
date |
Cost date Format: yyyy-mm-dd |
batch | Number 1-4 |
dimension | Data dimension:
|
file_name |
|
Parquet file number |
|
Example: For the first data pull of June 23, 2020, the directory and file name structure is as follows: /bucket-name/cost_etl/v1/dt=2020-06-23/b=1/geo/part-00001
File fields
Fields/Dimensions
Field | Remarks | Always populated | Format | Included in summary file |
---|---|---|---|---|
date | Date cost incurred reported by the media source | Yes | String yyyy-mm-dd | ✓ |
app_id | App id in the AppsFlyer platform | Yes | String | ✓ |
media_source | Media source responsible for display the ad | Yes | String | ✓ |
os | Operating system of the device. Possible values:
|
Yes | String | - |
agency | Agency responsible for placing the ad | No | String | - |
campaign | Component of the advertising hierarchy | No | String | - |
campaign_id | Component of the advertising hierarchy | No | String | - |
adset | Component of the advertising hierarchy | No | String | - |
adset_id | Component of the advertising hierarchy | No | String | - |
ad | Component of the advertising hierarchy | No | String | - |
ad_id | Component of the advertising hierarchy | No | String | - |
ad_account |
|
No | String | - |
currency | Currency of advertiser spend as defined for the app in AppsFlyer | Yes | 3 character string compliant with ISO-4217 | ✓ |
original_currency | Currency of cost as reported by the network before any conversions | Yes | 3 character string compliant with ISO-4217 | - |
timezone |
|
Yes | String | - |
geo | Dimension in the advertising hierarchy | No | 2 character string compliant with ISO 3166 | ✓ |
channel | Dimension in the advertising hierarchy | No | String | - |
keyword_term | Word(s) used by the user for online search | Yes | String | - |
keyword_id | ID of the ASA keyword terms | Yes | String | - |
site_id | Publisher ID | No | String | - |
campaign_objective | Component of campaign properties. Learn more | No | String | - |
cost_model | Component of campaign properties. Learn more | No | String | - |
af_cost_model | Cost model mapped and normalized by AppsFlyer. Component of campaign properties. Learn more | No | String | - |
bid_strategy | Component of campaign properties. Learn more | No | String | - |
af_bid_strategy | Bid strategy mapped and normalized by AppsFlyer. Component of campaign properties. Learn more | No | String | - |
bid_amount | Component of campaign properties. Learn more | No | Integer | - |
original_bid_amount | Component of campaign properties. Learn more | No | Integer | - |
Metrics
Field | Remarks | Always populated | Format | Included in summary file |
---|---|---|---|---|
impressions |
|
Yes. If no value is available for a particular metric, it is populated with 0. |
Integer | ✓ |
clicks |
|
Integer | ✓ | |
reported_impressions | Counted by the Media source | Integer | ✓ | |
reported_clicks | Counted by the Media source | Integer | ✓ | |
installs | Counted by AppsFlyer | Integer | ✓ | |
reported_conversions | Counted by the Media source | Integer | ||
re_engagements | Counted by AppsFlyer | Integer | ✓ | |
re_attributions | Counted by AppsFlyer | Integer | ✓ | |
cost | Amount of spend (including agency fees where relevant) | Value | ✓ | |
original_cost | Cost as reported by the network, in the currency reported by the network before any currency conversion (with agency fees calculated by AppsFlyer added where relevant) | Value | - | |
impressions_discrepancy |
|
Integer | - | |
clicks_discrepancy |
|
Integer | - | |
installs_discrepancy |
|
Integer | - | |
fees | Fees an agency charges in addition to the usual ad cost. Counted by AppsFlyer | Yes | Integer | - |
cost_without_fees | Cost minus the agency fee. Counted by AppsFlyer | Yes | Integer | - |
original_cost_without_fees | Original cost as reported by the ad network, without agency fees | Yes | Integer | - |
ctr |
|
No |
Integer | - |
cvr |
|
Integer | - | |
ecpm |
|
Integer | - | |
cpi |
|
Integer | - | |
ccvr |
|
Integer | - | |
cvvr |
|
Integer | - | |
reported_cvr |
|
Integer | - | |
ecpc |
|
Integer | - | |
video_25p_views | Video played 25%. Reported by ad network | Integer | - | |
video_50p_views | Video played 50%. Reported by ad network | Integer | - | |
video_75p_views | Video played 75%. Reported by ad network | Integer | - | |
video_completions | Reported by ad network | Integer | - |
Procedures
Set up Cost ETL
This configuration procedure must be performed by an admin user.
Before you start:
Cost ETL setup requires:
- A cloud service set up with permission for AppsFlyer to write data to it. This means you need to set up either 1 or more of the following cloud services:
- AppsFlyer admin privileges and access to the AppsFlyer UI to complete Cost ETL setup.
- In AppsFlyer, from the side menu, go to Export > Cost ETL.
- Select or set up a Data connection:
- If you already have data connections set up, in the Connection dropdown, select the ones you want for Cost ETL.
- If you don't have a data connection yet, click + New connection in Data Locker and create a connection there:
- Name your connection.
-
Select a cloud service data destination. Do one of the following:
- Select S3.
- Enter your AWS S3 bucket name.
af-
prefix is mandatory, and should be entered manually. - Click Test connection.
- Verify that an error message indicating that the bucket path is invalid isn't displayed.
- Select whether to Make this connection compatible with Adobe Experience Platform.
- Click Save.
- In the Cost ETL setup, in the Connection dropdown, select the cloud service you created.
- Enter your AWS S3 bucket name.
- Select GCS.
- Enter your GCS bucket name.
- Click Test connection.
- Verify that an error message indicating that the bucket path is invalid isn't displayed.
- Select whether to Make this connection compatible with Adobe Experience Platform.
- Click Save.
- In the Cost ETL setup, in the Connection dropdown, select the cloud service you created.
- [Beta] Select Azure.
- Enter your Connection name, Storage account name, and Key.
- Verify that an error message indicating that the bucket path is invalid isn't displayed.
- Select whether to Make this connection compatible with Adobe Experience Platform.
- Click Save.
- In the Cost ETL setup, in the Connection dropdown, select the cloud service you created.
- [Beta] Select Yandex.
- Enter your Bucket name, Access key, and Secret key.
- Verify that an error message indicating that the bucket path is invalid isn't displayed.
- Select whether to Make this connection compatible with Adobe Experience Platform.
- Click Save.
- In the Cost ETL setup, in the Connection dropdown, select the cloud service you created.
- Select BigQuery.
- Enter your BigQuery project ID and dataset name.
- Click Test connection.
- Verify that an error message indicating that the bucket path is invalid isn't displayed.
- Click Save.
- In the Cost ETL setup, in the Connection dropdown, select the cloud service you created.
- Select Snowflake.
- Enter your Snowflake region and account ID.
- Click Test connection.
- Verify that an error message indicating that the bucket path is invalid isn't displayed.
- Click Save.
- In the Cost ETL setup, in the Connection dropdown, select the cloud service you created.
- Select S3.
- Name your connection.
- If you already have data connections set up, in the Connection dropdown, select the ones you want for Cost ETL.
- Click Next.
- Set the report Version schedule (UTC):
- Select how many reports to receive every day (1, 2, 3, or 4).
- Select what time to receive the reports. Learn more
- Click Next.
- Set up the report details:
- Select 1 or more apps to include in reports. Select Add all future apps to automatically include apps you add in the future.
- Select at least 2 reports to receive:
- Summary: Generalized report that provides high-level data for easier and faster consumption. Enabled by default.
- Geo: Data broken down by geo for all media sources contained in that report. For example data for Google Ads is further broken down by geo: USA, Canada, France, etc.
- Channel: Data broken down by channel for all media sources contained in that report. For example for Google Ads, data is broken down further by YouTube, Shopping, SERP, and Gmail.
- [Closed beta] All cost report: Based on the geo dimension. Includes cost data for all marketing activity on all platforms, including for apps/platforms not added in AppsFlyer (in these cases, the app ID is marked as unknown.
- Select dimensions to be included in the reports. Note: If you chose to receive the Geo report, the geo dimension is included by default. If you chose to receive the Channel report, the channel dimension is included by default.
- Select the metrics to be included in the reports.
- Select 1 or more apps to include in reports. Select Add all future apps to automatically include apps you add in the future.
- Click Apply.
Edit or change connection
You can edit or change connections to get data sent to different (or additional) places.
Note: After switching to an existing connection, you will not be able to revert to your previous Cost ETL connection.
Edit connection
To edit connection details:
- In AppsFlyer, from the side menu, go to Export > Cost ETL.
- In the Data connection section, either:
- Follow the on-screen instructions to edit your connection details.
- Click Save changes or Apply as prompted.
Change Data Locker connection
To change from one Data Locker connection to another:
- In AppsFlyer, from the side menu, go to Export > Cost ETL.
- In the Data connection section, either:
- If you already have the Data Locker connections you want set up, in the Connection dropdown, select the ones you want for Cost ETL.
- If you don't have the Data Locker connection you want set up yet, click + New connection in Data Locker and create a connection there. Then go back to the Cost ETL settings and select it from the Connection dropdown.
- If you already have the Data Locker connections you want set up, in the Connection dropdown, select the ones you want for Cost ETL.
- Click Save changes.
Change from Cost ETL to Data Locker connection
Before you begin:
- Consider that after switching to a Data Locker connection, you can't switch back to your previous Cost ETL connection.
-
Note that when switching to a Data Locker connection, the folder path structure changes to the following format without impacting the report data:
<client-bucket>/<DL-generated-partition>/<connection-name>/t=<report-name>/dt=<YYYY-MM-DD>/v=<1/2/3/4>/
To change from a deprecated Cost ETL connection to a Data Locker connection:
- In AppsFlyer, from the side menu, go to Export > Cost ETL.
- In the Data connection section, click Data Locker connections.
- Do one of the following to set up the Data Locker connection:
- If you already have the data connections you want set up, in the Connection dropdown, select the ones you want for Cost ETL.
- If you don't have the data connection you want set up yet, click + New connection in Data Locker and create a connection there. Then go back to the Cost ETL settings and select it from the Connection dropdown.
- If you already have the data connections you want set up, in the Connection dropdown, select the ones you want for Cost ETL.
- Click Save changes.
Change AWS object ownership
In AWS, by default, when AppsFlyer writes objects to your bucket, the object owner is AppsFlyer. Depending on your data loading process, you might have to change the default ownership to you—the bucket owner.
To change the ownership of objects in your bucket:
- Sign in to the AWS Management Console and open the Amazon S3 console at https://console.aws.amazon.com/s3/.
- In the Buckets list, choose the name of the bucket that you want to enable S3 Object Ownership for.
- Go to the Permissions tab.
- Under Object Ownership, click Edit.
- Select Bucket owner preferred.
- Click Save.
Best practices
Report schedule
The report schedule settings allow you to get the freshest data right when you need it.
When choosing your report schedule:
- Select the time to get the report as close as possible to the time you start processing data in your BI system.
- If you have any any ad networks that provide yesterday's data later than others, set an additional report at the time when that ad network data is ready.
Override data
When pulling and analyzing your data, it is recommended to pull data for a specific date and batch, or override all previous data for the days that the current batch provides. Otherwise, you may see the same data repeated.
For example, batch 1 on February 20 contains data for Feb 14-20. But, batches written on February 19 also contained data for Feb 14 to Feb 19. Override the data of the previous days received on February 19 with the data received in the most recent February 20 batch.
Geo versus channel
Not all networks provide data for all dimensions together. Geo and channel data in Meta ads are the most common examples. This is why two separate data sets are provided. The geo data set is guaranteed to have geo data and the channel data set is guaranteed to have channel data.
In many cases and for many media sources, the data in the geo and channel sets will be identical. As such, consume one of the data sets (geo or channel), according to what best suits your needs.
If the integration agreement with a given media source doesn't include channel, so that channel is blank, we treat that data as if it contains the channel.
Aggregate data
Cost ETL provides flexible and granular data as deep as can be extracted from the ad network. To extract actionable insight from such potentially huge amounts of data, it is recommended to aggregate the data in a way that best suits your business needs. For example, if you need to understand cost data at the campaign and country-level, use those dimensions.
Standardization across networks
Not all networks provide data with the same granularity. For example, Meta ads doesn't provide site ID cost data, while X Ads doesn't provide geo cost data. Be aware of such cases as you aggregate Cost ETL data, and make sure you look at similar data as you compare networks.
Compare data
Cost ETL provides information regarding all your cost data. Some campaigns provided in Cost ETL do not appear in some AppsFlyer dashboards, for example, data of inactive campaigns, meaning campaigns without any recorded installs. To compare the data, find a specific campaign ID in the overview dashboard and compare it to its cost data in Cost ETL. Learn more about cost data availability
Combine Cost ETL and cohort reports
Consider combining Cost ETL reports with Aggregated advanced cohort reports (or regular Cohort reports via Data Locker) in your BI system. Together, they give the fullest picture of marketing performance with fresh and accurate data, including clicks, impressions, cost, revenue, in-app events, etc.). And you can use this combined data to get ROAS, CPA, etc. Learn more
Additional information
Traits and limitations
Trait | Remarks |
---|---|
Timezone | If the timezone is changed, cost data is duplicated on the day and the day following the change. Learn more |
Data freshness |
|
All cost report | All cost reports (closed beta) currently don't include cost data for Google Performance Max campaigns. |
SKAN campaign cost data |
|