Skip to main content

Shopify data sync — what's included

Which Shopify data is synced, how often, and how historical backfills work.

Updated today

Datadrew syncs a specific set of data streams from your Shopify store to build your analytics dashboards and power Drew AI. This article explains exactly what data is included, how each stream is synced, and what you can expect after connecting.

Data streams included in the sync

The following table shows every data stream Datadrew pulls from your Shopify store:

Data Stream

Sync Method

What It Contains

Orders

Incremental (by updated_at)

Order ID, line items, financial status, fulfillment status, discounts, shipping, taxes, created/updated timestamps, customer reference, and tags.

Customers

Incremental (by updated_at)

Customer ID, email, first/last name, order count, total spent, tags, addresses, accepts marketing, created/updated timestamps.

Products

Incremental (by updated_at)

Product ID, title, handle, vendor, product type, tags, status, variants (with prices, SKUs, inventory), created/updated timestamps.

Product Images

Incremental (by updated_at)

Image ID, product reference, source URL, position, alt text, dimensions.

Order Refunds

Incremental (by created_at)

Refund ID, order reference, line item adjustments, refund amounts, reason, created timestamp.

Shop

Full refresh

Store name, domain, currency, timezone, country, plan name. Refreshed completely on each sync to capture any configuration changes.

Understanding sync methods

  • Incremental sync -- Only records created or updated since the last sync are fetched. This keeps syncs fast and efficient. Datadrew tracks the cursor (usually updated_at) so it knows where to resume.

  • Full refresh -- The entire dataset is replaced on each sync. This is only used for the Shop stream, which is a single lightweight record.

What is NOT included

To keep syncs fast and focused on analytics, Datadrew does not currently sync:

  • Customer addresses as a separate stream (address data is available within the Customers stream)

  • Shopify GraphQL product data (the REST product stream is used instead)

  • Draft orders, abandoned checkouts, or inventory locations as separate streams

  • Shopify POS data (unless it creates standard orders)

How data flows after syncing

  1. Raw data lands in our data warehouse -- Each data stream is written to a dedicated table in your store's our data warehouse dataset.

  2. data transformations process the data -- Our transformation pipeline cleans, deduplicates, and joins your raw data into analytics-ready tables like shopify_orders, shopify_customers, and shopify_products.

  3. Dashboards and AI query the transformed data -- Your Datadrew dashboards (powered by our analytics engine) and Drew AI both read from these transformed tables.

Initial sync timeline

Your first sync imports your full Shopify history. Here is a rough guide:

  • Small stores (under 10,000 orders) -- 5 to 30 minutes

  • Medium stores (10,000 to 100,000 orders) -- 30 minutes to 2 hours

  • Large stores (100,000+ orders) -- 2 to 6 hours

You will see a sync status indicator in Datadrew while the initial sync is in progress. Once complete, your dashboards will begin populating with data.

Ongoing sync schedule

  • Starter plan -- Once every 24 hours

  • Pro plan -- Every hour

Sync schedules are managed automatically. You do not need to trigger syncs manually.

Need help? Contact us at support@datadrew.io.

Related articles


Need help?

If you have questions or run into issues, reach out to us at support@datadrew.io or use the in-app chat. We're happy to help.

Did this answer your question?