Datadrew syncs a specific set of data streams from your Shopify store to build your analytics dashboards and power Drew AI. This article explains exactly what data is included, how each stream is synced, and what you can expect after connecting.
Data streams included in the sync
The following table shows every data stream Datadrew pulls from your Shopify store:
Data Stream | Sync Method | What It Contains |
Orders | Incremental (by updated_at) | Order ID, line items, financial status, fulfillment status, discounts, shipping, taxes, created/updated timestamps, customer reference, and tags. |
Customers | Incremental (by updated_at) | Customer ID, email, first/last name, order count, total spent, tags, addresses, accepts marketing, created/updated timestamps. |
Products | Incremental (by updated_at) | Product ID, title, handle, vendor, product type, tags, status, variants (with prices, SKUs, inventory), created/updated timestamps. |
Product Images | Incremental (by updated_at) | Image ID, product reference, source URL, position, alt text, dimensions. |
Order Refunds | Incremental (by created_at) | Refund ID, order reference, line item adjustments, refund amounts, reason, created timestamp. |
Shop | Full refresh | Store name, domain, currency, timezone, country, plan name. Refreshed completely on each sync to capture any configuration changes. |
Understanding sync methods
Incremental sync -- Only records created or updated since the last sync are fetched. This keeps syncs fast and efficient. Datadrew tracks the cursor (usually
updated_at) so it knows where to resume.Full refresh -- The entire dataset is replaced on each sync. This is only used for the Shop stream, which is a single lightweight record.
What is NOT included
To keep syncs fast and focused on analytics, Datadrew does not currently sync:
Customer addresses as a separate stream (address data is available within the Customers stream)
Shopify GraphQL product data (the REST product stream is used instead)
Draft orders, abandoned checkouts, or inventory locations as separate streams
Shopify POS data (unless it creates standard orders)
How data flows after syncing
Raw data lands in our data warehouse -- Each data stream is written to a dedicated table in your store's our data warehouse dataset.
data transformations process the data -- Our transformation pipeline cleans, deduplicates, and joins your raw data into analytics-ready tables like
shopify_orders,shopify_customers, andshopify_products.Dashboards and AI query the transformed data -- Your Datadrew dashboards (powered by our analytics engine) and Drew AI both read from these transformed tables.
Initial sync timeline
Your first sync imports your full Shopify history. Here is a rough guide:
Small stores (under 10,000 orders) -- 5 to 30 minutes
Medium stores (10,000 to 100,000 orders) -- 30 minutes to 2 hours
Large stores (100,000+ orders) -- 2 to 6 hours
You will see a sync status indicator in Datadrew while the initial sync is in progress. Once complete, your dashboards will begin populating with data.
Ongoing sync schedule
Starter plan -- Once every 24 hours
Pro plan -- Every hour
Sync schedules are managed automatically. You do not need to trigger syncs manually.
Need help? Contact us at support@datadrew.io.
Related articles
Need help?
If you have questions or run into issues, reach out to us at support@datadrew.io or use the in-app chat. We're happy to help.
