A/B testing virtual try-on on Shopify product pages

You installed virtual try-on. Sessions are climbing. But did try-on actually change shopping behavior — or did a sale, an ad campaign, or seasonality move the numbers? Without a control group, you cannot know.

This guide walks through how to run a clean A/B test on Shopify product pages using Stylab's built-in traffic buckets. No external testing tool required.

Why guessing fails

Common traps merchants fall into:

Comparing this month to last month while running a promo
Enabling try-on store-wide and attributing any cart lift to the widget
Looking only at try-on session count (engagement ≠ purchase intent)
Stopping after a week when traffic is too thin to learn anything

A/B testing splits visitors on the same product pages into two groups: one sees try-on, one does not. Same traffic source, same pricing, same photos — only the try-on button differs.

How Stylab A/B testing works

From Widget in your Shopify admin app:

Toggle A/B testing on
Set the traffic slider (10%–90% of visitors see try-on)
Each visitor is assigned to a bucket consistently for the test period

The control group sees the normal product page. The test group sees the Try it on button. After enough time, compare buckets in your dashboard:

Page views per bucket
Try-on sessions (test group only)
Add-to-cart count and rate per bucket

Pre-test checklist (do this before you start)

Pick 2–5 products with steady weekly traffic — not new launches with zero history
Confirm garment photos are acceptable (upgrade flat-lays on key SKUs if needed — see flat-lay guide)
Freeze pricing, hero images, and major ad spend changes for 30 days
Note your baseline add-to-cart rate from Shopify analytics for those URLs
Estimate expected try-on volume with our usage calculator so you stay within plan quota

30-day test protocol

Week 0 — Enable try-on on selected SKUs only. Run without A/B for 3–5 days to confirm the widget works and shoppers use it.
Day 1 of test — Set A/B to 50/50. Do not touch the slider mid-test.
Weeks 1–4 — Monitor weekly: try-on adoption (% of test-group visitors who start try-on), add-to-cart rate per bucket, failed generations.
Day 30 — Review results using the decision framework below.

Low traffic? If a product gets fewer than ~500 PDP views in 30 days, extend the test or combine similar SKUs. Thin data produces noisy conclusions — not wrong answers, just unreliable ones.

Metrics that matter vs vanity metrics

Watch these

Add-to-cart rate — test bucket vs control bucket on the same products
Try-on adoption — are shoppers actually clicking? Low adoption may mean button placement, mobile UX, or wrong SKUs
Try-on → cart path — sessions that led to add-to-cart (available in dashboard)
Failed generation rate — high failures may indicate photo quality issues, not product-market fit

Do not over-index on these alone

Raw try-on session count without a control comparison
Store-wide revenue during a sale period
Social media mentions or press

How to decide after 30 days

Use this framework — not gut feeling:

Expand — test bucket shows equal or higher add-to-cart rate and meaningful try-on adoption. Roll out to more SKUs; consider 70/30 or 100% try-on on winners.
Iterate — try-on adoption is strong but cart rate is flat. Improve flat-lay photos, button copy, or product selection before scaling.
Pause on SKU — near-zero try-on usage after 30 days. The product may not need visualization, or traffic is too low.
Extend test — inconclusive with thin traffic. Run 30 more days or add products.

Results vary by niche, photography, price point, and shopper demographics. Stylab helps you measure — it does not guarantee a specific lift.

Related guides

Try the live demo →