04-09-2026 03:20 PM - edited 04-09-2026 04:06 PM
In this article, we will delve into the metrics available during and after the A/B test, the meaning of the outcome produced, and how to interpret the high or low statistical probability associated with the outcome. The following topics are covered:
After each day that shopper visits are qualified into the A/B Test, the experiment data is updated and available to view and analyze within the A/B Test Details page. As with all enhanced content data collected by Syndigo, this information is preserved and available at any point after the test concludes, as well.
While the test remains active, only the Metrics Table is visible on the A/B Test Details page (although raw experiment data may always be exported from this interface). This table provides the following KPIs of importance to enhanced content subscribers:
Each metric is captured with its unique value for the Content A and Content B cohorts, and the difference between these two values is presented in the last column of the table. Each metric may also be further expanded to reveal the breakdown at each retailer website from which experiment data was collected.
After the system produces the outcome of an A/B test, an additional visualization is provided on the test details page: A chart that helps represent the all-important conversion rate KPI from two perspectives – over time and by retailer website.
Within the context of a specific A/B test, exporting the raw underlying data is possible both during testing and any time after test conclusion. As with all Syndigo enhanced content, experiment data is aggregated once daily (overnight). Data becomes available after it is processed overnight for the day prior.
To export A/B test data:
The export types: URL, Widget, or Asset – These options refer to the aggregation level of the enhanced content metrics.
The highest level in the data hierarchy is the URL Level. Metrics such as visits, impressions, and clicks are rolled up to represent a maximum of one per page load and there is no further breakdown across multiple layouts/sections, individual widgets, or assets on the product pages.
Format of data at the URL Level:
Widget Level provides the same metrics broken down to each individual widget or module in the enhanced content. This data set includes more detail about which widgets specifically shoppers viewed and clicked.
Format of data at the Widget Level:
Asset Level is the last option and represents the deepest enhanced content data available. Within each page there may be multiple widgets, and within each widget there may be multiple assets. Choose Asset Level to see the metrics broken down to specific images, video files, etc.
Format of data at the Asset Level:
When reviewing the A/B test results at the conclusion of an experiment, the first item you may notice is the prominent statement on the details page. This statement captures the overall outcome of the experiment, and will appear as one of the following:
Note: If the primary message states that no data is available, please reference the following Help Center article: Troubleshooting: A/B Test availability and data collection.
Directly beneath this first statement, the conversion rates corresponding to each version are displayed. One content's conversion rate is deemed higher than the other when the calculated difference between the two contents' conversion rates is greater than 0.5%. If the difference is less than 0.5%, this results in the outcome that they have similar conversion rates.
This outcome statement provides immediate insight into what occurred during the experiment: whether there was an observed and notable difference in the frequency by which the product was added to cart or purchased by shoppers who viewed content A versus those who viewed content B. As the general goal of conducting A/B testing is to validate which assets and layouts resulted in a higher conversion rate during the course of the experiment, this declaration of the winning content version may serve as the sole insight you derive from your analysis.
However, the other information Syndigo provides in addition to this statement may be crucial in helping your organization interpret the outcome. The remainder of this article will address the concept of statistical probability, which plays a vital role in determining the validity and reliability of the results. Whether the A/B test concluded with a difference in conversion rates or a declaration of similar conversion rates across the two content versions, the Syndigo system presents the calculated likelihood that the outcome is accurate and will be consistent with continuous testing.
Beneath the conversion rates, an additional statement is provided in the A/B test results. This secondary statement is either preceded by a green checkmark or a yellow warning indicator, followed by the probability (captured as a percentage) that the winning content will result in more conversions than the losing content. If the A/B test shows a green checkmark, this means high statistical probability was calculated. If the yellow warning icon is displayed, this indicates a low statistical probability. Based on whether the probability is high or low, please navigate to the appropriate section below to learn more about what this means and the recommended actions to take.
When the Syndigo system presents a green icon with a checkmark alongside the secondary statement that includes probability information, this serves as confirmation that the experiment meets the criteria to be considered statistically significant. One of the requirements to reach statistical significance is that the calculated probability, or confidence level, is equal to or greater than 90%.
High statistical probability means the results of an A/B test exhibit a clear and substantial difference between the two variations being tested. In other words, when the observed data strongly suggests that one variation outperforms the other with a high degree of confidence, it is said to have a high statistical probability. This may also be thought of as the likelihood that the outcome of the A/B test will be the same if the test is run again any number of times.
If Content B has the higher conversion rate and the calculated probability is 90%, this means that there is a 90% likelihood that running the experiment again will always result in Content B having the higher conversion rate.
Next steps: When there is a conversion rate difference paired with high probability, confidently choose the winning variation and implement it, knowing that the observed effects are likely genuine and not due to chance.
If the conversion rates of A and B are deemed similar and there is a high probability calculated:
In either case where high probability is calculated:
Store the results to utilize in future analysis activities and generate valuable insights about shopper preferences, behaviors, and expectations. A high statistical probability indicates this information should be used to shape and refine future design and marketing strategies. Keep all raw experiment data, which can be exported from the A/B test details page, in an organized repository for advanced analysis at a later date. Potential insights include but are not limited to:
Lower statistical probability means the experiment does not meet the criteria to be considered statistically significant. There is a higher degree of uncertainty that the observed effects may not hold true in real-world scenarios. In such cases, the observed conversion rates could easily be attributed to random chance, and there is a lack of confidence in the validity of the results.
However, do not assume that a probability less than 90% means the A/B test is a failure. The standards for reaching statistical significance are quite high, and it's primarily the basis for experimentation in academic or scientific industries where outcomes must be backed by over 95% statistical probability. Very few organizations employ a strictly scientific approach to digital product marketing, where there is no risk to the audience's safety and wellbeing. Rather, there is another, simpler way to interpret the outcome of an A/B test:
The A/B test results show there is a n% likelihood that running the experiment again will result in this same outcome.
For example, in the scenario where Content B's conversion rate is higher than Content A and the calculated probability is 84%, the outcome of this experiment can be interpreted as follows: "There is an 84% likelihood that running the experiment again will result in Content B shoppers converting at a higher rate than Content A shoppers. That means there's a 16% likelihood the results will be different."
Or if the Content A and B conversion rates are similar, and the calculated probability is 56%, then this scenario can be summarized as: "There is a 56% likelihood that running this experiment again will result in Content A and B shoppers converting at a similar rate, and a 44% likelihood the results will be different."
In the first scenario, most eCommerce strategists would agree that 84% is more than enough likelihood to justify concluding that Content B is essentially the winner of the A/B experiment. There is only a 16% chance that running the experiment across a larger sample size of shoppers will result in a different outcome. However, in the second scenario, there is a sense that it is a coin toss: It is just as likely that running the experiment again will produce the same outcome as it may produce a different outcome. Depending on perspective and an organization's appetite for risk, the probability – while not over 90% - still can guide on the next steps to take in the journey.
The primary reason tests encounter low statistical probability is that the experiment did not collect enough data. More specifically, too little data is usually the culprit behind the following characteristics of low probability results:
Designing the A/B test to collect as much data as possible is the best way to avoid encountering low statistical significance.
Next steps:
Do not jump to the conclusion that a difference in conversion rate, when paired with low statistical probability, points accurately to the superior and inferior content variation. Rather, this underscores the need for iterative testing and optimization.
In all cases: Low statistical probability results should prompt organizations to delve deeper into the underlying factors affecting user behavior. Further research may be needed to uncover insights that were not initially apparent.
In conclusion, high and low statistical probability are essential concepts in the realm of A/B testing. High statistical probability signifies strong confidence in the observed effects and enables confident decision-making, while low statistical probability calls for further investigation. Understanding these concepts empowers organizations to make informed choices and optimize their strategies based on reliable data analysis. As technology and methodologies continue to evolve, mastering the art of interpreting statistical probability will remain a critical skill for data-driven decision-makers in various industries.