Get The Analytics You Need: A/B Testing with Feature Flags and Your Existing Stack

By Anna Redbond on July 20, 2023

How do you get the most out of your analytics with feature flags? Do you need A/B testing and experimentation built into your feature flagging tool?

In most cases, and for most A/B tests, you’ll get stronger metrics and make better decisions by A/B testing with feature flags and the existing tools in your stack.

Here’s why. A/B tests in development often fit into the following categories:

  1. Rollouts - These are percentage-based and operational in nature: you release a change to a slice of users while monitoring application performance and other critical metrics. To understand the results and behaviour, use the tools in your stack together: observability tools, analytics tools, and feature management tools (a minimal sketch follows this list). 
  2. Split testing for qualitative feedback - These tests don’t need statistical significance, and it can be argued that they aren’t truly A/B tests. Rather, they’re a great way to learn how samples of a population react to different experiences. 
  3. A vs. B for accuracy - For true accuracy, these need statistical significance, which means thousands and sometimes millions of visitors… or, even worse, long periods of time. Most tests don’t meet this threshold because they lack the time or traffic to be accurate enough. 
  4. Multivariate (MVT) tests - These require even more traffic, and even more variables must be considered. 
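As a minimal sketch of category 1, here is what a percentage-based rollout might look like with the Flagsmith Python SDK. The flag name `new_search`, the environment key, and the backend functions are hypothetical; the percentage split itself is configured in the Flagsmith dashboard, not in code:

```python
from flagsmith import Flagsmith

# Hypothetical server-side environment key from your Flagsmith project
flagsmith = Flagsmith(environment_key="ser.your-server-side-key")

def legacy_search_backend(query: str) -> list:
    return []  # stand-in for the existing implementation

def new_search_backend(query: str) -> list:
    return []  # stand-in for the implementation being rolled out

def search(user_id: str, query: str) -> list:
    # Flagsmith evaluates the percentage split per identity, so the same
    # user stays in the same bucket on every request.
    flags = flagsmith.get_identity_flags(identifier=user_id)
    if flags.is_feature_enabled("new_search"):
        return new_search_backend(query)
    return legacy_search_backend(query)
```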

With these categories in mind, most A/B tests in the development process aren’t run for statistical significance, and they aren’t based on purely statistical data: they’re qualitative tests or rollouts. To understand their behaviour and make informed decisions, you’ll need more than the metrics you’ll get from an all-in-one feature flagging and experimentation tool. 

If you use a specialised feature management tool with your existing analytics tools, you’ll have the metrics you need with the tools that are already in your stack. 

All-in-One Solutions vs. Homegrown Solutions vs. Specialised Tools for A/B Tests

The tooling options for A/B testing with feature flags sit on a continuum:

  • On one end, there are full-suite products that offer feature flagging, A/B testing, and experimentation in a single platform
  • On the other, there are teams building fully homegrown solutions for both flags and A/B testing

Somewhere in the middle is the option to use specialised analytics tools with specialised feature flag software. With this option, you run A/B tests by automatically feeding event and flag data to your existing analytics platform(s). This is where Flagsmith sits.

Flagsmith lets you integrate your A/B and multivariate results with your existing behavioural, database, and performance monitoring tools.

This means you’re not adding another decision point. Instead, you’re using specialised tools together. You can get data in and out of feature flags and use that data to run A/B tests with your existing stack. These tests get enough data and metrics for your PMs, marketing teams, and other stakeholders to make data-driven decisions. 

Why Use Feature Flagging Tools with Analytics Tools

Keep a single decision point for analytics
Don’t add another decision point. Run experiment analysis in your team's source of truth, and feed it enough event and flag data to support data-informed decisions.

Keep looking at the right sources of data 

The metrics from an all-in-one feature flag and experimentation platform are likely not enough for a PM, engineer, or marketer to make an informed decision. Feeding tests with flag data and analysing them with your existing tools (analytics, observability, etc.) will go further.

Keep the tools that work for your teams
Your marketing team might use Grafana for marketing-led tests. Your product team might use Amplitude. You might use observability tools like Dynatrace. Rather than adding another analytics tool (or using a tool that isn’t accessible to other teams, leaving you as the gatekeeper), keep using the tools that work and feed flag data into them. 

Keep costs down and don’t get locked into something you won’t use

Using a feature flag tool with your existing infrastructure and stack can enable engineers and save costs. Introducing more decision points adds tooling costs, increases storage, and can create hidden costs, like the organisational cost of discrepancies in data. Plus, you might end up paying for bundled features you never use.

[Graphic: running a staged rollout and test with Flagsmith]

Why We Chose to Integrate with Analytics Platforms

We want to build a specialised feature flag tool, and we have chosen not to be an all-in-one solution or add features that would compete with your analytics tools. Instead, we specialise in flags and partner with analytics providers. 

We want to make it simpler—and cheaper—for teams to keep making decisions where they are today. We do that by making it easy to get data through integrations or webhooks.
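For the webhook route, here is a rough sketch of a receiver, assuming a Flask app on your side; the payload fields are illustrative rather than Flagsmith’s exact schema, so check the webhook docs before relying on them:

```python
from flask import Flask, request

app = Flask(__name__)

@app.route("/flagsmith-webhook", methods=["POST"])
def flagsmith_webhook():
    # Flagsmith POSTs a JSON body when flag state changes; treat the
    # exact field names as defined in the webhook docs.
    payload = request.get_json()
    record_flag_change(payload)
    return "", 204

def record_flag_change(payload: dict) -> None:
    # Stand-in for writing the change into your analytics/warehouse
    # pipeline, keeping experiment analysis in your source of truth.
    print("flag change:", payload)
```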

How to Use Flagsmith for A/B Testing with Feature Flags

TL;DR: We integrate and partner with analytics tools, offer webhooks, and have a REST API. People also create custom builds for tests and event feeds; read our case studies or ask in Discord to find out how they’re implementing these. 

You can set up and run A/B tests with multivariate flags, feeding flag data to a third-party analytics platform like Amplitude or Mixpanel. You’ll need two main components (a minimal sketch of both follows this list): 

  1. A bucketing engine: The bucketing engine puts each user into a particular A/B testing bucket. These buckets control the specific user experience being tested. 
  2. An analytics platform: The analytics platform receives a stream of event data derived from user behaviour.
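Here is a minimal sketch of the two components together, assuming the Flagsmith Python SDK, a hypothetical multivariate flag named `checkout_flow`, and a generic `track()` function standing in for your analytics SDK (e.g. Mixpanel or Amplitude):

```python
from flagsmith import Flagsmith

# Hypothetical server-side environment key from your Flagsmith project
flagsmith = Flagsmith(environment_key="ser.your-server-side-key")

def track(user_id: str, event: str, properties: dict) -> None:
    # Stand-in for your analytics SDK call (e.g. Mixpanel or Amplitude)
    print(user_id, event, properties)

def render_checkout(user_id: str) -> str:
    # 1. Bucketing engine: Flagsmith assigns this identity one value of the
    #    multivariate flag, according to the weights set in the dashboard.
    flags = flagsmith.get_identity_flags(identifier=user_id)
    variant = flags.get_feature_value("checkout_flow")  # e.g. "control" / "one_page"

    # 2. Analytics platform: record the exposure with the variant attached,
    #    so downstream conversion events can be segmented by experiment arm.
    track(user_id, "checkout_viewed", {"checkout_flow_variant": variant})
    return variant
```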

Read our Docs for more information on setting up and evaluating A/B tests.  

Running an A/B Test on App Modals With Flagsmith and Analytics Platforms

A fintech team implemented A/B tests when they noticed they weren’t receiving enough user feedback from a popup modal in their app.

Users were closing the modal, and their Tech Lead suspected the app was asking for a review at the wrong time. To test this, they created two different modals (A and B) in different places in the app, then evaluated how user feedback varied depending on the modal each user saw. They stored the event information in their data warehouse (Amazon Redshift), and their Marketing and Product teams used Grafana and analytics tools to measure the results of the experiment.

They implemented the A/B test on their own, using Flagsmith segments with a % Segment Override. Then, on their side, they built an event pipeline to send events from the app to the back end and track user behaviour. A rough sketch of that shape follows.
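This sketch assumes the Flagsmith Python SDK, a hypothetical multivariate flag named `review_modal` whose variants are weighted by a % Segment Override in the dashboard, and a hypothetical internal events endpoint standing in for their pipeline:

```python
import requests
from flagsmith import Flagsmith

# Hypothetical server-side environment key from your Flagsmith project
flagsmith = Flagsmith(environment_key="ser.your-server-side-key")

def choose_modal(user_id: str) -> str:
    # The % Segment Override configured in Flagsmith decides which variant
    # this identity receives; the app only reads the resulting value.
    flags = flagsmith.get_identity_flags(identifier=user_id)
    return flags.get_feature_value("review_modal")  # e.g. "modal_a" / "modal_b"

def send_feedback_event(user_id: str, action: str, variant: str) -> None:
    # Stand-in for their event pipeline: events go to the back end, land in
    # the warehouse (Redshift), and are analysed in Grafana.
    requests.post(
        "https://api.example.com/events",  # hypothetical endpoint
        json={"user_id": user_id, "action": action, "modal_variant": variant},
        timeout=5,
    )
```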

When they ran the test, option A was by far the better approach for users. After implementing that option for all users, they’re receiving more feedback and the reviews are much better.

Read the full use case and technical implementation here.


Resources to Get Started With Flagsmith and A/B Tests 

  • Docs
  • Video
  • Interactive Demo

Conclusion 

There are so many ways to build and test. As we’ve built Flagsmith, we’ve decided not to add another decision point or make releasing any more complicated. Instead, we think releases are simpler and more data-informed with specialised tools that just work. 

With A/B testing and feature flags, this means using a feature flag tool to send flag data to the analytics platforms your teams are already using. 

