Moving to Progressive Delivery with Feature Flags

“Move fast and break things.” We've heard this countless times. But the best engineering teams ask a better question:
“How do we move fast without breaking things?”
When you’re in a highly regulated sector, the stakes are high. You must constantly keep up with customer (and market) demands, but your compliance and risk-reduction requirements stay the same. If you’re using older techniques like big bang deployments, every release carries a risk of failure, and the more you ship, the higher that risk becomes.
Take the CrowdStrike incident, for example. A single update to the Falcon sensor affected Windows systems worldwide, causing a major outage across critical industries like airlines and government services.
For teams that don’t want to risk it all with every deployment, progressive delivery provides a safer approach. In CrowdStrike’s case, testing the update on a small subset of systems and scaling up gradually would have limited the blast radius instead of hitting every connected system at once.
In this guide, we’ll explain how progressive delivery works and how to do it well using feature flagging and modern observability tools.
What is progressive delivery?
Progressive delivery is a deployment strategy that involves continuously deploying code to production but controlling who sees it and when. It builds upon continuous delivery practices by giving you fine-grained control over feature releases.
With progressive delivery, you introduce changes to small subsets of users first, monitor the impact, and gradually increase exposure as you gain confidence.
This controlled exposure model creates a safety net for deployments. Instead of shipping and praying it works, you get multiple decision points to evaluate whether the release should continue.
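To make controlled exposure concrete, here’s a minimal sketch of how percentage-based rollouts are commonly implemented under the hood: each user is hashed into a stable bucket, and only users in buckets below the current rollout percentage see the feature. The function and names here are illustrative; in practice, your feature flag platform handles this for you.

```python
import hashlib

def in_rollout(user_id: str, flag_name: str, rollout_percentage: float) -> bool:
    """Decide deterministically whether a user falls inside the current rollout.

    Hashing the user ID together with the flag name gives each user a stable
    bucket from 0 to 99, so the same user keeps getting the same answer as the
    percentage grows (10% -> 20% -> 50% -> 100%).
    """
    digest = hashlib.sha256(f"{flag_name}:{user_id}".encode()).hexdigest()
    bucket = int(digest, 16) % 100
    return bucket < rollout_percentage

# Example: expose the new experience to 10% of users to start.
print(in_rollout("user-42", "new_checkout", rollout_percentage=10))
```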

Progressive delivery vs continuous delivery
Continuous delivery methods automate build, test, and deployment pipelines, so engineering teams ship code frequently and reliably. But once deployed, changes are exposed to all users immediately, with no failsafe.
This is where progressive delivery differs. It adds a layer of control where you can still deploy frequently through your CI/CD pipeline, but you decide when and to whom the features become visible.
In continuous delivery, you mitigate risk primarily through pre-production testing, trying to catch issues before they reach end users (the control happens at the infrastructure and pipeline level). With progressive delivery, you spread that risk through limited exposure and quick rollbacks using tools like feature flags (the control happens at the software level).
How engineering teams have “shifted left” to progressive delivery and observability
Fifteen or so years ago, the classic 3-tier application stack (web server, app server, database) ran on relatively static infrastructure. You made changes infrequently, and traditional monitoring tools used simple statistical baselining to detect problems.
But today, the picture looks drastically different. Your infrastructure runs on multi-layered virtualised stacks spanning virtual machines (VMs), Kubernetes clusters, and serverless platforms.
In many cases, teams are breaking down monoliths and embracing interconnected microservices with numerous third-party integrations throughout the architecture. And now, everyone wants to move faster, so deployments happen continuously rather than on fixed schedules.
This means it doesn’t make sense to introduce observability only after you’ve deployed the code. In fact, the more you shift left, moving testing and quality evaluation earlier in the process, the more problems you can prevent.
For instance, Flagsmith integrates with tools like Grafana to support this shift. You can define your observability requirements during design and collect the data you need to make better decisions throughout the development lifecycle.
How feature flagging and observability work together
Observability and progressive delivery work hand in hand. Feature flags enable controlled releases, and observability gives you the confidence to expand them by showing how features behave in production. Together, they create a feedback loop where data drives progressive rollout decisions.
Without proper observability, feature flags could feel risky. You might toggle features on or off but remain blind to their effects. Conversely, without feature flags, observability data might reveal problems, but you'd lack granular control to deal with them quickly.
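One simple way to close that loop is to attach the flag state to the telemetry you already emit, so your dashboards can segment every metric by variant. The sketch below assumes a generic flag client with an `is_enabled` method and a structured JSON logger; the exact wiring depends on your flag and observability tooling.

```python
import json
import logging
import time

logging.basicConfig(level=logging.INFO, format="%(message)s")
logger = logging.getLogger("checkout")

class StubFlags:
    """Stand-in for a real feature flag client (e.g. an SDK's flags object)."""
    def is_enabled(self, flag_name: str, user_id: str) -> bool:
        return user_id.endswith("7")  # placeholder targeting rule

def handle_checkout(user_id: str, flags) -> None:
    # Evaluate the flag once per request and remember which variant was served.
    new_flow_enabled = flags.is_enabled("new_checkout_flow", user_id)

    start = time.monotonic()
    # ... run whichever checkout implementation the flag selects ...
    duration_ms = (time.monotonic() - start) * 1000

    # Emit the flag state alongside the metric so Grafana or Dynatrace can
    # segment latency and error rates by variant.
    logger.info(json.dumps({
        "event": "checkout_completed",
        "user_id": user_id,
        "duration_ms": round(duration_ms, 2),
        "flags": {"new_checkout_flow": new_flow_enabled},
    }))

handle_checkout("user-37", StubFlags())
```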
In short, engineering teams that incorporate both these tools to deliver features progressively do the following:
- Reduce operational risk as fewer users experience problems
- Lower costs because issues are detected earlier
- Meet compliance requirements with detailed audit logs
How do feature flags power progressive delivery?
Progressive delivery requires you to deploy code without immediately exposing new functionality to users. Feature toggles enable this by letting you wrap code to control the release of this functionality.
A major benefit is that developers can merge code into the main branch more frequently: you don’t have to expose incomplete or untested features, and you don’t need to wait until they’re fully ready to deploy them. Crucially, these tools also give product teams more control over the release process without needing a developer to help.
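As a rough sketch of that workflow, here’s what wrapping an unfinished feature behind an off-by-default flag might look like. The flags client and the 'dashboard_redesign' flag are hypothetical; the point is that the code can live on the main branch and ship to production while the new path stays dark.

```python
def render_dashboard(user_id: str, flags) -> str:
    """Serve the redesigned dashboard only where the flag allows it."""
    # The flag defaults to off, so this code can be merged to main and
    # deployed while the redesign is still incomplete; nobody sees it
    # until someone deliberately enables 'dashboard_redesign'.
    if flags.is_enabled("dashboard_redesign", user_id):
        return "new dashboard"     # work in progress, dark by default
    return "legacy dashboard"      # current, stable behavior
```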
So, when you actually release and the data flows through your observability tools, you can see how the feature performs in a real user environment. The relationship between feature flags and observability runs both ways: observability shows you how a feature is performing, and you can use that data to turn flags on or off automatically once certain thresholds are crossed.
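Here’s a hedged sketch of what that automation could look like: a small job polls an error-rate metric and disables the flag if it crosses a threshold. The `get_error_rate` and `disable_flag` helpers are placeholders for your observability query and your flag platform’s management API; integrations like Flagsmith with Dynatrace can achieve the same result without custom code.

```python
import time

ERROR_RATE_THRESHOLD = 0.05   # disable the flag if more than 5% of requests fail
CHECK_INTERVAL_SECONDS = 60

def get_error_rate(flag_name: str) -> float:
    """Placeholder: query your observability tool for the error rate of
    requests served with `flag_name` enabled."""
    raise NotImplementedError

def disable_flag(flag_name: str) -> None:
    """Placeholder: call your flag platform's management API to turn the
    flag off for everyone."""
    raise NotImplementedError

def guard_rollout(flag_name: str) -> None:
    """Poll the metric and roll back automatically if it breaches the threshold."""
    while True:
        error_rate = get_error_rate(flag_name)
        if error_rate > ERROR_RATE_THRESHOLD:
            disable_flag(flag_name)
            print(f"{flag_name} disabled: error rate {error_rate:.1%} exceeded threshold")
            break
        time.sleep(CHECK_INTERVAL_SECONDS)
```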
In this webinar, Kyle Johnson, Co-Founder of Flagsmith, and Andreas Grabner, Global DevRel at Dynatrace, explained how the Flagsmith and Dynatrace integration helps engineering teams:
- Directly correlate user experiences with specific feature configurations. For example, you can see if users with a new payment processing flow enabled are experiencing higher error rates than those with the feature disabled.
- Segment metrics based on feature flag states to compare different variations and see what works best.
- Set up automated responses based on specific thresholds, allowing for direct comparison between different variations.
Plus, your entire team eventually becomes more comfortable shipping incremental changes when they know the release process includes safety mechanisms.
How to build a progressive delivery pipeline with the right tech stack
Let’s say you're the lead developer for a mid-sized banking application. Your team has built a redesigned onboarding flow that promises to increase conversion rates and reduce customer support tickets. To release it safely, you can use progressive delivery. With Flagsmith, Grafana, and Dynatrace, your rollout could look like this:
- Create the flag: Using Flagsmith, set up a flag called ‘newonboardingflow’ to control access to the new onboarding module (see the SDK sketch after this list).
- Conduct a gradual rollout: Incrementally expose the feature to larger user segments. For example, you might start with 10% and increase the percentage to 20%, 30%, and so on.
- Monitor performance: Track performance and user metrics across both versions. With Grafana, you can use annotated queries to see how increasing the percentage affects user behavior. And with Dynatrace, you can monitor issues in real time and mark the feature as “healthy” or “critical” depending on the data.
- Expand or roll back: Based on the observability data, either continue the rollout or revert to the previous version.
🏴 Note: Flagsmith also has a “Feature Health” capability that lets you monitor the feature’s status and performance within the platform.
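For reference, here’s a minimal sketch of how the application code could check ‘newonboardingflow’ with the Flagsmith Python SDK, assuming its documented pattern of identity-based flag evaluation. The environment key and user identifier are placeholders, and the 10%/20%/30% split itself is configured as a percentage rollout in the Flagsmith dashboard rather than in code.

```python
from flagsmith import Flagsmith  # pip install flagsmith

# Server-side environment key from your Flagsmith project (placeholder value).
flagsmith = Flagsmith(environment_key="ser.your-server-side-key")

def get_onboarding_variant(user_id: str) -> str:
    # Evaluate flags for a specific identity so the percentage rollout and
    # segment rules configured in the Flagsmith dashboard apply per user.
    identity_flags = flagsmith.get_identity_flags(identifier=user_id)

    if identity_flags.is_feature_enabled("newonboardingflow"):
        return "new"      # redesigned onboarding module
    return "legacy"       # existing onboarding flow

print(get_onboarding_variant("user-1234"))
```

From here, increasing the rollout percentage in the dashboard changes who gets the new flow without any further code changes.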
What are the common pitfalls to avoid when using progressive delivery?
Here are the most common issues teams encounter when implementing progressive delivery—and how to avoid them:
1. They neglect feature flag archiving and cleanup
Feature flags are inherently temporary control points, with a few exceptions like kill switches. But leaving them in your codebase after they’ve served their purpose creates "flag debt" that bloats the code and can cause serious unintended app behavior.
You might add a flag today for a major release, but six months later, no one remembers why it exists or whether they should remove it.
Pro tip: Follow feature flag best practices like creating a flag lifecycle policy and naming convention to avoid this issue. Document the purpose of each flag along with on and off dates. If it’s a long-lived flag, explain why that’s the case.
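One lightweight way to make that policy enforceable is to keep a machine-readable flag registry next to the code so a CI check or cleanup script can flag overdue entries. The structure, names, and dates below are purely illustrative, not a Flagsmith feature.

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class FlagRecord:
    """Entry in a simple flag registry that a cleanup script or CI check can read."""
    name: str
    owner: str
    purpose: str
    created: date
    remove_by: date | None   # None only for documented long-lived flags (e.g. kill switches)
    long_lived_reason: str | None = None

FLAG_REGISTRY = [
    FlagRecord(
        name="newonboardingflow",
        owner="growth-team",
        purpose="Gradual rollout of the redesigned onboarding module",
        created=date(2025, 1, 15),
        remove_by=date(2025, 4, 15),
    ),
    FlagRecord(
        name="payments_kill_switch",
        owner="platform-team",
        purpose="Disable payment processing during provider incidents",
        created=date(2024, 6, 1),
        remove_by=None,
        long_lived_reason="Operational kill switch, intentionally permanent",
    ),
]
```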
2. They don’t set up the observability tools properly
If you can’t see what’s happening with your flag, it could lead to blind spots in your development and production environment.
For example, a team might roll out a new driving license application portal to 5% of users. But without proper monitoring, they miss that the new portal is increasing onboarding times by 30%.
Pro tip: Think about the three pillars of observability: metrics, logs, and traces. Set up your monitoring so it covers both user impact (conversion, drop-off, task completion) and technical performance (latency, error rates).
3. They don’t fix misalignment between developers and product teams
Let’s say your product team wants to roll out a feature to a specific customer segment, and they’ve decided to reuse an existing flag. That flag may already have a purpose; for example, your developers could've set it up for another test. Without the context (and documentation) to be sure the flag is safe to reuse, toggling it can create more problems than it solves.
You flip the switch, and the next thing you know, your entire app is experiencing downtime.
Pro tip: Document every flag’s purpose in a living document to share with the product team. Also, make sure you implement role-based access (RBAC) to only allow the right stakeholders to make changes to the flags.
4. They complicate the flag’s usage and don’t account for nested dependencies
You might become so comfortable with feature flags that you start implementing them without realising how they affect the entire live environment.
Never nest flags more than one level deep, and do this sparingly. You want to avoid creating a web of conditionals that complicate observability and performance measurement.
Pro tip: Always check if a flag impacts other components in your environment. But if you want to avoid this issue altogether, maintain solid documentation around feature flags. You can use a feature flagging platform like Flagsmith to track these dependencies.
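To make the nesting problem concrete, here’s an illustrative comparison of a nested flag check against a flattened version; the flag names and the flags client are hypothetical.

```python
def checkout_nested(user_id: str, flags) -> str:
    # Anti-pattern: nesting one flag inside another creates extra code paths
    # to test and observe, and the inner flag silently depends on the outer one.
    if flags.is_enabled("new_checkout", user_id):
        if flags.is_enabled("express_shipping", user_id):
            return "new checkout + express shipping"
        return "new checkout"
    return "legacy checkout"

def checkout_flat(user_id: str, flags) -> str:
    # Better: evaluate each flag once, make the dependency explicit, and
    # branch in a single place so each variant is easy to measure.
    new_checkout = flags.is_enabled("new_checkout", user_id)
    express = new_checkout and flags.is_enabled("express_shipping", user_id)

    if express:
        return "new checkout + express shipping"
    if new_checkout:
        return "new checkout"
    return "legacy checkout"
```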
5. They silo flag data from other platforms in their tech stack
If you use a feature flagging platform, don’t let the data just sit there. Consider integrating it with tools like Grafana or Dynatrace to let them talk to each other and connect the dots between flag changes and impact.
For instance, your team might struggle to connect a sudden increase in drop-off rates to a flag change if the tools don’t share data. As a result, you respond more slowly because you spend hours just teasing out the root cause.
Pro tip: If you don’t have access to specific integrations, use webhooks or APIs to push the flag analytics data into other relevant data tools.
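As a sketch of that approach, the snippet below posts an annotation to Grafana’s HTTP annotations API whenever it’s notified of a flag change. The webhook payload shape is an assumption (check your flag platform’s webhook documentation), and the Grafana URL and token are placeholders.

```python
import time
import requests  # pip install requests

GRAFANA_URL = "https://grafana.example.com"   # placeholder
GRAFANA_TOKEN = "glsa_xxx"                    # placeholder service account token

def annotate_flag_change(flag_name: str, new_state: bool) -> None:
    """Post a Grafana annotation so dashboards show exactly when a flag changed."""
    response = requests.post(
        f"{GRAFANA_URL}/api/annotations",
        headers={"Authorization": f"Bearer {GRAFANA_TOKEN}"},
        json={
            "time": int(time.time() * 1000),           # epoch milliseconds
            "tags": ["feature-flag", flag_name],
            "text": f"{flag_name} set to {'on' if new_state else 'off'}",
        },
        timeout=5,
    )
    response.raise_for_status()

# In practice you'd call this from a webhook handler that receives flag-change
# events from your flag platform; the payload shape here is an assumption.
def handle_flag_webhook(payload: dict) -> None:
    annotate_flag_change(payload["flag_name"], payload["enabled"])
```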
Progressive delivery de-risks deployment and lets you release with confidence
The tension between speed and stability doesn't need to be a zero-sum game. Progressive delivery doesn't ask you to choose between moving fast and breaking things: it lets you move fast while making the entire deployment process safer.
If you're still deploying features in an all-or-nothing approach, it’s time to make the switch. With progressive delivery, you can confidently ship code without any lingering anxiety.
You can make better decisions through real-world feedback and squash the dreaded 3 AM alerts.
Frequently asked questions
1. What are the types of progressive delivery methods?
Progressive delivery encompasses several related deployment strategies, such as:
- Canary deployments: route a small percentage of traffic to the new version of your application while directing the majority to the stable version.
- Blue-green deployments: maintain two identical production environments, with “blue” running the current version and “green” running the new one, then switch traffic once the new version is verified.
- Ring-based deployments: extend the canary concept by defining explicit “rings” of users for progressive exposure.
- A/B testing: run multiple implementations side by side to determine which performs better against the parameters you’ve set.
2. How does progressive delivery work with containerised environments like Kubernetes?
Kubernetes provides capabilities like rolling updates that gradually replace pods, while extensions such as Flagger and Argo Rollouts add sophisticated canary and blue-green patterns. Feature flags complement Kubernetes by controlling functionality within containers—giving you both infrastructure-level and application-level progressive delivery capabilities.
3. What’s the recommended tech stack for using progressive delivery?
Ideally, you need a combination of feature flagging and observability platforms. For example:
- Feature flagging: Flagsmith provides feature flag management with flexible deployment options, comprehensive targeting, and integrations with observability tools.
- Observability (flag impact): Grafana delivers customisable dashboards with annotations that correlate feature flag changes to performance metrics.
- Observability (issue detection): Dynatrace offers deep application monitoring with AI-powered problem detection that captures flag states and SLO monitoring to validate release quality.