This article originally appeared on Anthony Nocentino’s blog. It has been republished with the author’s credit and consent.
In this post, the fifth in our series, I want to illustrate an example of using the T-SQL snapshot backup feature in SQL Server 2022 to seed availability groups (AGs) with storage-based snapshots. Efficiently seeding an availability group is essential for maintaining high availability and ensuring effective disaster recovery. With the introduction of T-SQL snapshot backup in SQL Server 2022, SQL Server can now coordinate application-consistent snapshots taken at the storage layer. This advancement significantly speeds up the initialization of secondary replicas, particularly in environments with large databases.
This post will walk through a PowerShell script that effectively seeds an AG using T-SQL snapshot backup, dbatools, and Pure Storage® FlashArray™.
You can find the complete script for this blog post on GitHub.
Why Is This Important?
If you’ve been working with AGs, you’re likely familiar with replica seeding, sometimes referred to as initializing, preparing, or data synchronization. Seeding is a data operation that involves copying data from a primary replica to one or more secondary replicas. This process is necessary before a database can join an AG. Typically, you can seed a replica through backup and restore or automatic seeding, each of which comes with its own challenges. Regardless of the method you choose, the seeding operation can be time-consuming. The duration of the seeding process depends on several factors, including the size of the database, network speed, and storage capabilities. If you have multiple replicas to seed, the time involved multiplies accordingly!
But what if I told you that you could seed your availability group from a storage-based snapshot and that the re-seeding process could be nearly instantaneous?
This method saves time and reduces the CPU, network, and disk resources consumed by traditional direct seeding and backup and restore processes.
The Scenario
We have two SQL Server 2022 instances, each with:
- An availability group configured with two replicas. I won't cover creating an availability group in this post, so I'll assume the AG is already up and running with both instances configured as replicas, and that the database is online on the primary but not yet joined to the AG or present on the secondary.
- Storage volumes hosted on a Pure Storage FlashArray system. Each SQL Server has a volume allocated on a FlashArray system.
- Protection groups ensuring consistent snapshots. Most databases are spread across multiple volumes. For snapshot backup to work correctly, the volumes must be snapshotted at exactly the same point in time. A protection group guarantees that a snapshot happens simultaneously across all volumes in the group. A protection group is also required to replicate snapshots between FlashArray systems.
- Asynchronous replication between FlashArray systems. You can perform this process on a single array, but if we're talking about availability, I want the data on two separate arrays. So we replicate the snapshot between two storage arrays, and that replicated snapshot is used to seed the replica. Cool sidebar here: If you need to scale out read replicas, clone the snapshot to several AG replicas on the same array and you'll benefit from data reduction across those clones.
- dbatools and Pure Storage PowerShell SDK2 installed. We’re using these modules to coordinate the work in this script.
Setting Up the Environment
The code below defines key infrastructure components, including primary and secondary SQL Server replicas, AG details, and FlashArray volumes. Using PowerShell remoting, we establish a session with the secondary replica, create persistent SMO connections to both SQL Server instances, and build REST API sessions with the FlashArray systems our SQL Servers' volumes are on. This setup lays the foundation for automating the AG tasks later in the script. As in the previous posts in this series, I'm implementing this with the PureStoragePowerShellSDK2 and dbatools PowerShell modules.
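Here's a minimal sketch of that setup. Every name below, including instance names, array endpoints, the protection group, and volume names, is a placeholder you'd swap for your environment:

```powershell
# Modules doing the coordination: dbatools for SQL Server, SDK2 for FlashArray
Import-Module dbatools
Import-Module PureStoragePowerShellSDK2

# Infrastructure definitions; every name below is a placeholder for your environment
$PrimarySqlInstance     = 'aghost1'                   # Primary replica
$SecondarySqlInstance   = 'aghost2'                   # Secondary replica
$AgName                 = 'ag1'                       # Availability group name
$DbName                 = 'TestDB1'                   # Database to seed
$BackupShare            = '\\FILESERVER\BACKUP'       # Share reachable by both replicas
$PrimaryArrayEndpoint   = 'flasharray1.example.com'   # Array hosting the primary's volumes
$SecondaryArrayEndpoint = 'flasharray2.example.com'   # Array hosting the secondary's volumes
$PgName                 = 'aghost1-pg'                # Protection group holding the primary's volumes
$SecondaryVolumeNames   = @('aghost2-data', 'aghost2-log')  # Secondary's database volumes

# PowerShell remoting session to the secondary replica for disk operations
$SecondarySession = New-PSSession -ComputerName $SecondarySqlInstance

# Persistent SMO connections to both instances via dbatools
$PrimarySqlConnection   = Connect-DbaInstance -SqlInstance $PrimarySqlInstance -TrustServerCertificate -NonPooledConnection
$SecondarySqlConnection = Connect-DbaInstance -SqlInstance $SecondarySqlInstance -TrustServerCertificate -NonPooledConnection

# REST API sessions to both FlashArray systems
$Credential          = Get-Credential
$PrimaryFlashArray   = Connect-Pfa2Array -EndPoint $PrimaryArrayEndpoint -Credential $Credential -IgnoreCertificateError
$SecondaryFlashArray = Connect-Pfa2Array -EndPoint $SecondaryArrayEndpoint -Credential $Credential -IgnoreCertificateError
```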
Take the Snapshot Backup on the Primary’s FlashArray
Now, we're ready to take an application-consistent snapshot of our database. First, we freeze write I/O with SUSPEND_FOR_SNAPSHOT_BACKUP, then trigger a protection group snapshot and replicate it. Finally, we take a metadata-only backup that embeds the snapshot details, which keeps the backup consistent and integrated with FlashArray replication. Let's walk through the code blocks below.
On the primary replica’s FlashArray:
Freeze write I/O on the database using ALTER DATABASE [$DbName] SET SUSPEND_FOR_SNAPSHOT_BACKUP = ON:
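A sketch of that call, reusing the connection from the setup above:

```powershell
# Suspend write I/O on the database so the storage snapshot is application consistent;
# the freeze holds until the metadata-only backup completes
$Query = "ALTER DATABASE [$DbName] SET SUSPEND_FOR_SNAPSHOT_BACKUP = ON"
Invoke-DbaQuery -SqlInstance $PrimarySqlConnection -Database master -Query $Query -Verbose
```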
Take a snapshot of the protection group and replicate it to our other array:
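Using the FlashArray session from the setup, something like:

```powershell
# Snapshot the protection group; -ForReplication starts replicating the snapshot
# to the protection group's target array right away
$Snapshot = New-Pfa2ProtectionGroupSnapshot -Array $PrimaryFlashArray `
    -SourceName $PgName -ForReplication $true -ApplyRetention $true
$Snapshot
```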
Execute a BACKUP DATABASE [TestDB1] TO DISK = '\\FILESERVER\BACKUP\$BackupFile' WITH METADATA_ONLY command. This takes a metadata-only backup of the database, which automatically unfreezes write I/O if successful. We'll use the MEDIADESCRIPTION parameter to hold information about our snapshot.
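A sketch of that backup, assuming the $Snapshot object from the previous step; the file naming here is illustrative:

```powershell
# Build a unique backup file name, then take the metadata-only backup, stashing
# the snapshot's name in MEDIADESCRIPTION so the restore can find it later
$BackupFile = "$BackupShare\$DbName-$(Get-Date -Format 'yyyyMMddHHmmss').bkm"
$Query = "BACKUP DATABASE [$DbName] TO DISK = '$BackupFile' " +
         "WITH METADATA_ONLY, MEDIADESCRIPTION = '$($Snapshot.Name)'"
Invoke-DbaQuery -SqlInstance $PrimarySqlConnection -Database master -Query $Query -Verbose
```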
The BACKUP command generates a metadata file that describes what’s in the backup. We’ll need this later to restore the database on the secondary replica.
Let’s talk about snapshot replication for a second.
The first time FlashArray replicates a snapshot between arrays, it moves the data in its data-reduced form. On SQL Server workloads, FlashArray generally sees about a 3.58:1 data reduction. This reduces the time needed to seed the secondary replica on the secondary array, since less data has to be replicated. This technique is immensely helpful in scenarios where you have to seed a replica in a DR site or cloud over a WAN or VPN link.
Now, if this were a re-seed of a replica, when we take a snapshot of the primary replica's array and replicate it to the secondary's array, only data that has changed on the primary's array and isn't yet on the secondary's array is copied over the wire. This dramatically reduces the amount of data that needs to be replicated and the time it takes to re-seed that secondary replica. If this is a multi-terabyte database or set of databases, the time savings are enormous.
Get the Snapshot on the Secondary’s Array
This loop ensures the snapshot is fully replicated between the FlashArray systems before we proceed. It continuously checks replication progress, logging updates and pausing as needed until the transfer completes, which guarantees the snapshot is on the target array before we move on.
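A sketch of that polling loop, assuming the transfer object reports progress as a 0-to-1 value:

```powershell
# Poll the snapshot's transfer status until replication to the secondary array completes
do {
    $Transfer = Get-Pfa2ProtectionGroupSnapshotTransfer -Array $PrimaryFlashArray -Name $Snapshot.Name
    if ($Transfer.Progress -lt 1) {
        Write-Output "Snapshot replication progress: $([math]::Round($Transfer.Progress * 100))%"
        Start-Sleep -Seconds 5
    }
} until ($Transfer.Progress -ge 1)
Write-Output "Snapshot $($Snapshot.Name) is on the secondary array."
```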
Offline the Volumes on the Secondary and Update the Volumes’ Contents from the Snapshot
Now, on the secondary replica, we need to update the volumes with clones of the volumes in the snapshot. This refreshes the data on the secondary replica with the data in the snapshot. Here’s the code for that:
Offline the volume(s) supporting the database:
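A sketch, assuming we can match Windows disks to FlashArray volumes by serial number; how the disks present can vary with vVols, RDMs, or physical servers:

```powershell
# Look up the serial numbers of the secondary's volumes on the array, then offline
# the matching Windows disks over the remoting session
$VolumeSerials = (Get-Pfa2Volume -Array $SecondaryFlashArray -Name $SecondaryVolumeNames).Serial
Invoke-Command -Session $SecondarySession -ScriptBlock {
    Get-Disk | Where-Object { $_.SerialNumber -in $using:VolumeSerials } |
        Set-Disk -IsOffline $true
}
```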
Overwrite the volumes on the secondary from the protection group snapshot with New-Pfa2Volume:
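A sketch of that refresh; the volume-to-volume mapping and the replicated snapshot's name prefix (the source array's name) are assumptions you'd confirm on your arrays:

```powershell
# On the secondary array, the replicated snapshot is addressed with the source
# array's name as a prefix, e.g. 'flasharray1:aghost1-pg.140'; this mapping of
# primary volumes to secondary volumes is an assumption for this sketch
$VolumeMap = @{
    'aghost1-data' = 'aghost2-data'
    'aghost1-log'  = 'aghost2-log'
}
foreach ($SourceVolume in $VolumeMap.Keys) {
    New-Pfa2Volume -Array $SecondaryFlashArray `
        -Name $VolumeMap[$SourceVolume] `
        -SourceName "flasharray1:$($Snapshot.Name).$SourceVolume" `
        -Overwrite $true
}
```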
Online the volume(s) on the secondary:
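And the reverse of the offline step:

```powershell
# Bring the refreshed disks back online on the secondary
Invoke-Command -Session $SecondarySession -ScriptBlock {
    Get-Disk | Where-Object { $_.SerialNumber -in $using:VolumeSerials } |
        Set-Disk -IsOffline $false
}
```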
You'll want to ensure your volume names and drive letters/mount points match the primary's layout; if this database was in the availability group before, they most likely already do. I'm using VMware VMs here with vVols attached, but this technique also works with RDMs and physical servers.
Restore the Database from Snapshot Backup with NORECOVERY on the Secondary
With the data on the volumes updated and attached to the secondary replica, you can restore the snapshot backup on the secondary replica. The critical thing here is the NORECOVERY option; since we’re seeding an AG, the database state needs to be RESTORING.
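A sketch, using the $BackupFile path from the metadata-only backup step:

```powershell
# Restore from the metadata-only backup; REPLACE overwrites the existing database
# and NORECOVERY leaves it in RESTORING, ready to be seeded into the AG
$Query = "RESTORE DATABASE [$DbName] FROM DISK = '$BackupFile' " +
         "WITH METADATA_ONLY, REPLACE, NORECOVERY"
Invoke-DbaQuery -SqlInstance $SecondarySqlConnection -Database master -Query $Query -Verbose
```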
Finalize the Seeding of the Replica and Join the AG
From here on out, since the database is in a RESTORING state on the secondary replica, we’re looking at standard availability group manual seeding.
Take a log backup on the primary:
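A quick dbatools call for that:

```powershell
# Take a log backup on the primary to the shared backup location
$LogBackup = Backup-DbaDatabase -SqlInstance $PrimarySqlConnection `
    -Database $DbName -Type Log -Path $BackupShare
```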
Restore it on the secondary:
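A sketch of the restore, piping the backup history object across:

```powershell
# Restore that log backup on the secondary, keeping the database in RESTORING
$LogBackup | Restore-DbaDatabase -SqlInstance $SecondarySqlConnection `
    -DatabaseName $DbName -Continue -NoRecovery -TrustDbBackupHistory
```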
Set the seeding mode on the secondary to manual:
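With dbatools, something like:

```powershell
# Set the secondary replica's seeding mode to manual; we've already done the seeding
Set-DbaAgReplica -SqlInstance $PrimarySqlConnection -AvailabilityGroup $AgName `
    -Replica $SecondarySqlInstance -SeedingMode Manual -Confirm:$false
```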
Add the database to the availability group:
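The plain T-SQL for that, executed on the primary:

```powershell
# On the primary, add the database to the availability group
Invoke-DbaQuery -SqlInstance $PrimarySqlConnection -Database master `
    -Query "ALTER AVAILABILITY GROUP [$AgName] ADD DATABASE [$DbName]"
```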
Start data movement:
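On the secondary:

```powershell
# On the secondary, join the restored database to the AG and start data movement
Invoke-DbaQuery -SqlInstance $SecondarySqlConnection -Database master `
    -Query "ALTER DATABASE [$DbName] SET HADR AVAILABILITY GROUP = [$AgName]"
```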
Now let's check the status of the AG and confirm that the SynchronizationState is Synchronized:
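A sketch of that check with dbatools:

```powershell
# Report the database's synchronization state in the AG
Get-DbaAgDatabase -SqlInstance $PrimarySqlConnection -AvailabilityGroup $AgName -Database $DbName |
    Select-Object SqlInstance, Name, SynchronizationState
```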
Wrapping Things Up
In this post, the fifth in our series, we used the T-SQL snapshot backup feature to seed an availability group replica. First, this helps increase the availability of your database systems: if a replica has failed and is offline, your system is vulnerable should another replica fail, and this technique lets you quickly bring your systems back to full protection. Further, as a DBA, you won't have to sit around and monitor the re-seeding process, freeing you to focus on other tasks in your organization.
You can grab the whole script for this blog post on GitHub.