PGO v4 to PGO v5
You can upgrade from PGO v4 to PGO v5 through a variety of methods by following this guide. There are several methods that can be used to upgrade: we present these methods based upon a variety of factors, including:
- Redundancy / ability to roll back
- Available resources
- Downtime preferences
and others.
These methods include:
- Migrating Using Data Volumes. This allows you to migrate from v4 to v5 using the existing data volumes that you created in v4. This is the simplest method for upgrade and is the most resource efficient, but you will have a greater potential for downtime using this method.
- Migrate From Backups. This allows you to create a Postgres cluster with v5 from the backups taken with v4. This provides a way for you to create a preview of your Postgres cluster through v5, but you would need to take your applications offline to ensure all the data is migrated.
- Migrate Using a Standby Cluster. This allows you to run a v4 and a v5 Postgres cluster in parallel, with data replicating from the v4 cluster to the v5 cluster. This method minimizes downtime and lets you preview your v5 environment, but is the most resource intensive.
You should choose the method that makes the most sense for your environment. Each method is described in detail below.
Prerequisites
There are several prerequisites for using any of these upgrade methods.
- PGO v4 is currently installed within the Kubernetes cluster, and is actively managing any existing v4 PostgreSQL clusters.
Any PGO v4 clusters being upgraded have been properly initialized using PGO v4, which means the v4
pgcluster
custom resource should be in apgcluster Initialized
status:$ kubectl get pgcluster hippo -o jsonpath='{ .status }' {"message":"Cluster has been initialized","state":"pgcluster Initialized"}
The PGO v4
pgo
client is properly configured and available for use.PGO v5 is currently installed within the Kubernetes cluster.
For these examples, we will use a Postgres cluster named hippo
.
Upgrade Method #1: Data Volumes
This upgrade method allows you to migrate from PGO v4 to PGO v5 using the existing data volumes that were created in PGO v4. Note that this is an “in place” migration method: this will immediately move your Postgres clusters from being managed by PGO v4 and PGO v5. If you wish to have some failsafes in place, please use one of the other migration methods. Please also note that you will need to perform the cluster upgrade in the same namespace as the original cluster in order for your v5 cluster to access the existing PVCs.
Step 1: Prepare the PGO v4 Cluster for Migration
You will need to set up your PGO v4 Postgres cluster so that it can be migrated to a PGO v5 cluster. The following describes how to set up a PGO v4 cluster for using this migration method.
- Scale down any existing replicas within the cluster. This will ensure that the primary PVC does not change again prior to the upgrade.
You can get a list of replicas using the pgo scaledown --query
command, e.g.:
pgo scaledown hippo --query
If there are any replicas, you will see something similar to:
Cluster: hippo
REPLICA STATUS NODE ...
hippo running node01 ...
Scaledown any replicas that are running in this cluser, e.g.:
pgo scaledown hippo --target=hippo
2. Once all replicas are removed and only the primary remains, proceed with deleting the cluster while retaining the data and backups. You can do this --keep-data
and --keep-backups
flags:
You MUST run this command with the --keep-data
and --keep-backups
flag otherwise you risk deleting ALL of your data.
pgo delete cluster hippo --keep-data --keep-backups
3. The PVC for the primary Postgres instance and the pgBackRest repository should still remain. You can verify this with the command below:
kubectl get pvc --selector=pg-cluster=hippo
This should yield something similar to:
NAME STATUS VOLUME ...
hippo-jgut Bound pvc-a0b89bdb- ...
hippo-pgbr-repo Bound pvc-25501671- …
A third PVC used to store write-ahead logs (WAL) may also be present if external WAL volumes were enabled for the cluster.
Step 2: Migrate to PGO v5
With the PGO v4 cluster’s volumes prepared for the move to PGO v5, you can now create a PostgresCluster
custom resource using these volumes. This migration method does not carry over any specific configurations or customizations from PGO v4: you will need to create the specific PostgresCluster
configuration that you need.
26
to match the postgres
group used in PGO v5.
Please see here for more information.To complete the upgrade process, your PostgresCluster
custom resource MUST include the following:
1. A volumes
data source that points to the PostgreSQL data, PostgreSQL WAL (if applicable) and pgBackRest repository PVCs identified in the spec.dataSource.volumes
section.
For example, using the hippo
cluster:
spec:
dataSource:
volumes:
pgDataVolume:
pvcName: hippo-jgut
directory: "hippo-jgut"
pgBackRestVolume:
pvcName: hippo-pgbr-repo
directory: "hippo-backrest-shared-repo"
# Only specify external WAL PVC if enabled in PGO v4 cluster. If enabled
# in v4, a WAL volume must be defined for the v5 cluster as well.
# pgWALVolume:
# pvcName: hippo-jgut-wal
Please see the Data Migration section of the tutorial for more details on how to properly populate this section of the spec when migrating from a PGO v4 cluster.
2. If you customized Postgres parameters, you will need to ensure they match in the PGO v5 cluster. For more information, please review the tutorial on customizing a Postgres cluster.
3. Once the PostgresCluster
spec is populated according to these guidelines, you can create the PostgresCluster
custom resource. For example, if the PostgresCluster
you’re creating is a modified version of the postgres
example in the PGO examples repo, you can run the following command:
kubectl apply -k examples/postgrescluster
Your upgrade is now complete! You should now remove the spec.dataSource.volumes
section from your PostgresCluster
. For more information on how to use PGO v5, we recommend reading through the PGO v5 tutorial.
Upgrade Method #2: Backups
This upgrade method allows you to migrate from PGO v4 to PGO v5 by creating a new PGO v5 Postgres cluster using a backup from a PGO v4 cluster. This method allows you to preserve the data in your PGO v4 cluster while you transition to PGO v5. To fully move the data over, you will need to incur downtime and shut down your PGO v4 cluster.
Step 1: Prepare the PGO v4 Cluster for Migration
1. Ensure you have a recent backup of your cluster. You can do so with the pgo backup
command, e.g.:
pgo backup hippo
Please ensure that the backup completes. You will see the latest backup appear using the pgo show backup
command.
2. Next, delete the cluster while keeping backups (using the --keep-backups
flag):
pgo delete cluster hippo --keep-backups
26
to match the postgres
group used in PGO v5.
Please see here for more information.Step 2: Migrate to PGO v5
With the PGO v4 Postgres cluster’s backup repository prepared, you can now create a PostgresCluster
custom resource. This migration method does not carry over any specific configurations or customizations from PGO v4: you will need to create the specific PostgresCluster
configuration that you need.
To complete the upgrade process, your PostgresCluster
custom resource MUST include the following:
1. You will need to configure your pgBackRest repository based upon whether you are using a PVC to store your backups, or an object storage system such as S3/GCS. Please follow the directions based upon the repository type you are using as part of the migration.
PVC-based Backup Repository
When migrating from a PVC-based backup repository, you will need to configure a pgBackRest repo of a spec.backups.pgbackrest.repos.volume
under the spec.backups.pgbackrest.repos.name
of repo1
. The volumeClaimSpec
should match the attributes of the pgBackRest repo PVC being used as part of the migration, i.e. it must have the same storageClassName
, accessModes
, resources
, etc. Please note that you will need to perform the cluster upgrade in the same namespace as the original cluster in order for your v5 cluster to access the existing PVCs. For example:
spec:
backups:
pgbackrest:
repos:
- name: repo1
volume:
volumeClaimSpec:
storageClassName: standard-wffc
accessModes:
- "ReadWriteOnce"
resources:
requests:
storage: 1Gi
S3 / GCS Backup Repository
When migrating from a S3 or GCS based backup repository, you will need to configure your spec.backups.pgbackrest.repos.volume
to point to the backup storage system. For instance, if AWS S3 storage is being utilized, the repo would be defined similar to the following:
spec:
backups:
pgbackrest:
repos:
- name: repo1
s3:
bucket: hippo
endpoint: s3.amazonaws.com
region: us-east-1
Any required secrets or desired custom pgBackRest configuration should be created and configured as described in the backup tutorial.
You will also need to ensure that the “pgbackrest-repo-path” configured for the repository matches the path used by the PGO v4 cluster. The default repository path follows the pattern /backrestrepo/<clusterName>-backrest-shared-repo
. Note that the path name here is different than migrating from a PVC-based repository.
Using the hippo
Postgres cluster as an example, you would set the following in the spec.backups.pgbackrest.global
section:
spec:
backups:
pgbackrest:
global:
repo1-path: /backrestrepo/hippo-backrest-shared-repo
2. Set the spec.dataSource
section to restore from the backups used for this migration. For example:
spec:
dataSource:
postgresCluster:
repoName: repo1
You can also provide other pgBackRest restore options, e.g. if you wish to restore to a specific point-in-time (PITR).
3. If you are using a PVC-based pgBackRest repository, then you will also need to specify a pgBackRestVolume data source that references the PGO v4 pgBackRest repository PVC:
spec:
dataSource:
volumes:
pgBackRestVolume:
pvcName: hippo-pgbr-repo
directory: "hippo-backrest-shared-repo"
postgresCluster:
repoName: repo1
4. If you customized other Postgres parameters, you will need to ensure they match in the PGO v5 cluster. For more information, please review the tutorial on customizing a Postgres cluster.
5. Once the PostgresCluster
spec is populated according to these guidelines, you can create the PostgresCluster
custom resource. For example, if the PostgresCluster
you’re creating is a modified version of the postgres
example in the PGO examples repo, you can run the following command:
kubectl apply -k examples/postgrescluster
WARNING: Once the PostgresCluster custom resource is created, it will become the owner of the PVC. This means that if the PostgresCluster is then deleted (e.g. if attempting to revert back to a PGO v4 cluster), then the PVC will be deleted as well.
If you wish to protect against this, first remove the reference to the pgBackRest PVC in the PostgresCluster spec:
kubectl patch postgrescluster hippo-pgbr-repo --type='json' -p='[{"op": "remove", "path": "/spec/dataSource/volumes"}]'
Then relabel the PVC prior to deleting the PostgresCluster custom resource. Below uses the hippo
Postgres cluster as an example:
kubectl label pvc hippo-pgbr-repo \
postgres-operator.crunchydata.com/cluster- \
postgres-operator.crunchydata.com/pgbackrest-repo- \
postgres-operator.crunchydata.com/pgbackrest-volume- \
postgres-operator.crunchydata.com/pgbackrest-
You will also need to remove all ownership references from the PVC:
kubectl patch pvc hippo-pgbr-repo --type='json' -p='[{"op": "remove", "path": "/metadata/ownerReferences"}]'
It is recommended to set the reclaim policy for any PV’s bound to existing PVC’s to Retain
to ensure data is retained in the event a PVC is accidentally deleted during the upgrade.
Your upgrade is now complete! For more information on how to use PGO v5, we recommend reading through the PGO v5 tutorial.
Upgrade Method #3: Standby Cluster
This upgrade method allows you to migrate from PGO v4 to PGO v5 by creating a new PGO v5 Postgres cluster in a “standby” mode, allowing it to mirror the PGO v4 cluster and continue to receive data updates in real time. This has the advantage of being able to fully inspect your PGO v5 Postgres cluster while leaving your PGO v4 cluster up and running, thus minimizing downtime when you cut over. The tradeoff is that you will temporarily use more resources while this migration is occurring.
This method only works if your PGO v4 cluster uses S3 or an S3-compatible storage system, or GCS. For more information on standby clusters, please refer to the tutorial.
Step 1: Migrate to PGO v5
Create a PostgresCluster
custom resource. This migration method does not carry over any specific configurations or customizations from PGO v4: you will need to create the specific PostgresCluster
configuration that you need.
To complete the upgrade process, your PostgresCluster
custom resource MUST include the following:
1. Configure your pgBackRest to use an object storage system such as S3/GCS. You will need to configure your spec.backups.pgbackrest.repos.volume
to point to the backup storage system. For instance, if AWS S3 storage is being utilized, the repo would be defined similar to the following:
spec:
backups:
pgbackrest:
repos:
- name: repo1
s3:
bucket: hippo
endpoint: s3.amazonaws.com
region: us-east-1
Any required secrets or desired custom pgBackRest configuration should be created and configured as described in the backup tutorial.
You will also need to ensure that the “pgbackrest-repo-path” configured for the repository matches the path used by the PGO v4 cluster. The default repository path follows the pattern /backrestrepo/<clusterName>-backrest-shared-repo
. Note that the path name here is different than migrating from a PVC-based repository.
Using the hippo
Postgres cluster as an example, you would set the following in the spec.backups.pgbackrest.global
section:
spec:
backups:
pgbackrest:
global:
repo1-path: /backrestrepo/hippo-backrest-shared-repo
2. A spec.standby
cluster configuration within the spec that is populated according to the name of pgBackRest repo configured in the spec. For example:
spec:
standby:
enabled: true
repoName: repo1
3. If you customized other Postgres parameters, you will need to ensure they match in the PGO v5 cluster. For more information, please review the tutorial on customizing a Postgres cluster.
4. Once the PostgresCluster
spec is populated according to these guidelines, you can create the PostgresCluster
custom resource. For example, if the PostgresCluster
you’re creating is a modified version of the postgres
example in the PGO examples repo, you can run the following command:
kubectl apply -k examples/postgrescluster
5. Once the standby cluster is up and running and you are satisfied with your set up, you can promote it.
First, you will need to shut down your PGO v4 cluster. You can do so with the following command, e.g.:
pgo update cluster hippo --shutdown
You can then update your PGO v5 cluster spec to promote your standby cluster:
spec:
standby:
enabled: false
Note: When the v5 cluster is running in non-standby mode, you will not be able to restart the v4 cluster, as that data is now being managed by the v5 cluster.
Once the v5 cluster is up and running, you will need to run the following SQL commands as a PostgreSQL superuser. For example, you can login as the postgres
user, or exec into the Pod and use psql
:
-- add the managed replication user
CREATE ROLE _crunchyrepl WITH LOGIN REPLICATION;
-- allow for the replication user to execute the functions required as part of "rewinding"
GRANT EXECUTE ON function pg_catalog.pg_ls_dir(text, boolean, boolean) TO _crunchyrepl;
GRANT EXECUTE ON function pg_catalog.pg_stat_file(text, boolean) TO _crunchyrepl;
GRANT EXECUTE ON function pg_catalog.pg_read_binary_file(text) TO _crunchyrepl;
GRANT EXECUTE ON function pg_catalog.pg_read_binary_file(text, bigint, bigint, boolean) TO _crunchyrepl;
The above step will be automated in an upcoming release.
Your upgrade is now complete! Once you verify that the PGO v5 cluster is running and you have recorded the user credentials from the v4 cluster, you can remove the old cluster:
pgo delete cluster hippo
For more information on how to use PGO v5, we recommend reading through the PGO v5 tutorial.
Additional Considerations
Upgrading to PGO v5 may result in a base image upgrade from EL-7 (UBI / CentOS) to EL-8 (UBI). Based on the contents of your Postgres database, you may need to perform additional steps.
Due to changes in the GNU C library glibc
in EL-8, you may need to reindex certain indexes in
your Postgres cluster. For more information, please read the
PostgreSQL Wiki on Locale Data Changes, how
you can determine if your indexes are affected, and how to fix them.