Seeking best practices and real-world experiences for managing, tuning, and operating PostgreSQL and MongoDB databases in production at scale

Hi everyone,

I’m looking to start a discussion around real-world database administration challenges for PostgreSQL and MongoDB in production environments.

Specifically, I’m interested in hearing from DBAs and platform engineers on:
:speaking_head: Common performance bottlenecks you see in production
:speaking_head: Best practices for backup, restore, and disaster recovery
:speaking_head: High availability and replication strategies that have worked well
:speaking_head: Monitoring and alerting tools you rely on day to day
:speaking_head: Lessons learned from incidents or migrations at scale

This would be especially helpful for teams running databases on cloud platforms (AWS/GCP/Azure) as well as hybrid or on-prem setups.

Looking forward to learning from the community and sharing experiences.

Thanks!