Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix two flakiness sources in test_scrubber_physical_gc_ancestors (#10457
) We currently have some flakiness in `test_scrubber_physical_gc_ancestors`, see #10391. The first flakiness kind is about the reconciler not actually becoming idle within the timeout of 30 seconds. We see continuous forward progress so this is likely not a hang. We also see this happen in parallel to a test failure, so is likely due to runners being overloaded. Therefore, we increase the timeout. The second flakiness kind is an assertion failure. This one is a little bit more tricky, but we saw in the successful run that there was some advance of the lsn between the compaction ran (which created layer files) and the gc run. Apparently gc rejects reductions to the single image layer setting if the cutoff lsn is the same as the lsn of the image layer: it will claim that that layer is newer than the space cutoff and therefore skip it, while thinking the old layer (that we want to delete) is the latest one (so it's not deleted). We address the second flakiness kind by inserting a tiny amount of WAL between the compaction and gc. This should hopefully fix things. Related issue: #10391 (not closing it with the merger of the PR as we'll need to validate that these changes had the intended effect). Thanks to Chi for going over this together with me in a call.
- Loading branch information
7e4a39e
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
7510 tests run: 7119 passed, 1 failed, 390 skipped (full report)
Failures on Postgres 16
test_layer_map[github-actions-selfhosted]
: release-x86-64Flaky tests (4)
Postgres 17
test_scrubber_physical_gc_ancestors[None]
: debug-x86-64Postgres 16
test_metrics_normal_work
: release-x86-64Postgres 15
test_scrubber_tenant_snapshot[4]
: release-arm64Postgres 14
test_metrics_normal_work
: release-x86-64Code coverage* (full report)
functions
:33.6% (8445 of 25111 functions)
lines
:49.2% (70802 of 143967 lines)
* collected from Rust tests only
7e4a39e at 2025-01-21T18:03:22.163Z :recycle: