Failures in test_nbtree_pagesplit_cycleid #10390
Comments
This should fix the largest source of flakiness in test_nbtree_pagesplit_cycleid.
## Problem
#10390
## Summary of changes
By using a guaranteed-flushed LSN, we ensure that PS won't have to wait forever. (If it does wait forever, we know the issue can't be with Compute's WAL.)
The biggest source of flakiness in the test has been reduced significantly since #10413; the remaining flakiness (about 1 failure every few days) is not yet fully understood.
I've come across a failure of this test when testing a sanitizer-enabled build on ARM:
and then reproduced it locally on x86_64, on iteration 1, when running 8 test instances (also with sanitizers) in parallel. I can reproduce it even after changing sleep(2) to sleep(10). I will try to investigate this.
By the way, I wonder whether [1, pg_relation_size('t_uidx'::regclass) / 8192] is the correct range of block numbers here.
Doesn't get_raw_page_at_lsn() return blocks numbered from 0? As far as I can see, get_raw_page_at_lsn() doesn't check the validity of blocknum (e.g. it works for blocknum = 100000 or blocknum = -1), but when requesting block 10 from the t_uidx relation, I get an all-zero page, while the page returned for block 0 contains some data.
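To make the off-by-one concern concrete, here is a minimal Python sketch of the two candidate ranges. It assumes only that PostgreSQL numbers relation blocks from 0 and that the block size is 8192 bytes, matching the divisor in the query. The helper names and the 10-block relation size are hypothetical, chosen for illustration; nothing here calls get_raw_page_at_lsn() itself.

```python
BLCKSZ = 8192  # assumed block size, matching the 8192 divisor in the query


def zero_based_blocks(relation_size_bytes: int, block_size: int = BLCKSZ) -> range:
    """Block numbers 0 .. n_blocks - 1 (hypothetical helper)."""
    n_blocks = relation_size_bytes // block_size
    return range(0, n_blocks)


def one_based_blocks(relation_size_bytes: int, block_size: int = BLCKSZ) -> range:
    """Block numbers 1 .. n_blocks, i.e. the range [1, size / 8192] from the query."""
    n_blocks = relation_size_bytes // block_size
    return range(1, n_blocks + 1)


# Hypothetical relation of exactly 10 blocks.
size = 10 * BLCKSZ
zero_based = zero_based_blocks(size)
one_based = one_based_blocks(size)

# Blocks covered by one range but not the other.
skipped = sorted(set(zero_based) - set(one_based))
past_end = sorted(set(one_based) - set(zero_based))
print("skipped by the 1-based range:", skipped)
print("past the end of the relation:", past_end)
```

If block numbering is indeed 0-based, the 1-based range skips block 0 (for a B-tree index, the metapage) and requests one block past the end of the relation, which would be consistent with seeing an all-zero page for the last requested block.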
Yes, I understand; my question was mostly about the blocknum passed to get_raw_page_at_lsn(): is it 0- or 1-based?
This test checks
Still trying to find out what makes the test fail, but I see the following difference with the modified query:
when it fails. I've also added
for debugging, and I am seeing the following:
(when the test passes)
(when the test fails). So maybe the index can be split in several ways?
@MMeent will check the remaining failures later this week |
https://neonprod.grafana.net/d/fddp4rvg7k2dcf/regression-test-failures?orgId=1&var-test_name=test_nbtree_pagesplit_cycleid&from=now-7d&to=now&timezone=utc&var-restrict=true&var-max_count=100&var-reference=$__all&var-ignore_reference=refs%2Fpull%2F0000%2Fmerge