System test: txn commit slowdown
See original GitHub issueProblem description
We are observing a very important slowdown of txn commits in system test runs. Here is a sample as observed by @shiveshr in a failed run of readtxnwritescalewithfailovertest
:
Time | # of commits
8:48 54
8:49 31
8:50 34
8:51 37
8:52 38
8:53 11
8:54 13
// scale started
8:55 12
8:56 6
8:57 5
8:58 4
8:59 5
9:00 4
9:01 3
9:02 3
9:03 1
9:04 1
9:05 3
9:06 3
9:07 3
9:08 3
9:09 2
9:10 2
9:11 2
9:12 2
9:13 1
Further investigation of the issue seems to indicate of a slowdown of the segment store to commit transactions after failover. It is not clear, however, what is causing the slow down precisely
Problem location Transaction commits, controller, segment store
Suggestions for an improvement Investigate the bug and determine the cause of the slowdown.
Issue Analytics
- State:
- Created 6 years ago
- Comments:30 (30 by maintainers)
Top Results From Across the Web
3 Hazards of Transactional Test Isolation - Atomic Spin
Although using database transactions to isolate your test suite is appealing, there are some important caveats to consider.
Read more >Why does TRANSACTION / COMMIT improve performance so ...
I recently tried to improve the speed of this process by using MySQL transactions using START TRANSACTION and COMMIT . The performance increase ......
Read more >Why Do Long Database Transactions Affect Performance?
When the both operations succeed then they succeed as a unit, there is nothing fancy for transaction here, it just commits.
Read more >Commit very slow - Oracle Communities
I am using the Java Edition 3.2.13 and have run tests with config.setTxnWriteNoSync(true) && config.setTxnWriteNoSync(false)
Read more >The trouble with transaction.atomic - David Seddon
atomic makes it easier to commit these mistakes), but something that can appear at any system that deals with transactions. The only thing...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
@shiveshr It is not a good idea to derive performance conclusions out of these tests because they are running on pretty scarce resources, we are not provisioning IO properly in these clusters. BK latency is much shorter than that in a regular deployment, there is enough evidence of that even in our own testing.
In any case, I just wanted to illustrate that the writes themselves were not taking seconds, so it is not BK that is inducing the multi-second latency we are observing.
I believe this is fixed.