Skip to content

[v25.1.x] r/stm_manager: added watchdog logging an error when stm did not stop #26413

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conversation

mmaslankaprv
Copy link
Member

Backport of PR #26259

State machines should be able to stop timely as any issue with stopping
a state machine may lead to a situation in which the whole partition
fails to stop. Added watchdog reporting an error when STM fails to stop.

Signed-off-by: Michał Maślanka <[email protected]>
(cherry picked from commit 9d4dbfb)
@mmaslankaprv mmaslankaprv added this to the v25.1.x-next milestone Jun 11, 2025
@mmaslankaprv mmaslankaprv added the kind/backport PRs targeting a stable branch label Jun 11, 2025
@mmaslankaprv
Copy link
Member Author

/dt

@vbotbuildovich
Copy link
Collaborator

vbotbuildovich commented Jun 11, 2025

CI test results

test results on build#67167
test_class test_method test_arguments test_kind job_url test_status passed reason
ControllerLogLimitMirrorMakerTests test_mirror_maker_with_limits ducktape https://buildkite.com/redpanda/redpanda/builds/67167#01975eff-cd36-41d4-9426-4c59dd3262b8 FLAKY 20/21 upstream reliability is '96.7741935483871'. current run reliability is '95.23809523809523'. drift is 1.5361 and the allowed drift is set to 50. The test should PASS
WriteCachingFailureInjectionE2ETest test_crash_all_with_consumer_group ducktape https://buildkite.com/redpanda/redpanda/builds/67167#01975ee5-18a6-4d03-a2c2-7502715584ef FLAKY 19/21 upstream reliability is '100.0'. current run reliability is '90.47619047619048'. drift is 9.52381 and the allowed drift is set to 50. The test should PASS
test results on build#67217
test_class test_method test_arguments test_kind job_url test_status passed reason
distributed_kv_stm_tests_rpunit distributed_kv_stm_tests_rpunit unit https://buildkite.com/redpanda/redpanda/builds/67217#01976055-d6b9-4e2b-8c8e-7deaf75ffc97 FLAKY 1/2

@mmaslankaprv mmaslankaprv marked this pull request as ready for review June 11, 2025 18:51
@piyushredpanda piyushredpanda merged commit 792b119 into redpanda-data:v25.1.x Jun 11, 2025
12 of 17 checks passed
@piyushredpanda piyushredpanda modified the milestones: v25.1.x-next, v25.1.5 Jun 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/build area/redpanda kind/backport PRs targeting a stable branch
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[v25.1.x] r/stm_manager: added watchdog logging an error when stm did not stop
4 participants