[exporter/clickhouse] Fix data race that caused panic in integration tests #38798

lim123123123 · 2025-03-19T12:32:34Z

Description

clickhouse exporter tests use their own sql drivers to validate the queries sent during the tests.
Each driver has a unique name (which is the same as the test name). This name must be passed to the sql.Open function during database initialization.

Unfortunately, instead of explicitly passing the driver name into the buildDB function, it was passed implicitly through a global variable. This approach works in production but leads to data races when tests are executed in parallel because each test writes its own name to driverName. As a result, integration tests expected driverName to be set to the standard "clickhouse", but instead, it retained the value set by the previous test. This caused the validation callbacks to be triggered incorrectly, leading to test failures (panics).

There were two possible solutions to fix this issue:

Hardcode the "clickhouse" constant in integration tests.
Pass driverName explicitly into sql.Open.
Although the first option might work, I believe it is not a careful way to write tests and code. A global variable that changes during tests introduces bugs that are difficult to debug and reproduce. Therefore, I prefer the second option.

Link to tracking issue

Fixes
Close #32530

…rName into config instead. Fix data race that write into global driverName from muptiple tests in parallel

SpencerTorres · 2025-03-20T02:55:56Z

I have a branch where I'm refactoring some of the driver code, this won't be a problem anymore in that version but this is a good fix to get the integration tests running again. Thanks!

lim123123123 · 2025-03-25T09:58:58Z

Hello, could somebody else review this mr pls? There are 40 loc and it blocks another fix(because it adds an integration test). @dmitryax @Frapschen @hanjm

Frapschen · 2025-04-03T03:25:01Z

I am little confuse, please give more examplianation about :

This approach works in production but leads to data races when tests are executed in parallel because each test writes its own name to driverName.

Frapschen · 2025-04-03T03:26:19Z

exporter/clickhouseexporter/integration_test.go

@@ -91,7 +90,7 @@ func getContainer(t *testing.T, req testcontainers.ContainerRequest) testcontain

 func verifyExportLog(t *testing.T, logExporter *logsExporter) {
 	mustPushLogsData(t, logExporter, simpleLogs(1))
-	db := sqlx.NewDb(logExporter.client, driverName)
+	db := sqlx.NewDb(logExporter.client, clickhouseDriverName)


what is the different use of a const string clickhouse between a var string clickhouse

It was a variable because some tests set this variable with mock names to record their actions. However, this behavior wasn't safe for concurrent use, especially for integration tests. If they were executed after other tests, they used the mock instead of 'clickhouse'. Now, this is a constant, and nobody changes it. Tests that need to use the mock pass it explicitly with the driverName, so this issue no longer occurs. You can read the description of the PR for more details

The logical issue here is that integration tests might use mock drivers from other tests instead of the real Clickhouse driver because those tests set the mock names in driverName. The main fix is to avoid setting mock names in driverName and instead pass these names explicitly in the configuration
Hope that the problem is clear now

like this code, right?

opentelemetry-collector-contrib/exporter/clickhouseexporter/exporter_logs_test.go

Lines 261 to 266 in 4e84b33

func initClickhouseTestServer(t *testing.T, recorder recorder) {

driverName = t.Name()

sql.Register(t.Name(), &testClickhouseDriver{

recorder: recorder,

})

}

Yes . driverName = t.Name() is the problem

lim123123123 · 2025-04-03T08:03:16Z

I am little confuse, please give more examplianation about :

This approach works in production but leads to data races when tests are executed in parallel because each test writes its own name to driverName.

In production, nobody sets the names of their mocks in driverName, so it always defaults to 'Clickhouse'. This works fine, and it would also work if the other tests (those that set this variable) were disabled.

lim123123123 · 2025-04-03T10:08:49Z

@atoulme Do I understand right that we need a review from @dmitryax to merge this?

…tests (open-telemetry#38798)  #### Description clickhouse exporter tests use their own sql drivers to validate the queries sent during the tests. Each driver has a unique name (which is the same as the test name). This name must be passed to the sql.Open function during database initialization. Unfortunately, instead of explicitly passing the driver name into the buildDB function, it was passed implicitly through a global variable. This approach works in production but leads to data races when tests are executed in parallel because each test writes its own name to driverName. As a result, integration tests expected driverName to be set to the standard "clickhouse", but instead, it retained the value set by the previous test. This caused the validation callbacks to be triggered incorrectly, leading to test failures (panics). There were two possible solutions to fix this issue: 1. Hardcode the "clickhouse" constant in integration tests. 2. Pass driverName explicitly into sql.Open. Although the first option might work, I believe it is not a careful way to write tests and code. A global variable that changes during tests introduces bugs that are difficult to debug and reproduce. Therefore, I prefer the second option.  #### Link to tracking issue Fixes Close open-telemetry#32530 --------- Co-authored-by: Antoine Toulme <[email protected]>

#### Description In the current version of the exporter, the createDatabase function uses cfg.buildDSN, which appends the database from the config into the dsn that is passed into the clickhouse driver. As a result, the exporter tries to connect to the database before it is actually created, causing a ClickHouse exception in the start function This pr couldn't be merged until [the integration test fix](#38798)  #### Testing I've added an integration test that checks whether the database was successfully created. It fails now but works with the fix.  --------- Co-authored-by: Antoine Toulme <[email protected]>

#### Description In the current version of the exporter, the createDatabase function uses cfg.buildDSN, which appends the database from the config into the dsn that is passed into the clickhouse driver. As a result, the exporter tries to connect to the database before it is actually created, causing a ClickHouse exception in the start function This pr couldn't be merged until [the integration test fix](open-telemetry#38798)  #### Testing I've added an integration test that checks whether the database was successfully created. It fails now but works with the fix.  --------- Co-authored-by: Antoine Toulme <[email protected]>

fix: clickhouseexporter: delete driverName global variable, add drive…

110afd1

…rName into config instead. Fix data race that write into global driverName from muptiple tests in parallel

lim123123123 requested review from dmitryax and a team as code owners March 19, 2025 12:32

github-actions bot assigned atoulme Mar 19, 2025

github-actions bot added the exporter/clickhouse label Mar 19, 2025

github-actions bot requested review from Frapschen, hanjm and SpencerTorres March 19, 2025 12:33

minor: clickhouseexporter: delete sleep that was used to find the race

123233f

atoulme added Skip Changelog PRs that do not require a CHANGELOG.md entry waiting-for-code-owners labels Mar 19, 2025

SpencerTorres approved these changes Mar 20, 2025

View reviewed changes

lim123123123 mentioned this pull request Mar 20, 2025

[exporter/clickhouse] Fix database creation #38829

Merged

This was referenced Mar 23, 2025

Weekly Report: 2025-03-16 - 2025-03-23 LucaLanziani/opentelemetry-collector-contrib#16

Closed

Weekly Report: 2025-03-16 - 2025-03-23 LucaLanziani/opentelemetry-collector-contrib#17

Closed

github-actions bot mentioned this pull request Mar 25, 2025

Weekly Report: 2025-03-18 - 2025-03-25 #38935

Closed

github-actions bot mentioned this pull request Apr 1, 2025

Weekly Report: 2025-03-25 - 2025-04-01 #39070

Closed

Frapschen reviewed Apr 3, 2025

View reviewed changes

Frapschen approved these changes Apr 3, 2025

View reviewed changes

Merge branch 'main' into k.garmanov/FIX-CLICKHOUSEEXPORTER-TESTS

8375fd3

github-actions bot mentioned this pull request Apr 8, 2025

Weekly Report: 2025-04-01 - 2025-04-08 #39228

Closed

github-actions bot mentioned this pull request Apr 15, 2025

Weekly Report: 2025-04-08 - 2025-04-15 #39396

Closed

Merge branch 'main' into k.garmanov/FIX-CLICKHOUSEEXPORTER-TESTS

c1170fb

atoulme added ready to merge Code review completed; ready to merge by maintainers and removed waiting-for-code-owners labels Apr 15, 2025

songy23 approved these changes Apr 15, 2025

View reviewed changes

songy23 merged commit 1c509bf into open-telemetry:main Apr 15, 2025
182 of 183 checks passed

github-actions bot added this to the next release milestone Apr 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[exporter/clickhouse] Fix data race that caused panic in integration tests #38798

[exporter/clickhouse] Fix data race that caused panic in integration tests #38798

Uh oh!

lim123123123 commented Mar 19, 2025 •

edited

Loading

Uh oh!

SpencerTorres commented Mar 20, 2025

Uh oh!

lim123123123 commented Mar 25, 2025

Uh oh!

Frapschen commented Apr 3, 2025

Uh oh!

Frapschen Apr 3, 2025

Uh oh!

lim123123123 Apr 3, 2025

Uh oh!

lim123123123 Apr 3, 2025

Uh oh!

Frapschen Apr 3, 2025

Uh oh!

lim123123123 Apr 3, 2025

Uh oh!

lim123123123 commented Apr 3, 2025

Uh oh!

lim123123123 commented Apr 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

	func initClickhouseTestServer(t *testing.T, recorder recorder) {
	driverName = t.Name()
	sql.Register(t.Name(), &testClickhouseDriver{
	recorder: recorder,
	})
	}

[exporter/clickhouse] Fix data race that caused panic in integration tests #38798

[exporter/clickhouse] Fix data race that caused panic in integration tests #38798

Uh oh!

Conversation

lim123123123 commented Mar 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Link to tracking issue

Uh oh!

SpencerTorres commented Mar 20, 2025

Uh oh!

lim123123123 commented Mar 25, 2025

Uh oh!

Frapschen commented Apr 3, 2025

Uh oh!

Frapschen Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

lim123123123 Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

lim123123123 Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

Frapschen Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

lim123123123 Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

lim123123123 commented Apr 3, 2025

Uh oh!

lim123123123 commented Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lim123123123 commented Mar 19, 2025 •

edited

Loading

lim123123123 commented Apr 3, 2025 •

edited

Loading