Investigate TPC-H q4 hanging when not enough memory is allocated #1523

Closed
andygrove opened this issue Mar 13, 2025 · 6 comments · Fixed by #1614

@andygrove (Member)

Describe the bug

During benchmarking, I found that TPC-H q4 "hangs" indefinitely in the sort-merge join when not much memory is allocated. I would expect the operator to be slow and spill, but it seems to be in some kind of deadlock instead, with the stats never changing except for "total time for joining".

[Screenshot: sort-merge join metrics; only "total time for joining" is increasing]

Steps to reproduce

No response

Expected behavior

No response

Additional context

No response

@andygrove added the bug label Mar 13, 2025
@andygrove added this to the 0.8.0 milestone Mar 13, 2025
@Kontinuation (Member)

The query is blocked because we don't have enough blocking threads configured for the tokio runtime.

In the merge phase, each spill file is wrapped in a stream backed by a blocking thread (see read_spill_as_stream), so we spawn at least 183 blocking threads when there are 183 spill files to merge. The default number of blocking threads is 10, which makes the query hang indefinitely.

Tuning spark.comet.blockingThreads to a higher value could resolve this problem. We may consider raising the default value of spark.comet.blockingThreads, or improving sort-merge in DataFusion so that it does not spawn so many blocking threads.
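
To make the failure mode concrete, here is a minimal, self-contained sketch (not Comet or DataFusion code; the thread counts, channel capacity, and number of "spill files" are made-up values). Each spill stream gets a dedicated spawn_blocking task that pushes batches into a bounded channel; once the blocking pool is saturated, the merge waits forever on streams whose reader tasks never get a thread:

```rust
// Illustrative sketch of the hang: more "spill streams" than blocking
// threads, each stream fed by a dedicated blocking task. This program
// intentionally hangs after printing a couple of lines.
// Assumes tokio with the "rt-multi-thread" and "sync" features enabled.
use tokio::runtime::Builder;
use tokio::sync::mpsc;

fn main() {
    let rt = Builder::new_multi_thread()
        .worker_threads(2)
        // Analogous to a small blocking-thread limit on the runtime.
        .max_blocking_threads(2)
        .enable_all()
        .build()
        .unwrap();

    rt.block_on(async {
        const NUM_SPILL_FILES: usize = 4; // more streams than blocking threads
        let mut receivers = Vec::new();

        for i in 0..NUM_SPILL_FILES {
            // Each "spill stream" is a bounded channel fed by a blocking task,
            // loosely mirroring the shape of read_spill_as_stream.
            let (tx, rx) = mpsc::channel::<String>(1);
            receivers.push(rx);
            tokio::task::spawn_blocking(move || {
                // blocking_send parks this thread once the channel is full,
                // so the thread is never returned to the blocking pool.
                for batch in 0.. {
                    if tx.blocking_send(format!("file {i}, batch {batch}")).is_err() {
                        break;
                    }
                }
            });
        }

        // A merge needs a batch from *every* stream before it can make
        // progress. Two of the four streams never get a blocking thread,
        // so this loop hangs partway through.
        for (i, rx) in receivers.iter_mut().enumerate() {
            let batch = rx.recv().await.expect("spill stream closed unexpectedly");
            println!("merge received '{batch}' from stream {i}");
        }
        println!("all streams produced a batch"); // never reached
    });
}
```

Raising max_blocking_threads above the number of spill streams (or not tying one blocking thread to each stream) lets the loop complete.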

@andygrove (Member, Author)

Thanks for debugging this @Kontinuation. Related to this, we currently create a new tokio runtime per plan. I do wonder if we should just have a global tokio runtime for the executor where we could allocate a higher number of threads that could be shared. Do you have an opinion on that?
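
For reference, a process-wide runtime could look roughly like the sketch below (illustrative only, not the actual Comet implementation; the thread counts are made-up values). The runtime is built once per executor process and every plan execution reuses it, so the blocking pool is sized once and shared:

```rust
// Sketch of a shared, lazily-initialized tokio runtime for the whole process.
// Assumes tokio with the "rt-multi-thread" feature; the numbers are examples.
use std::sync::OnceLock;
use tokio::runtime::{Builder, Runtime};

fn global_runtime() -> &'static Runtime {
    static RUNTIME: OnceLock<Runtime> = OnceLock::new();
    RUNTIME.get_or_init(|| {
        Builder::new_multi_thread()
            .worker_threads(8)
            // One larger blocking pool shared by all concurrently running plans.
            .max_blocking_threads(512)
            .enable_all()
            .build()
            .expect("failed to build global tokio runtime")
    })
}

fn main() {
    // Two "plans" executing on the same runtime instead of each building its own.
    global_runtime().block_on(async {
        println!("plan A runs on the shared runtime");
    });
    global_runtime().block_on(async {
        println!("plan B reuses the same runtime and blocking pool");
    });
}
```

One consequence of this shape is that the pool sizes are fixed when the runtime is first built.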

@andygrove (Member, Author)

I filed an issue in DataFusion: apache/datafusion#15323

@Kontinuation (Member)

I prefer reusing a global tokio runtime for running all Comet physical plans within the same process. The current runtime-per-plan approach spawns a needlessly large number of threads. We could also set a larger default for the maximum number of blocking threads, and those blocking threads would be better utilized by concurrently running queries.

Having a global tokio runtime may prevent us from re-configuring the number of worker threads and blocking threads in an active Spark context using spark.conf.set, but I don't think that is a big problem.

@andygrove (Member, Author)

Here is an old PR that switched to using a global tokio runtime. I closed the PR because I could not find a good justification for it at the time. Perhaps we should try this again and see if it helps with this issue.

#1104

@andygrove (Member, Author)

I filed #1590 for switching to a global tokio runtime
