Member

@ueshin ueshin commented Oct 21, 2025

What changes were proposed in this pull request?

Adds basic Python worker logging support.

Logs emitted from Python's standard logging module, as well as output written via print to stdout and stderr, are collected into the system.session.python_worker_logs view.

  • spark.sql.pyspark.worker.logging.enabled (False by default)
    When set to true, this configuration enables comprehensive logging within Python worker processes that execute User-Defined Functions (UDFs), User-Defined Table Functions (UDTFs), and other Python-based operations in Spark SQL.

For example:

>>> from pyspark.sql.functions import udf
>>> import logging
>>> import sys
>>>
>>> @udf
... def logging_test_udf(x):
...     logger = logging.getLogger("test")
...     logger.setLevel(logging.INFO)
...     logger.info(f"INFO level message: {x}")
...     print(f"PRINT(STDOUT): {x}")  # INFO level, logger is "stdout"
...     print(f"PRINT(STDERR): {x}", file=sys.stderr)  # ERROR level, logger is "stderr"
...     try:
...         1 / x
...     except:
...         logger.exception(f"1 / {x}")
...     return str(x)
...
>>> spark.conf.set("spark.sql.pyspark.worker.logging.enabled", True)
>>>
>>> spark.range(2).select(logging_test_udf("id")).show()
+-----+
|f(id)|
+-----+
|    0|
|    1|
+-----+

>>> spark.table("system.session.python_worker_logs").orderBy("ts").show(truncate=False)
+--------------------------+-----+---------------------+-------------------------------+-----------------------------------------------------------------------------+------+
|ts                        |level|msg                  |context                        |exception                                                                    |logger|
+--------------------------+-----+---------------------+-------------------------------+-----------------------------------------------------------------------------+------+
|2025-10-21 17:22:01.862654|INFO |INFO level message: 0|{func_name -> logging_test_udf}|NULL                                                                         |test  |
|2025-10-21 17:22:01.863826|INFO |INFO level message: 1|{func_name -> logging_test_udf}|NULL                                                                         |test  |
|2025-10-21 17:22:01.86505 |INFO |PRINT(STDOUT): 0     |{func_name -> logging_test_udf}|NULL                                                                         |stdout|
|2025-10-21 17:22:01.865827|INFO |PRINT(STDOUT): 1     |{func_name -> logging_test_udf}|NULL                                                                         |stdout|
|2025-10-21 17:22:01.87052 |ERROR|PRINT(STDERR): 0     |{func_name -> logging_test_udf}|NULL                                                                         |stderr|
|2025-10-21 17:22:01.871405|ERROR|PRINT(STDERR): 1     |{func_name -> logging_test_udf}|NULL                                                                         |stderr|
|2025-10-21 17:22:01.87188 |ERROR|1 / 0                |{func_name -> logging_test_udf}|{ZeroDivisionError, division by zero, [{NULL, logging_test_udf, <stdin>, 8}]}|test  |
+--------------------------+-----+---------------------+-------------------------------+-----------------------------------------------------------------------------+------+

Why are the changes needed?

Logs emitted from UDFs are currently difficult to collect, as they go to each executor's stderr file.
When there are many executors, the stderr files have to be checked one by one.

Does this PR introduce any user-facing change?

Yes, Python UDF logging is now available, and the logs can be collected via a system view.

How was this patch tested?

Added the related tests.

Was this patch authored or co-authored using generative AI tooling?

No.

Member

@HyukjinKwon HyukjinKwon left a comment

I love this!

@ueshin ueshin changed the title [SPARK-53975][PYTHON] Adds basic logging support [SPARK-53975][PYTHON] Adds basic Python worker logging support Oct 22, 2025
@cloud-fan
Copy link
Contributor

cloud-fan commented Oct 22, 2025

should the log be query-centric instead of worker-centric? How can I find logs for a certain query?

Comment on lines +219 to +220
- func_name: Name of the function that initiated the logging
- class_name: Name of the class that initiated the logging if available
Contributor

Should we consider adding the module name to the context?

Member Author

I think we can add it if necessary.

Contributor

@allisonwang-db allisonwang-db left a comment

This is super awesome! Thanks for working on it!

Comment on lines +125 to +126
writer.writeLog(
PythonWorkerLogLine(System.currentTimeMillis(), seqId.getAndIncrement(), json)
Contributor

Do we want to limit the number of lines written to the block manager?

Member Author

cc @ivoson

Contributor

cc @ivoson @cloud-fan this is important, as we don't want users to write an unlimited number of logs into the block manager.
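The concern here is that a chatty UDF could write log lines without bound; a cap along these lines would address it. This is a minimal sketch in plain Python (the class name BoundedLogWriter and its default limit are hypothetical, not the PR's actual code, which is on the Scala side):

```python
class BoundedLogWriter:
    """Hypothetical sketch of the cap discussed above: retain at most
    max_lines log lines and count anything beyond that as dropped, so a
    chatty UDF cannot grow block manager storage without bound."""

    def __init__(self, max_lines=10_000):
        self.max_lines = max_lines
        self.lines = []
        self.dropped = 0

    def write_log(self, line):
        if len(self.lines) < self.max_lines:
            self.lines.append(line)
        else:
            # Past the cap, drop the line instead of storing it.
            self.dropped += 1
```

Whether dropped lines should be counted, sampled, or surfaced to the user is a separate design choice not settled in this thread.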

Member Author

@ueshin ueshin left a comment

@cloud-fan

should the log be query-centric instead of worker-centric? How can I find logs for a certain query?

Do we have any info in the executor to identify a query, such as a query_id?
If so, I can add it to the context, and then the logs can be queried with context.query_id = 'xxx'.


Member

@dongjoon-hyun dongjoon-hyun left a comment

+1, LGTM (Pending CIs). Thank you for updating the PR.

Please rebase the PR onto the master branch, because the master branch was broken for a while today and has now recovered, @ueshin.

Contributor

@allisonwang-db allisonwang-db left a comment

Looks great! Very excited for this feature!


Member Author

ueshin commented Oct 24, 2025

cc @cloud-fan for another look.
