Skip to content

Spark 4.1: Optimize UUID writer with fast-path byte parsing.#15302

Open
slfan1989 wants to merge 2 commits intoapache:mainfrom
slfan1989:spark-optimize-uuid-writer
Open

Spark 4.1: Optimize UUID writer with fast-path byte parsing.#15302
slfan1989 wants to merge 2 commits intoapache:mainfrom
slfan1989:spark-optimize-uuid-writer

Conversation

@slfan1989
Copy link
Contributor

Summary

  • Optimize UUID writer to avoid UTF-8 decoding and string parsing overhead
  • Add fast-path for canonical UUID format (8-4-4-4-12 with hyphens)
  • Direct byte-level parsing from UTF8String to ByteBuffer

@github-actions github-actions bot added the spark label Feb 12, 2026
@slfan1989
Copy link
Contributor Author

@nastra @huaxingao Could you please review this PR when you have a moment? Thank you very much!

@singhpk234
Copy link
Contributor

@slfan1989 do you perf benchmarks on how much it improves ? I wonder if its worth the complexity

@slfan1989
Copy link
Contributor Author

@slfan1989 do you perf benchmarks on how much it improves ? I wonder if its worth the complexity

@singhpk234 Thanks for your reply. I’ll try to provide a test report. In theory, there should be a performance improvement.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants