Refactor: improve query spill #16605
Comments
While throughput to object storage is quite fast, its I/O performance is not nearly as good as local SSD/NVMe. For spill operations that are I/O bound, could we spill to the local cache disk rather than object storage? We also want to make sure all spill and temp files are cleaned up after the query completes or fails. There may be evidence that these temp/spill files persist.
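The cleanup requirement above (spill files removed whether the query completes or fails) can be illustrated with a minimal sketch. This is not Databend's implementation; `query_spill_dir`, `run_query`, the directory naming, and the `partition-0.bin` file are hypothetical names chosen for illustration only. The idea is to tie a per-query spill directory on the local cache disk to a scope whose exit always removes it.

```python
import shutil
import tempfile
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def query_spill_dir(cache_root, query_id):
    """Create a per-query spill directory and remove it when the query ends.

    Hypothetical helper: cache_root stands in for a local cache disk path.
    """
    spill_dir = Path(tempfile.mkdtemp(prefix=f"spill-{query_id}-", dir=cache_root))
    try:
        yield spill_dir
    finally:
        # Runs on both success and failure, so spill files do not outlive the query.
        shutil.rmtree(spill_dir, ignore_errors=True)


def run_query(cache_root, query_id, fail=False):
    """Simulate a query that spills one partition and optionally fails."""
    with query_spill_dir(cache_root, query_id) as spill_dir:
        (spill_dir / "partition-0.bin").write_bytes(b"spilled rows")
        if fail:
            raise RuntimeError("query failed")
        return "ok"
```

A sketch like this only covers in-process failures. If the process is killed, the `finally` block never runs, so a separate sweep of stale spill directories at startup would still be needed.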
Yes, the latest version supports spilling to local disk. Add the config to the config file:
Doc updated: https://docs.databend.com/guides/data-management/data-recycle#spill-data-storage
@wubx, this is only for window functions so far, right? Our biggest spills come from aggregation, so this won't help yet. Is that correct?
Hi @inviscid, spilling to local disk is already supported; see the doc: https://docs.databend.com/guides/data-management/data-recycle#configuring-spill-storage
#17550 Databend hopes to precisely control the memory used by each query and then queue the data spills, so that even the biggest spills can be spilled out to S3. Please feel free to share any suggestions you may have. We would love to hear your thoughts!