Replies: 2 comments 3 replies
-
|
Hello, as I said, I think that it is a good Idea, yet we need to take into acount that it can only work for when we store the data on s3, as otherwise, we are required to change the schema of the I also think that in no case we should save binary data (blobs) in the metadata database itself, as it is not recomended by any of the database vendors, in addition to there bein a hard limit for the binary data size (in pgsql around 1gb without plugins, in maria/mysql up to 4gb), and so I think the implementation should target the object storage xcom backend instead |
Beta Was this translation helpful? Give feedback.
-
|
Another approach also suggested in the PR is to encode the raw bytes as strings such that they become json serializable. Do you have a suggestion of how one would go approaching it that way? Both approaches has different pros and cons. Concerning data allocation, complexity and convenience. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Proposal: I suggest that we should be able to store in memory video/images/raw-bytes through common.io xcom backend. After opening a PR: #61777 I realized that this might not be the implementation the community would do in the common.io provider as @Nataneljpwd pointed out. What data-types should be supported to store in this provider and what is the actual puprose of it?
Current behaviour: We only support json serializable objects. And I could not find any previous discussions about supporting other serializatoin methods.
My optinion: I think it is pretty convenient that it exists both as a fallback that we do not accidentally store big files in S3 but also as a reliable feature to standardize the bucket paths in s3 for all dags that are running in the airflow instance.
Suggestion: I think the purest form of data we can store is raw bytes, and don't care about the contents. Although since json is already implemented and for the sake of not destroy backward compatibility we would support both JSON and raw bytes data.
What are your takes on the topic?
Beta Was this translation helpful? Give feedback.
All reactions