-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support for DuckDB #90
Comments
@johntruckenbrodt |
Awesome! Thanks @constantinius. Let me know if there's anything I can do. |
The cleanest way I found would be to make a specific subclass for the evaluator and simply override the functions acting up. The following snippet works for me: from pygeofilter.backends.sql.evaluate import to_sql_where, SQLEvaluator
from pygeofilter.backends.evaluator import handle
import shapely.geometry
class DuckDBEvaluator(SQLEvaluator):
@handle(ast.Attribute)
def attribute(self, node: ast.Attribute):
if node.name == "geometry":
return f'ST_GeomFromWKB({node.name})'
return f'"{self.attribute_map[node.name]}"'
@handle(values.Geometry)
def geometry(self, node: values.Geometry):
wkb_hex = shapely.geometry.shape(node).wkb_hex
return f"ST_GeomFromHEXEWKB('{wkb_hex}')"
@handle(values.Envelope)
def envelope(self, node: values.Envelope):
wkb_hex = shapely.geometry.box(node.x1, node.y1, node.x2, node.y2).wkb_hex
return f"ST_GeomFromHEXEWKB('{wkb_hex}')"
sql_where = DuckDBEvaluator({
's1:datatake': 's1:datatake',
'datetime': 'datetime',
'sar:instrument_mode': 'sar:instrument_mode',
'end_datetime': 'end_datetime',
'start_datetime': 'start_datetime',
'geometry': 'geometry'
}, {}).evaluate(filter) Afterwards, no string manipulation is required. I'm actually not sure how to backport this into pygeofilter. A quick glance around, and I found vast differences in all of the different SQL based databases with spatial support. I'm thinking that there is no way to provide a general solution. Please let me know if this solution works for you. |
This works like a charm! Thanks a lot @constantinius, this is of great help. I'll be using this solution and give credit of course. Can't this class find its way into |
For those playing the duckdb game, you also need to put quotes around the datetime type like so: @handle(*values.LITERALS)
def literal(self, node):
if isinstance(node, str):
return f"'{node}'"
elif isinstance(node, datetime.datetime):
return f"'{node}'"
else:
return node |
Hi,
I am currently trying to figure out a way to use
pygeofilter
for converting CQL2 to a DuckDB query using the spatial extension. This extension is quite prototypical and I stumble across a couple of caveats. Here's my current approach:Define the CQL2 filter
Optionally add spatial filtering
Convert CQL2 filter to SQL where clause
Create DuckDB query for a geoparquet file:
Here it gets ugly because (1) the WKB column needs to be converted to
GEOMETRY
type and (2) the DuckDB-spatial implementation ofST_GeomFromWKB
cannot read the WKB-HEX representation returned byto_sql_where
.Execute the query:
I wonder, how would you do this? Do you think there is anything that could/should be modified on the
pygeofilter
end or is it entirely up to the DuckDB-spatial package? I'd appreciate any help.The text was updated successfully, but these errors were encountered: