Releases: SANSA-Stack/SANSA-DataLake
Releases · SANSA-Stack/SANSA-DataLake
SANSA DataLake 0.1.2
This version brings the following features and changes:
- Project and filter of subject variables.
- Remove requiring the projection of the filtered object.
- Run queries with only one relevant data source.
- Add support for RDF data (NTriples).
- Add query-time transformations.
- Change the query execution timer to count the full execution time instead of only Spark execution time.
- Improve exception handling and logging messages.
- Change query results format, from
show()
method toforeach(println)
method.
SANSA DataLake 0.1.1
Features
This release makes it possible to read the needed input files (query, mappings and config) from HDFS and S3 bucket.
Dependency Changes
- Apache Spark 2.4.3
SANSA DataLake 0.1.0
Features
- Spark
- Query heterogeneous sources (Data Lake) using SPARQL.
- Supported data sources: CSV, Parquet, MongoDB, Cassandra, JDBC.
- Query heterogeneous sources (Data Lake) using SPARQL.
Dependency Changes
- Apache Spark 2.4.0
- Apache Jena 3.9.0