Skip to content

Releases: SANSA-Stack/SANSA-DataLake

SANSA DataLake 0.1.2

13 Jan 16:32
74780fb
Compare
Choose a tag to compare

This version brings the following features and changes:

  • Project and filter of subject variables.
  • Remove requiring the projection of the filtered object.
  • Run queries with only one relevant data source.
  • Add support for RDF data (NTriples).
  • Add query-time transformations.
  • Change the query execution timer to count the full execution time instead of only Spark execution time.
  • Improve exception handling and logging messages.
  • Change query results format, from show() method to foreach(println) method.

SANSA DataLake 0.1.1

07 Jun 15:52
Compare
Choose a tag to compare

Features

This release makes it possible to read the needed input files (query, mappings and config) from HDFS and S3 bucket.

Dependency Changes

  • Apache Spark 2.4.3

SANSA DataLake 0.1.0

12 Dec 10:57
Compare
Choose a tag to compare

Features

  • Spark
    • Query heterogeneous sources (Data Lake) using SPARQL.
      • Supported data sources: CSV, Parquet, MongoDB, Cassandra, JDBC.

Dependency Changes

  • Apache Spark 2.4.0
  • Apache Jena 3.9.0