jangorecki/data.cube

Latest commit: c8622b1 · Mar 20, 2018

data.cube

An in-memory OLAP cube data type for R, built on the high-performance, C-implemented data.table package.

Features

  • scalable, multidimensional, hierarchical alternative to the base R array
  • uses data.table under the hood
  • uses the base R array query API
    • [.data.cube follows the base R [.array method API to slice and dice on dimension keys
  • extends the base R array query API
    • slice and dice on dimension attributes at various levels of a hierarchy with .(time_year = 2011:2013)
    • aggregate by collapsing dimensions with `-`(), optionally with pre-filtering
    • rollup and cube over provided groupings, with `+`() for rollup and `^`() for cube
  • apply.data.cube uses a base R apply-like API
  • rollup for data.cube
    • binds the grouping dimension to retain cube normalization and avoid double counting
  • for pivoting, use format/as.data.table with the dcast.data.table API
  • direct access to data.cube child classes and attributes
  • query optimization
  • works on a sharded distributed engine using big.data.table
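
The base R array semantics that the features above refer to can be sketched as follows. This is a minimal illustration in plain base R, not a call into data.cube itself: the array `ar`, its dimension names, and its values are made up for the example, and the data.cube operators listed above are not used here.

```r
# Toy 3-dimensional array: color x country x year (hypothetical data).
ar <- array(
  1:12,
  dim = c(2, 3, 2),
  dimnames = list(
    color   = c("green", "red"),
    country = c("IN", "UK", "US"),
    year    = c("2014", "2015")
  )
)

# Slice: fix a single key on one dimension; that dimension is dropped.
green <- ar["green", , ]

# Dice: pick several keys per dimension; drop = FALSE keeps all three dimensions.
sub <- ar[c("green", "red"), c("UK", "US"), "2015", drop = FALSE]

# Aggregate by collapsing the country dimension, keeping color and year.
by_color_year <- apply(ar, c("color", "year"), sum)
```

Here `green` is a country x year matrix, `sub` is still a three-dimensional array, and `by_color_year` is a color x year matrix of sums. According to the feature list, data.cube exposes the same slice and dice style through [.data.cube and adds hierarchy-aware filtering and aggregation on top of it.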

Installation

install.packages("data.cube", repos = paste0("https://", c(
    "jangorecki.gitlab.io/data.cube",
    "cloud.r-project.org"
)))

Usage

Read the manual and see the vignette "Subset and aggregate multidimensional data with data.cube".
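
One way to locate the manual and vignettes from an R session is sketched below. It uses only standard R utilities and assumes the package has already been installed; the variable name `has_pkg` is illustrative.

```r
# Check whether data.cube is installed without attaching it.
has_pkg <- requireNamespace("data.cube", quietly = TRUE)

if (has_pkg) {
  # Index of the package manual (help topics).
  print(help(package = "data.cube"))
  # List the vignettes shipped with the installed package.
  print(vignette(package = "data.cube"))
} else {
  message("data.cube is not installed; see the Installation section.")
}
```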

Contact

[email protected]