Big data stack running in pseudo-distributed mode with the following components:
- Hadoop 2.8.5
- - Minio RELEASE.2018-12-27T18-33-08Z
- - Hive 2.3.4
- - Presto 0.215
- - Superset 0.28.1
- - Hue 4.3.0
+ - Minio RELEASE.2019-10-12T01-39-57Z
+ - Hive 2.3.6
+ - Presto 326
+ - Superset 0.35.1
+ - Hue 4.5.0
For more details see the following [post](https://johs.me/posts/big-data-stack-running-sql-queries/).
@@ -23,7 +23,7 @@ If you also want to start Superset and Hue, then run:
docker-compose -f superset/docker-compose.yml up -d
docker-compose -f hue/docker-compose.yml up -d
```
- and initialize the databases:
+ and initialize:
```
./scripts/init-hue.sh
./scripts/init-superset.sh
@@ -48,16 +48,16 @@ in the different sub-directories.
Changes compared to original images:
- Hadoop updated to version 2.8.5
- - Hive update to version 2.3.4
+ - Hive update to version 2.3.6
- S3 support added
- - Presto update to 0.215
+ - Presto update to 326
- Presto JDBC driver added to Hue
The scripts directory contains some helper scripts:
- `beeline.sh`: Launch Beeline (Hive CLI) in Hive container
- `hadoop-client.sh`: Start container with Hadoop utilities (host filesystem mounted as `/host`). Useful for moving files to HDFS.
- - `init-hue.sh`: Initialize Hue database
+ - `init-hue.sh`: Create admin home folder in HDFS in order to avoid error in Hue File Browser.
- `init-superset.sh`: Initialize Superset database and add Presto as data source
- `presto-cli.sh`: Launch Presto CLI (downloads jar if needed)
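
The new `init-hue.sh` description says it creates the admin home folder in HDFS. A minimal sketch of what such a step might look like is shown below. It is not the repository's actual script: the container name `hadoop`, the user name `admin`, the helper `hue_home_cmds`, and the `DRY_RUN` switch are all assumptions made for illustration. Only the `hdfs dfs -mkdir -p` and `hdfs dfs -chown` commands are standard HDFS shell calls.

```shell
#!/bin/sh
# Hypothetical sketch only; the real init-hue.sh may differ.
# HADOOP_CONTAINER and HUE_USER are assumed names, not taken from the repository.
HADOOP_CONTAINER="${HADOOP_CONTAINER:-hadoop}"
HUE_USER="${HUE_USER:-admin}"
DRY_RUN="${DRY_RUN:-1}"   # defaults to printing the commands instead of running them

# Emit the HDFS commands that create a user's home folder and give the user ownership.
hue_home_cmds() {
  printf 'hdfs dfs -mkdir -p /user/%s\n' "$1"
  printf 'hdfs dfs -chown %s /user/%s\n' "$1" "$1"
}

# Print each command, or run it inside the Hadoop container when DRY_RUN=0.
hue_home_cmds "$HUE_USER" | while IFS= read -r cmd; do
  if [ "$DRY_RUN" = "1" ]; then
    echo "docker exec $HADOOP_CONTAINER $cmd"
  else
    # $cmd is intentionally unquoted so it splits into separate arguments.
    docker exec "$HADOOP_CONTAINER" $cmd </dev/null
  fi
done
```

Running the sketch with `DRY_RUN=0` would execute the two commands in the assumed container; with the default it only shows what would be run.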