Skip to content

Commit 70d84b7

Browse files
committed
Time Series: Improve guidance, structure, layout, and wording
- Expand canonical "time series" entry-point page - Add dedicated time series sub-pages about: - Time Series Basics - Advanced Time Series Analysis - Connectivity Options - Video Tutorials - Use "time series" 2-gram everywhere - Improve page about "Industrial Data" - Improve page about "Document Store" - ML: Add section about "Exploratory data analysis (EDA)"
1 parent 500ed8d commit 70d84b7

File tree

13 files changed

+761
-34
lines changed

13 files changed

+761
-34
lines changed

docs/admin/sharding-partitioning.rst

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@ partition as a set of shards. For each partition, the number of shards defined
6464
by ``CLUSTERED INTO x SHARDS`` are created, when a first record with a specific
6565
``partition key`` is inserted.
6666

67-
In the following example - which represents a very simple time-series use-case
67+
In the following example - which represents a very simple time series use-case
6868
- we added another column ``part`` that automatically generates the current
6969
month upon insertion from the ``ts`` column. The ``part`` column is further used
7070
as the ``partition key``.
@@ -132,12 +132,12 @@ Then, to calculate the number of shards, you should consider that the size of ea
132132
shard should roughly be between 5 - 100 GB, and that each node can only manage
133133
up to 1000 shards.
134134

135-
Time-series example
135+
Time series example
136136
-------------------
137137

138138
To illustrate the steps above, let's use them on behalf of an example. Imagine
139139
you want to create a *partitioned table* on a *three-node cluster* to store
140-
time-series data with the following assumptions:
140+
time series data with the following assumptions:
141141

142142
- Inserts: 1.000 records/s
143143
- Record size: 128 byte/record

docs/domain/document/index.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,8 +9,11 @@ Storing documents in CrateDB provides the same development convenience like the
99
document-oriented storage layer of Lotus Notes / Domino, CouchDB, MongoDB, and
1010
PostgreSQL's `JSON(B)` types.
1111

12+
- [](inv:crate-reference#type-object)
1213
- [](inv:cloud#object)
14+
- [CrateDB Objects]
1315
- [Unleashing the Power of Nested Data: Ingesting and Querying JSON Documents with SQL]
1416

1517

18+
[CrateDB Objects]: https://youtu.be/aQi9MXs2irU?feature=shared
1619
[Unleashing the Power of Nested Data: Ingesting and Querying JSON Documents with SQL]: https://youtu.be/S_RHmdz2IQM?feature=shared

docs/domain/industrial/index.md

Lines changed: 124 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@
55
# Industrial Data
66

77
Learn how to use CrateDB in industrial / IIoT / Industry 4.0 scenarios within
8-
engineering, manufacturing, and other operational domains.
8+
engineering, manufacturing, production, and other operational domains.
99

1010
In the realm of Industrial IoT, dealing with diverse data, ranging from
1111
slow-moving structured data, to high-frequency measurements, presents unique
@@ -15,24 +15,110 @@ The complexities of industrial big data are characterized by its high variety,
1515
unstructured features, different data sampling rates, and how these attributes
1616
influence data storage, retention, and integration.
1717

18-
Today's warehouses are complex systems with a very high degree of automation.
19-
The key to the successful operation of these warehouses lies in having a
20-
holistic view on the entire system based on data from various components like
21-
sensors, PLCs, embedded controllers and software systems.
2218

19+
(rauch)=
20+
## Rauch Insights
21+
22+
::::{info-card}
23+
24+
:::{grid-item}
25+
:columns: 8
26+
27+
{material-outlined}`data_exploration;2em`   **Rauch: High-Speed Production Lines**
28+
29+
_Scaling a high-speed production environment with CrateDB._
30+
31+
Rauch is filling 33 cans per second and how that adds up to 400 data records
32+
per second which are being processed, stored, and analyzed. In total, they are
33+
within the range of one to ten billion records persisted in CrateDB.
34+
35+
- [Rauch: High-Speed Production Lines]
36+
37+
The use-case of Rauch demonstrates why traditional databases weren't capable to
38+
deal with so many data records and unstructured data. The benefits of CrateDB
39+
made Rauch choose it over other databases, such as PostgreSQL compatibility,
40+
the support for unstructured data, and its excellent customer support.
41+
42+
:Industry: {tags-secondary}`Food` {tags-secondary}`Packaging` {tags-secondary}`Production`
43+
:Tags: {tags-primary}`SCADA` {tags-primary}`MDE` {tags-primary}`Data Historian` {tags-primary}`Industrial IoT` {tags-primary}`PLC`
44+
:::
45+
46+
:::{grid-item}  
47+
:columns: 4
48+
49+
<iframe width="240" src="https://www.youtube-nocookie.com/embed/gJPmJ0uXeVs?si=J0w5yG56Ld4fIXfm" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
50+
51+
**Date:** 28 Jun 2022 \
52+
**Speaker:** Arno Breuss
53+
:::
2354

55+
::::
2456

57+
58+
(tgw)=
2559
## TGW Insights
2660

61+
62+
::::{info-card}
63+
64+
:::{grid-item}
65+
:columns: 8
66+
67+
{material-outlined}`inventory;2em` &nbsp; **TGW: Data acquisition in high-speed logistics**
68+
69+
_Storing, querying, and analyzing industrial IoT data and metadata without
70+
much hassle._
71+
72+
Today's warehouses are complex systems with a very high degree of automation.
73+
74+
TGW Logistics Group implements key factors to the successful operation of these
75+
warehouses, by having a holistic view on the entire system acquiring data from
76+
various components like sensors, PLCs, embedded controllers, and software
77+
systems.
78+
79+
- [TGW: Fixing data silos in a high-speed logistics environment]
80+
81+
TGW states that all these components can be seen as "data silos",
82+
distributed across the entire site, each of them storing just some pieces of
83+
information in various data structures and different ways to access it.
84+
2785
After trying multiple database systems, TGW Logistics moved to CrateDB for
28-
its ability to aggregate different data formats and ability to query this
29-
information without much hassle.
30-
86+
its ability to aggregate different data formats and the ability to query this
87+
information without further ado.
88+
89+
:Industry: {tags-secondary}`Logistics` {tags-secondary}`Shipping`
90+
:Tags: {tags-primary}`SCADA` {tags-primary}`MDE` {tags-primary}`Data Historian` {tags-primary}`Industrial IoT` {tags-primary}`PLC`
91+
:::
92+
93+
:::{grid-item} &nbsp;
94+
:columns: 4
95+
96+
<iframe width="240" src="https://www.youtube-nocookie.com/embed/6dgjVQJtSKI?si=J0w5yG56Ld4fIXfm" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
97+
98+
**Date:** 22 Jun 2022 \
99+
**Speakers:** Alexander Mann, Jan Weber
100+
:::
101+
102+
::::
103+
104+
105+
106+
::::{info-card}
107+
108+
:::{grid-item}
109+
:columns: 8
110+
111+
{material-outlined}`dashboard;2em` &nbsp; **TGW: Challenges in storing and analyzing industrial data**
112+
113+
_Not All Time-Series Are Equal: Challenges in Storing and Analyzing Industrial Data._
114+
31115
In the second presentation, you will learn how TGW leverages CrateDB to build
32-
digital twins of physical warehouses around the world.
116+
digital twins of physical warehouses around the world, by using its unique set
117+
of features suitable for storing and querying complex industrial big data with
118+
high variety, unstructured features, and at different data frequencies.
33119

34-
- [Fixing data silos in a high-speed logistics environment]
35-
- [Challenges of Storing and Analyzing Industrial Data]
120+
- [CrateDB: Challenges in industrial data]
121+
- [TGW: Storing and analyzing real-world industrial data]
36122

37123
**What's inside**
38124

@@ -47,6 +133,31 @@ digital twins of physical warehouses around the world.
47133
- Real-World Applications: Exploration of actual customer use cases to
48134
illustrate how CrateDB can be applied in various industrial scenarios.
49135

136+
:Industry: {tags-secondary}`Logistics` {tags-secondary}`Shipping`
137+
:Tags: {tags-primary}`Data Historian` {tags-primary}`Industrial IoT` {tags-primary}`Digital Twin`
138+
:::
139+
140+
:::{grid-item} &nbsp;
141+
:columns: 4
142+
143+
<iframe width="240" class="speakerdeck-iframe" style="border: 0px; background: rgba(0, 0, 0, 0.1) padding-box; margin: 0px; padding: 0px; border-radius: 6px; box-shadow: rgba(0, 0, 0, 0.2) 0px 5px 40px; width: 100%; height: auto; aspect-ratio: 560 / 315;" frameborder="0" src="https://speakerdeck.com/player/acb78531a07e4238ac662539b0c23609" title=" Not all time-series are equal ​ Challenges of storing and analyzing industrial data" allowfullscreen="true" data-ratio="1.7777777777777777"></iframe>
144+
145+
**Date:** 23 Nov 2022 \
146+
**Speaker:** Marija Selakovic
147+
148+
149+
<iframe width="240" src="https://www.youtube-nocookie.com/embed/ugQvihToY0k?si=J0w5yG56Ld4fIXfm" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
150+
151+
**Date:** 5 Oct 2023 \
152+
**Speakers:** Alexander Mann, Georg Traar
153+
:::
154+
155+
::::
156+
157+
158+
50159

51-
[Challenges of Storing and Analyzing Industrial Data]: https://youtu.be/ugQvihToY0k?feature=shared
52-
[Fixing data silos in a high-speed logistics environment]: https://youtu.be/6dgjVQJtSKI?feature=shared
160+
[CrateDB: Challenges in industrial data]: https://speakerdeck.com/cratedb/not-all-time-series-are-equal-challenges-of-storing-and-analyzing-industrial-data
161+
[Rauch: High-Speed Production Lines]: https://youtu.be/gJPmJ0uXeVs?feature=shared
162+
[TGW: Fixing data silos in a high-speed logistics environment]: https://youtu.be/6dgjVQJtSKI?feature=shared
163+
[TGW: Storing and analyzing real-world industrial data]: https://youtu.be/ugQvihToY0k?feature=shared

0 commit comments

Comments
 (0)