Skip to content

Commit 61acb92

Browse files
committed
Industrial Data and Time Series Data: Improve guidance and structure
Link to existing content. Improve guidance, structure, and layout. Time Series: Improve guidance, structure, layout, and wording. - Expand canonical "time series" entry-point page - Add dedicated time series sub-pages about: - Time Series Basics - Advanced Time Series Analysis - Connectivity Options - Video Tutorials - Use "time series" 2-gram everywhere - Improve page about "Industrial Data" - Improve page about "Document Store" - ML: Add section about "Exploratory data analysis (EDA)"
1 parent 500ed8d commit 61acb92

File tree

13 files changed

+768
-35
lines changed

13 files changed

+768
-35
lines changed

docs/admin/sharding-partitioning.rst

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@ partition as a set of shards. For each partition, the number of shards defined
6464
by ``CLUSTERED INTO x SHARDS`` are created, when a first record with a specific
6565
``partition key`` is inserted.
6666

67-
In the following example - which represents a very simple time-series use-case
67+
In the following example - which represents a very simple time series use-case
6868
- we added another column ``part`` that automatically generates the current
6969
month upon insertion from the ``ts`` column. The ``part`` column is further used
7070
as the ``partition key``.
@@ -132,12 +132,12 @@ Then, to calculate the number of shards, you should consider that the size of ea
132132
shard should roughly be between 5 - 100 GB, and that each node can only manage
133133
up to 1000 shards.
134134

135-
Time-series example
135+
Time series example
136136
-------------------
137137

138138
To illustrate the steps above, let's use them on behalf of an example. Imagine
139139
you want to create a *partitioned table* on a *three-node cluster* to store
140-
time-series data with the following assumptions:
140+
time series data with the following assumptions:
141141

142142
- Inserts: 1.000 records/s
143143
- Record size: 128 byte/record

docs/domain/document/index.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,8 +9,11 @@ Storing documents in CrateDB provides the same development convenience like the
99
document-oriented storage layer of Lotus Notes / Domino, CouchDB, MongoDB, and
1010
PostgreSQL's `JSON(B)` types.
1111

12+
- [](inv:crate-reference#type-object)
1213
- [](inv:cloud#object)
14+
- [CrateDB Objects]
1315
- [Unleashing the Power of Nested Data: Ingesting and Querying JSON Documents with SQL]
1416

1517

18+
[CrateDB Objects]: https://youtu.be/aQi9MXs2irU?feature=shared
1619
[Unleashing the Power of Nested Data: Ingesting and Querying JSON Documents with SQL]: https://youtu.be/S_RHmdz2IQM?feature=shared

docs/domain/industrial/index.md

Lines changed: 131 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -5,34 +5,126 @@
55
# Industrial Data
66

77
Learn how to use CrateDB in industrial / IIoT / Industry 4.0 scenarios within
8-
engineering, manufacturing, and other operational domains.
8+
engineering, manufacturing, production, and other operational domains, or
9+
within similar environments where billions of data records from any kinds of
10+
machines or devices need to be processed, stored, and queried.
911

1012
In the realm of Industrial IoT, dealing with diverse data, ranging from
1113
slow-moving structured data, to high-frequency measurements, presents unique
1214
challenges.
13-
1415
The complexities of industrial big data are characterized by its high variety,
1516
unstructured features, different data sampling rates, and how these attributes
1617
influence data storage, retention, and integration.
1718

18-
Today's warehouses are complex systems with a very high degree of automation.
19-
The key to the successful operation of these warehouses lies in having a
20-
holistic view on the entire system based on data from various components like
21-
sensors, PLCs, embedded controllers and software systems.
19+
With CrateDB, compatible to PostgreSQL, you can do all of that using plain SQL,
20+
with excellent integration capabilities into commodity systems using standard
21+
database access interfaces like ODBC or JDBC, and a proprietary HTTP interface
22+
on top.
23+
24+
25+
(rauch)=
26+
## Rauch Insights
27+
28+
::::{info-card}
29+
30+
:::{grid-item}
31+
:columns: 8
32+
33+
{material-outlined}`data_exploration;2em`   **Rauch: High-Speed Production Lines**
34+
35+
_Scaling a high-speed production environment with CrateDB._
36+
37+
Rauch is filling 33 cans per second and how that adds up to 400 data records
38+
per second which are being processed, stored, and analyzed. In total, they are
39+
within the range of one to ten billion records persisted in CrateDB.
40+
41+
- [Rauch: High-Speed Production Lines]
42+
43+
The use-case of Rauch demonstrates why traditional databases weren't capable to
44+
deal with so many data records and unstructured data. The benefits of CrateDB
45+
made Rauch choose it over other databases, such as PostgreSQL compatibility,
46+
the support for unstructured data, and its excellent customer support.
47+
48+
:Industry: {tags-secondary}`Food` {tags-secondary}`Packaging` {tags-secondary}`Production`
49+
:Tags: {tags-primary}`SCADA` {tags-primary}`MDE` {tags-primary}`Data Historian` {tags-primary}`Industrial IoT` {tags-primary}`PLC`
50+
:::
2251

52+
:::{grid-item}  
53+
:columns: 4
2354

55+
<iframe width="240" src="https://www.youtube-nocookie.com/embed/gJPmJ0uXeVs?si=J0w5yG56Ld4fIXfm" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
2456

57+
**Date:** 28 Jun 2022 \
58+
**Speaker:** Arno Breuss
59+
:::
60+
61+
::::
62+
63+
64+
(tgw)=
2565
## TGW Insights
2666

67+
68+
::::{info-card}
69+
70+
:::{grid-item}
71+
:columns: 8
72+
73+
{material-outlined}`inventory;2em` &nbsp; **TGW: Data acquisition in high-speed logistics**
74+
75+
_Storing, querying, and analyzing industrial IoT data and metadata without
76+
much hassle._
77+
78+
Today's warehouses are complex systems with a very high degree of automation.
79+
80+
TGW Logistics Group implements key factors to the successful operation of these
81+
warehouses, by having a holistic view on the entire system acquiring data from
82+
various components like sensors, PLCs, embedded controllers, and software
83+
systems.
84+
85+
- [TGW: Fixing data silos in a high-speed logistics environment]
86+
87+
TGW states that all these components can be seen as "data silos",
88+
distributed across the entire site, each of them storing just some pieces of
89+
information in various data structures and different ways to access it.
90+
2791
After trying multiple database systems, TGW Logistics moved to CrateDB for
28-
its ability to aggregate different data formats and ability to query this
29-
information without much hassle.
30-
92+
its ability to aggregate different data formats and the ability to query this
93+
information without further ado.
94+
95+
:Industry: {tags-secondary}`Logistics` {tags-secondary}`Shipping`
96+
:Tags: {tags-primary}`SCADA` {tags-primary}`MDE` {tags-primary}`Data Historian` {tags-primary}`Industrial IoT` {tags-primary}`PLC`
97+
:::
98+
99+
:::{grid-item} &nbsp;
100+
:columns: 4
101+
102+
<iframe width="240" src="https://www.youtube-nocookie.com/embed/6dgjVQJtSKI?si=J0w5yG56Ld4fIXfm" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
103+
104+
**Date:** 22 Jun 2022 \
105+
**Speakers:** Alexander Mann, Jan Weber
106+
:::
107+
108+
::::
109+
110+
111+
112+
::::{info-card}
113+
114+
:::{grid-item}
115+
:columns: 8
116+
117+
{material-outlined}`dashboard;2em` &nbsp; **TGW: Challenges in storing and analyzing industrial data**
118+
119+
_Not All Time-Series Are Equal: Challenges in Storing and Analyzing Industrial Data._
120+
31121
In the second presentation, you will learn how TGW leverages CrateDB to build
32-
digital twins of physical warehouses around the world.
122+
digital twins of physical warehouses around the world, by using its unique set
123+
of features suitable for storing and querying complex industrial big data with
124+
high variety, unstructured features, and at different data frequencies.
33125

34-
- [Fixing data silos in a high-speed logistics environment]
35-
- [Challenges of Storing and Analyzing Industrial Data]
126+
- [CrateDB: Challenges in industrial data]
127+
- [TGW: Storing and analyzing real-world industrial data]
36128

37129
**What's inside**
38130

@@ -47,6 +139,31 @@ digital twins of physical warehouses around the world.
47139
- Real-World Applications: Exploration of actual customer use cases to
48140
illustrate how CrateDB can be applied in various industrial scenarios.
49141

142+
:Industry: {tags-secondary}`Logistics` {tags-secondary}`Shipping`
143+
:Tags: {tags-primary}`Data Historian` {tags-primary}`Industrial IoT` {tags-primary}`Digital Twin`
144+
:::
145+
146+
:::{grid-item} &nbsp;
147+
:columns: 4
148+
149+
<iframe width="240" class="speakerdeck-iframe" style="border: 0px; background: rgba(0, 0, 0, 0.1) padding-box; margin: 0px; padding: 0px; border-radius: 6px; box-shadow: rgba(0, 0, 0, 0.2) 0px 5px 40px; width: 100%; height: auto; aspect-ratio: 560 / 315;" frameborder="0" src="https://speakerdeck.com/player/acb78531a07e4238ac662539b0c23609" title=" Not all time-series are equal ​ Challenges of storing and analyzing industrial data" allowfullscreen="true" data-ratio="1.7777777777777777"></iframe>
150+
151+
**Date:** 23 Nov 2022 \
152+
**Speaker:** Marija Selakovic
153+
154+
155+
<iframe width="240" src="https://www.youtube-nocookie.com/embed/ugQvihToY0k?si=J0w5yG56Ld4fIXfm" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
156+
157+
**Date:** 5 Oct 2023 \
158+
**Speakers:** Alexander Mann, Georg Traar
159+
:::
160+
161+
::::
162+
163+
164+
50165

51-
[Challenges of Storing and Analyzing Industrial Data]: https://youtu.be/ugQvihToY0k?feature=shared
52-
[Fixing data silos in a high-speed logistics environment]: https://youtu.be/6dgjVQJtSKI?feature=shared
166+
[CrateDB: Challenges in industrial data]: https://speakerdeck.com/cratedb/not-all-time-series-are-equal-challenges-of-storing-and-analyzing-industrial-data
167+
[Rauch: High-Speed Production Lines]: https://youtu.be/gJPmJ0uXeVs?feature=shared
168+
[TGW: Fixing data silos in a high-speed logistics environment]: https://youtu.be/6dgjVQJtSKI?feature=shared
169+
[TGW: Storing and analyzing real-world industrial data]: https://youtu.be/ugQvihToY0k?feature=shared

0 commit comments

Comments
 (0)