Skip to content

Commit 30de813

Browse files
committed
Solutions: Add explanation texts, by CodeRabbit
Teaser texts have been missing on the "time series", "analytics", and "machine learning" sections.
1 parent 54b2c19 commit 30de813

File tree

4 files changed

+74
-39
lines changed

4 files changed

+74
-39
lines changed

docs/solution/analytics/index.md

Lines changed: 20 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -5,24 +5,26 @@
55
CrateDB provides real-time analytics on raw data stored for the long term.
66
:::
77

8-
In all domains of real-time analytics where you absolutely must have access to all
9-
the records, and can't live with any down-sampled variants, because records are
10-
unique, and need to be accounted for within your analytics queries.
11-
12-
If you find yourself in such a situation, you need a storage system which
13-
manages all the high-volume data in its hot zone, to be available right on
14-
your fingertips, for live querying. Batch jobs to roll up raw data into
15-
analytical results are not an option, because users' queries are too
16-
individual, so you need to run them on real data in real time.
17-
18-
:::{todo}
19-
**Instructions:**
20-
Elaborate a bit longer about the topic domain and the ingredients of this section
21-
in an abstract way, concisely highlighting and summarizing relevant benefits,
22-
like the `../analytics/index`, `../industrial/index`, and `../longterm/index`
23-
pages are doing it already.
24-
Use concise language, active voice, and avoid yapping.
25-
:::
8+
CrateDB eliminates the trade-off between data accessibility and storage costs
9+
by keeping all high-volume raw data in the hot zone without requiring
10+
downsampling or aggregation. Unlike traditional systems that force you to
11+
choose between real-time query capabilities and long-term retention,
12+
CrateDB handles billions of unique records while maintaining fast query
13+
performance on the full dataset.
14+
15+
Traditional analytics pipelines rely on pre-aggregated rollups or batch
16+
processing to handle query loads, limiting users to predefined metrics
17+
and losing the granularity needed for ad-hoc analysis. CrateDB's
18+
distributed architecture scales horizontally to support individual,
19+
exploratory queries on complete raw datasets in real time, enabling
20+
analysts to discover insights that would be invisible in downsampled data.
21+
22+
By keeping all records immediately available for querying, you avoid the
23+
complexity of maintaining separate hot and cold storage tiers, ETL
24+
pipelines for aggregation, or data movement processes. Your analytics
25+
queries run directly on raw data across any time range, delivering the
26+
accuracy and flexibility that business intelligence and data science
27+
teams require.
2628

2729
With CrateDB, compatible to PostgreSQL, you can do all of that using plain SQL.
2830
Other than integrating well with commodity systems using standard database

docs/solution/index.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,6 +3,12 @@
33

44
# Solutions and use cases
55

6+
:::{div} sd-text-muted
7+
CrateDB is a distributed and scalable SQL database for storing and analyzing
8+
massive amounts of data in near real-time, even with complex queries. It is
9+
PostgreSQL-compatible, and based on Lucene.
10+
:::
11+
612
:::{toctree}
713
:hidden:
814
time-series/index
@@ -12,7 +18,6 @@ analytics/index
1218
machine-learning/index
1319
:::
1420

15-
1621
## Explanations
1722

1823
:::{div} sd-text-muted

docs/solution/machine-learning/index.md

Lines changed: 24 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -11,14 +11,30 @@ CrateDB provides a vector type natively, and adapters for integrating
1111
with machine learning frameworks.
1212
:::
1313

14-
:::{todo}
15-
**Instructions:**
16-
Elaborate a bit longer about the topic domain and the ingredients of this section
17-
in an abstract way, concisely highlighting and summarizing relevant benefits,
18-
like the `../analytics/index`, `../industrial/index`, and `../longterm/index`
19-
pages are doing it already.
20-
Use concise language, active voice, and avoid yapping.
21-
:::
14+
Modern AI and machine learning applications demand efficient storage and
15+
retrieval of high-dimensional vectors, seamless integration with ML frameworks,
16+
and the ability to combine traditional analytics with semantic search capabilities.
17+
From retrieval-augmented generation (RAG) systems to predictive maintenance models,
18+
organizations need a unified platform that handles vector embeddings, training datasets,
19+
and production model artifacts without juggling multiple specialized systems.
20+
21+
CrateDB unifies vector search, time series analysis, and ML operations in a single
22+
platform. Store and query high-dimensional embeddings using native FLOAT_VECTOR support
23+
with HNSW-based similarity search, integrate directly with LangChain and LlamaIndex for
24+
AI applications, and leverage MLflow and PyCaret for end-to-end MLOps workflows. Whether
25+
you're building semantic search engines, training forecasting models on large time series
26+
datasets, or implementing hybrid search combining full-text and vector similarity, CrateDB
27+
eliminates data movement and infrastructure complexity.
28+
29+
By keeping vector embeddings, training data, and model metadata in one queryable system,
30+
you avoid the overhead of synchronizing between specialized vector databases, data lakes,
31+
and model registries. Your ML pipelines remain agile, your queries span structured and
32+
vector data seamlessly, and your infrastructure stays lean.
33+
34+
With CrateDB, compatible to PostgreSQL, you can do all of that using plain SQL.
35+
Other than integrating well with commodity systems using standard database
36+
access interfaces like ODBC or JDBC, it provides a proprietary HTTP interface
37+
on top.
2238

2339
## Vector store
2440

docs/solution/time-series/index.md

Lines changed: 24 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -5,18 +5,30 @@
55
Use CrateDB to store and query massive amounts of time series data.
66
:::
77

8-
CrateDB is a distributed and scalable SQL database for storing and analyzing
9-
massive amounts of data in near real-time, even with complex queries. It is
10-
PostgreSQL-compatible, and based on Lucene.
11-
12-
:::{todo}
13-
**Instructions:**
14-
Elaborate a bit longer about the topic domain and the ingredients of this section
15-
in an abstract way, concisely highlighting and summarizing relevant benefits,
16-
like the `../analytics/index`, `../industrial/index`, and `../longterm/index`
17-
pages are doing it already.
18-
Use concise language, active voice, and avoid yapping.
19-
:::
8+
Time series data represents one of the fastest-growing data types across industries,
9+
from IoT sensors and industrial equipment to application metrics and financial transactions.
10+
The challenge lies not just in handling the sheer volume of incoming data points, but in
11+
maintaining query performance across both real-time streams and historical datasets while
12+
managing storage costs effectively.
13+
14+
Traditional databases struggle with the unique characteristics of time series workloads:
15+
high write throughput, time-based queries spanning variable ranges, the need for downsampling
16+
and aggregation, and retention policies that balance storage with analytical requirements.
17+
Many organizations find themselves cobbling together multiple systems—one for ingestion,
18+
another for querying, and yet another for long-term storage—creating operational complexity
19+
and data silos.
20+
21+
CrateDB handles time series data natively through its distributed architecture, combining
22+
high-speed ingestion with powerful SQL analytics across any time range. Its partitioning
23+
capabilities enable efficient data lifecycle management, while built-in functions for
24+
downsampling, interpolation, and time-window operations simplify complex analytical tasks.
25+
You can query billions of data points in seconds, whether analyzing recent trends or exploring
26+
patterns across years of historical data.
27+
28+
With CrateDB, compatible to PostgreSQL, you can do all of that using plain SQL.
29+
Other than integrating well with commodity systems using standard database
30+
access interfaces like ODBC or JDBC, it provides a proprietary HTTP interface
31+
on top.
2032

2133
::::{grid} 1 2 2 2
2234
:margin: 4 4 0 0

0 commit comments

Comments
 (0)