Skip to content

Commit d9ba13b

Browse files
authored
Merge pull request #3729 from slabko/azure-data-factory-manual
New article "Bringing Azure Data into ClickHouse"
2 parents 7544225 + 87f3351 commit d9ba13b

34 files changed

+539
-1
lines changed

docs/integrations/data-ingestion/azure-data-factory/index.md

Lines changed: 503 additions & 0 deletions
Large diffs are not rendered by default.

docs/integrations/data-ingestion/data-ingestion-index.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -16,6 +16,7 @@ For more information check out the pages below:
1616
| [Apache Spark](/integrations/apache-spark) | A multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters |
1717
| [Amazon Glue](/integrations/glue) | A fully managed, serverless data integration service provided by Amazon Web Services (AWS) simplifying the process of discovering, preparing, and transforming data for analytics, machine learning, and application development. |
1818
| [Azure Synapse](/integrations/azure-synapse) | A fully managed, cloud-based analytics service provided by Microsoft Azure, combining big data and data warehousing to simplify data integration, transformation, and analytics at scale using SQL, Apache Spark, and data pipelines. |
19+
| [Azure Data Factory](/integrations/azure-data-factory) | A cloud-based data integration service that enables you to create, schedule, and orchestrate data workflows at scale. |
1920
| [Apache Beam](/integrations/apache-beam) | An open-source, unified programming model that enables developers to define and execute both batch and stream (continuous) data processing pipelines. |
2021
| [dbt](/integrations/dbt) | Enables analytics engineers to transform data in their warehouses by simply writing select statements. |
2122
| [dlt](/integrations/data-ingestion/etl-tools/dlt-and-clickhouse) | An open-source library that you can add to your Python scripts to load data from various and often messy data sources into well-structured, live datasets. |

docs/integrations/data-ingestion/data-sources-index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
slug: /integrations/index
3-
keywords: ['AWS S3', 'PostgreSQL', 'Kafka', 'MySQL', 'Cassandra', 'Redis', 'RabbitMQ', 'MongoDB', 'Google Cloud Storage', 'Hive', 'Hudi', 'Iceberg', 'MinIO', 'Delta Lake', 'RocksDB', 'Splunk', 'SQLite', 'NATS', 'EMQX', 'local files', 'JDBC', 'ODBC']
3+
keywords: ['AWS S3', 'Azure Data Factory', 'PostgreSQL', 'Kafka', 'MySQL', 'Cassandra', 'Data Factory', 'Redis', 'RabbitMQ', 'MongoDB', 'Google Cloud Storage', 'Hive', 'Hudi', 'Iceberg', 'MinIO', 'Delta Lake', 'RocksDB', 'Splunk', 'SQLite', 'NATS', 'EMQX', 'local files', 'JDBC', 'ODBC']
44
description: 'Datasources overview page'
55
title: 'Data Sources'
66
---

docs/integrations/index.mdx

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -82,6 +82,7 @@ import Warpstreamsvg from '@site/static/images/integrations/logos/warpstream.svg
8282
import Bytewaxsvg from '@site/static/images/integrations/logos/bytewax.svg';
8383
import glue_logo from '@site/static/images/integrations/logos/glue_logo.png';
8484
import azure_synapse_logo from '@site/static/images/integrations/logos/azure-synapse.png';
85+
import azure_data_factory_logo from '@site/static/images/integrations/logos/azure-data-factory.png';
8586
import logo_cpp from '@site/static/images/integrations/logos/logo_cpp.png';
8687
import cassandra from '@site/static/images/integrations/logos/cassandra.png';
8788
import deltalake from '@site/static/images/integrations/logos/deltalake.png';
@@ -206,6 +207,7 @@ We are actively compiling this list of ClickHouse integrations below, so it's no
206207
|Apache Spark|<Sparksvg alt="Amazon Spark logo" style={{width: '3rem'}}/>|Data ingestion|Spark ClickHouse Connector is a high performance connector built on top of Spark DataSource V2.|[GitHub](https://github.com/housepower/spark-clickhouse-connector),<br/>[Documentation](/integrations/data-ingestion/apache-spark/index.md)|
207208
|Azure Event Hubs|<Azureeventhubssvg alt="Azure Events Hub logo" style={{width: '3rem'}}/>|Data ingestion|A data streaming platform that supports Apache Kafka's native protocol|[Website](https://azure.microsoft.com/en-gb/products/event-hubs)|
208209
|Azure Synapse|<Image img={azure_synapse_logo} size="logo" alt="Azure Synapse logo"/>|Data ingestion|A cloud-based analytics service for big data and data warehousing.|[Documentation](/integrations/azure-synapse)|
210+
|Azure Data Factory|<Image img={azure_data_factory_logo} size="logo" alt="Azure Data Factory logo"/>|Data ingestion|A cloud-based data integration service that enables you to create, schedule, and orchestrate data workflows at scale.|[Documentation](/integrations/azure-data-factory)|
209211
|C++|<Image img={logo_cpp} alt="Cpp logo" size="logo"/>|Language client|C++ client for ClickHouse|[GitHub](https://github.com/ClickHouse/clickhouse-cpp)|
210212
|Cassandra|<Image img={cassandra} alt="Cassandra logo" size="logo"/>|Data ingestion|Allows ClickHouse to use [Cassandra](https://cassandra.apache.org/) as a dictionary source.|[Documentation](/sql-reference/dictionaries/index.md#cassandra)|
211213
|CHDB|<Chdbsvg alt="CHDB logo" style={{width: '3rem' }}/>|AI/ML|An embedded OLAP SQL Engine|[GitHub](https://github.com/chdb-io/chdb#/),<br/>[Documentation](https://doc.chdb.io/)|

scripts/aspell-dict-file.txt

Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1003,3 +1003,34 @@ clickpipes
10031003
clickpipes
10041004
--docs/integrations/data-ingestion/clickpipes/mysql/source/rds_maria.md--
10051005
clickpipes
1006+
--docs/integrations/data-ingestion/azure-data-factory/index.md--
1007+
DataItem
1008+
ServiceBase
1009+
adfCopyDataDebugSuccess
1010+
adfCopyDataSinkSelectPost
1011+
adfCopyDataSource
1012+
adfCreateLinkedServiceButton
1013+
adfLinkedServicesList
1014+
adfNewCopyDataItem
1015+
adfNewDatasetConnectionSuccessful
1016+
adfNewDatasetItem
1017+
adfNewDatasetPage
1018+
adfNewDatasetProperties
1019+
adfNewDatasetQuery
1020+
adfNewLinedServicePane
1021+
adfNewLinkedServiceBaseUrlEmpty
1022+
adfNewLinkedServiceCheckConnection
1023+
adfNewLinkedServiceExpressionFieldFilled
1024+
adfNewLinkedServiceParams
1025+
adfNewLinkedServiceSearch
1026+
adfNewPipelineItem
1027+
azureDataFactoryPage
1028+
azureDataStoreAccessKeys
1029+
azureDataStoreSettings
1030+
azureHomePage
1031+
azureHomeWithDataFactory
1032+
azureNewDataFactory
1033+
azureNewDataFactoryConfirm
1034+
azureNewDataFactorySuccess
1035+
azureNewResourceAnalytics
1036+
microsoft

sidebars.js

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -948,6 +948,7 @@ const sidebars = {
948948
],
949949
},
950950
"integrations/data-ingestion/aws-glue/index",
951+
"integrations/data-ingestion/azure-data-factory/index",
951952
"integrations/data-ingestion/azure-synapse/index",
952953
"integrations/data-ingestion/etl-tools/apache-beam",
953954
{
Loading

0 commit comments

Comments
 (0)