> Hey there! I am Shubham Dalvi
「 I am a data engineer with a passion for big data, distributed computing, and data visualization 」
✌️ Enjoy solving data problems
❤️ Passionate about big data technologies, distributed systems, and data visualizations
📧 Reach me: [email protected]
This project demonstrates how to run a PostgreSQL database in a Docker container, create tables, and transfer data from PostgreSQL to Snowflake using Airbyte. It covers setting up the environment, creating the database schema, and automating the data transfer.
- Technologies Used
- Skills Demonstrated
- Setup PostgreSQL on Docker
- Creating the Database and Tables
- Configuring Airbyte
- Data Transfer to Snowflake
- Usage Instructions
- PostgreSQL: For database management.
- Docker: To containerize and host the PostgreSQL instance.
- Airbyte: To extract and load data from PostgreSQL to Snowflake.
- Snowflake: For data warehousing.
- Python: For scripting and automation.
- Data Engineering: Setting up and managing a PostgreSQL database in Docker.
- Docker: Containerizing services for ease of use and portability.
- ETL Processes: Automating data transfer from PostgreSQL to Snowflake.
- Airbyte Configuration: Setting up and managing data connectors.
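Start by pulling the PostgreSQL Docker image and running a container. Publishing port 5432 with -p makes the database reachable from outside the container, so that tools such as Airbyte can connect to it.
docker pull postgres
docker run --name my_postgres -e POSTGRES_PASSWORD=mysecretpassword -p 5432:5432 -d postgres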
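For repeatable setups, the same two Docker commands can be driven from Python. The following is a minimal sketch, not part of the original project: the function names `postgres_run_command` and `start_postgres` are hypothetical, and the `-p` port mapping is an assumption made so that clients outside the container can connect.

```python
import subprocess


def postgres_run_command(name="my_postgres", password="mysecretpassword", host_port=5432):
    """Build the `docker run` argument list for a PostgreSQL container."""
    return [
        "docker", "run",
        "--name", name,
        "-e", f"POSTGRES_PASSWORD={password}",
        # Assumption: publish the container's port 5432 on the host.
        "-p", f"{host_port}:5432",
        "-d", "postgres",
    ]


def start_postgres(**kwargs):
    """Pull the image, then start the container; raises if Docker reports an error."""
    subprocess.run(["docker", "pull", "postgres"], check=True)
    subprocess.run(postgres_run_command(**kwargs), check=True)
```

Calling `start_postgres()` with no arguments uses the container name and password shown in this README. Passing the command as a list avoids shell quoting problems if the password contains special characters.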
Connect to the PostgreSQL instance running inside the Docker container.
docker exec -it my_postgres psql -U postgres
Create a new database for storing your data.
CREATE DATABASE my_database;
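Switch to the new database, then define the schema and create the tables. Without this step, the table is created in the default postgres database instead of my_database.
\c my_database
CREATE TABLE transactions (
    transaction_id SERIAL PRIMARY KEY,
    amount DECIMAL(10, 2),
    transaction_date DATE
);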
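To have something to sync, the table needs a few rows. The sketch below generates a small CSV file body that matches the `amount` and `transaction_date` columns. The sample values and the `transactions_csv` helper are made up for illustration; `transaction_id` is left out because `SERIAL` fills it in automatically.

```python
import csv
import io
from datetime import date
from decimal import Decimal


def transactions_csv(rows):
    """Render (amount, transaction_date) pairs as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["amount", "transaction_date"])
    for amount, day in rows:
        # Quantize to two decimal places to match DECIMAL(10, 2).
        writer.writerow([Decimal(amount).quantize(Decimal("0.01")), day.isoformat()])
    return buffer.getvalue()


# Illustrative sample data only.
sample_rows = [
    ("19.99", date(2024, 1, 5)),
    ("250.00", date(2024, 1, 6)),
    ("7.5", date(2024, 1, 7)),
]

print(transactions_csv(sample_rows), end="")
```

One way to load the result, assuming it is saved as transactions.csv: copy it into the container with `docker cp transactions.csv my_postgres:/tmp/transactions.csv`, then run `\copy transactions (amount, transaction_date) FROM '/tmp/transactions.csv' WITH (FORMAT csv, HEADER true)` in psql.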
Install and run Airbyte to handle the data transfer. Run the command below from the directory that contains Airbyte's Docker Compose file.
docker-compose up -d
In Airbyte, add PostgreSQL as a source and Snowflake as a destination, then create a connection between them. Select the tables and fields to sync as needed.
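It can help to collect the connector settings before opening the Airbyte forms. The sketch below is only a checklist in code: the field names reflect my understanding of the PostgreSQL source and Snowflake destination forms and may differ in your Airbyte version, the angle-bracket values are placeholders, and authentication fields for Snowflake are omitted because they vary. The `unfilled_fields` helper is hypothetical.

```python
# Assumed field names; verify them against the connector forms in your Airbyte version.
postgres_source = {
    "host": "<host reachable from Airbyte>",  # placeholder
    "port": 5432,
    "database": "my_database",
    "username": "postgres",
    "password": "mysecretpassword",
    "schemas": ["public"],
}

snowflake_destination = {
    "host": "<account>.snowflakecomputing.com",
    "role": "<role>",
    "warehouse": "<warehouse>",
    "database": "<database>",
    "schema": "<schema>",
    "username": "<username>",
}


def unfilled_fields(config, required):
    """Return required keys that are missing, empty, or still '<placeholder>' values."""
    unfilled = []
    for key in required:
        value = config.get(key)
        if value in (None, "", []):
            unfilled.append(key)
        elif isinstance(value, str) and "<" in value and ">" in value:
            unfilled.append(key)
    return unfilled


print(unfilled_fields(postgres_source, ["host", "port", "database", "username"]))
print(unfilled_fields(snowflake_destination, list(snowflake_destination)))
```

Any key that the helper reports still needs a real value before the source or destination will pass Airbyte's connection test.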
Trigger the data sync from PostgreSQL to Snowflake using Airbyte's interface or API.
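For the API route, a sync can be requested over HTTP. This is a minimal sketch under stated assumptions: it targets the `POST /api/v1/connections/sync` endpoint of Airbyte's Config API on `http://localhost:8000`, which may not match newer Airbyte releases, and it omits authentication. The connection ID is whatever Airbyte assigned to your connection; the function names are hypothetical.

```python
import json
import urllib.request


def build_sync_request(connection_id, base_url="http://localhost:8000"):
    """Build (but do not send) the HTTP request that asks Airbyte to start a sync."""
    body = json.dumps({"connectionId": connection_id}).encode("utf-8")
    return urllib.request.Request(
        # Assumption: Config API path; check the API docs for your Airbyte version.
        url=f"{base_url.rstrip('/')}/api/v1/connections/sync",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def trigger_sync(connection_id, base_url="http://localhost:8000"):
    """Send the request and return Airbyte's decoded JSON response."""
    with urllib.request.urlopen(build_sync_request(connection_id, base_url)) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes it easy to inspect the URL and payload before anything is sent to a running Airbyte instance.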
- Ensure Docker and Airbyte are installed and running.
- Set up PostgreSQL on Docker using the provided commands.
- Create the necessary tables in PostgreSQL.
- Configure Airbyte with the appropriate source and destination settings.
- Trigger the data sync and verify the results in Snowflake.