Crypto ELT Pipeline

Modern ELT pipeline for cryptocurrency market analysis using Dagster, PyAirbyte, dbt, DuckDB, Polars, and Streamlit.

Jan 2024 - Present (2 years, 3 months, ongoing)

Tech Stack

Python, dbt, Dagster, DuckDB, Polars, Streamlit

Overview

The pipeline analyzes cryptocurrency market trends through OHLC (open, high, low, close) candlestick charts and volatility metrics. It features incremental extraction, a Medallion architecture (Bronze/Silver/Gold layers), and full data lineage.
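To make the OHLC and volatility terms concrete, here is a minimal pure-Python sketch. It is not the project's code (the stack lists Polars and dbt for transformations), and this page does not say which volatility metric is used; the sketch assumes the sample standard deviation of close-to-close log returns. The names `Candle`, `to_daily_candles`, and `close_to_close_volatility` are hypothetical.

```python
import math
import statistics
from dataclasses import dataclass


@dataclass(frozen=True)
class Candle:
    """One OHLC candle for a single bucket (here: one calendar day)."""
    day: str
    open: float
    high: float
    low: float
    close: float


def to_daily_candles(ticks):
    """Group (iso_timestamp, price) ticks into daily OHLC candles.

    Assumes all timestamps share one ISO-8601 format and timezone, so that
    sorting them as strings also sorts them chronologically.
    """
    by_day = {}
    for ts, price in sorted(ticks):
        by_day.setdefault(ts[:10], []).append(price)  # "YYYY-MM-DD" prefix
    return [
        Candle(day, prices[0], max(prices), min(prices), prices[-1])
        for day, prices in sorted(by_day.items())
    ]


def close_to_close_volatility(candles):
    """Sample standard deviation of log returns between consecutive closes."""
    closes = [c.close for c in candles]
    returns = [math.log(b / a) for a, b in zip(closes, closes[1:])]
    if len(returns) < 2:
        raise ValueError("need at least three candles")
    return statistics.stdev(returns)
```

The result is a per-period figure (daily here); scaling it to an annualized number is a separate choice that this sketch leaves out.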

Key Features

Tech Stack

Layer           Technology
Orchestration   Dagster
Extraction      PyAirbyte
Transformation  dbt, Polars
Storage         DuckDB
Quality         Pandera
Visualization   Streamlit
API             FastAPI

Supported Cryptocurrencies

Bitcoin, Ethereum, XRP, Solana, Cardano, Avalanche, Polkadot, BNB, Chainlink, Dogecoin (10 coins)
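The incremental extraction mentioned in the Overview can be pictured as one cursor per supported coin: each run keeps only records newer than the last timestamp already loaded. The sketch below is an assumption-laden illustration in plain Python and does not use PyAirbyte's API; `extract_incremental`, `fetch`, and `cursors` are hypothetical names.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Record = Tuple[str, float]  # (ISO-8601 timestamp, price)


def extract_incremental(
    coins: Iterable[str],
    fetch: Callable[[str], List[Record]],
    cursors: Dict[str, str],
) -> Dict[str, List[Record]]:
    """Return only records newer than each coin's cursor, then advance it.

    `cursors` maps coin -> latest timestamp already loaded and is updated
    in place, so a second run over an unchanged source yields nothing new.
    """
    fresh: Dict[str, List[Record]] = {}
    for coin in coins:
        last_seen = cursors.get(coin, "")  # "" sorts before any timestamp
        new_rows = sorted(r for r in fetch(coin) if r[0] > last_seen)
        if new_rows:
            cursors[coin] = new_rows[-1][0]
        fresh[coin] = new_rows
    return fresh


# Usage with an in-memory stand-in for the real data source.
source = {"bitcoin": [("2024-01-01T00:00:00", 42000.5)]}
cursors: Dict[str, str] = {}
first_run = extract_incremental(["bitcoin"], lambda c: source[c], cursors)
second_run = extract_incremental(["bitcoin"], lambda c: source[c], cursors)
```

Here `first_run` carries the single Bitcoin record and `second_run` is empty for Bitcoin, because the cursor already points at the newest timestamp.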

Architecture

┌────────────────────────────────────────────────────────────────┐
│                     DuckDB (crypto.duckdb)                     │
│  ┌────────────────┐   ┌────────────────┐   ┌────────────────┐  │
│  │ raw schema     │ → │ staging schema │ → │ mart schema    │  │
│  │ (Bronze)       │   │ (Silver)       │   │ (Gold)         │  │
│  └────────┬───────┘   └────────┬───────┘   └────────┬───────┘  │
└───────────┼────────────────────┼────────────────────┼──────────┘
            │                    │                    │
     Dagster IO           dbt runs              dbt creates
     Manager writes       transformations       final tables


                                             ┌─────────────────┐
                                             │   Streamlit     │
                                             │   Dashboard     │
                                             └─────────────────┘
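
The Bronze → Silver → Gold flow in the diagram can be sketched in a few SQL statements. This is an illustration only: SQLite from the Python standard library stands in for DuckDB, table-name prefixes stand in for the raw/staging/mart schemas, and every table and column name is hypothetical. In the project itself, the Silver and Gold steps are dbt models.

```python
import sqlite3

# SQLite (stdlib) stands in for DuckDB; prefixes stand in for schemas.
con = sqlite3.connect(":memory:")

# Bronze: rows land exactly as extracted (text values, possible duplicates).
con.execute("CREATE TABLE raw_prices (coin TEXT, ts TEXT, price TEXT)")
con.executemany(
    "INSERT INTO raw_prices VALUES (?, ?, ?)",
    [
        ("bitcoin", "2024-01-01T00:00:00", "42000.5"),
        ("bitcoin", "2024-01-01T00:00:00", "42000.5"),  # duplicate delivery
        ("bitcoin", "2024-01-01T12:00:00", "43000.0"),
        ("bitcoin", "2024-01-02T00:00:00", "41000.0"),
    ],
)

# Silver: typed and deduplicated.
con.execute(
    """
    CREATE VIEW staging_prices AS
    SELECT DISTINCT coin, ts, CAST(price AS REAL) AS price
    FROM raw_prices
    """
)

# Gold: one row per coin and day, ready for a dashboard to read.
con.execute(
    """
    CREATE VIEW mart_daily_range AS
    SELECT coin,
           substr(ts, 1, 10) AS day,
           MAX(price) AS high,
           MIN(price) AS low,
           COUNT(*) AS n_ticks
    FROM staging_prices
    GROUP BY coin, substr(ts, 1, 10)
    ORDER BY coin, day
    """
)

daily = con.execute("SELECT * FROM mart_daily_range").fetchall()
```

Keeping Bronze untouched means a bug in a Silver or Gold model can be fixed and rebuilt without re-extracting anything.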

Lessons Learned
