Skip to content

Tutorials

Step-by-step learning paths for Spindle. Each tutorial explains concepts, walks through code, and links to runnable notebooks and example scripts.

Choose Your Track

graph LR
    A[New to Spindle?] --> B[Beginner Track]
    B --> C[Intermediate Track]
    C --> D[Fabric Track]
    C --> E[Advanced Track]

    F[Building Fabric pipelines?] --> D
    G[Automating with CI/CD?] --> E

Beginner Track

Zero to generating data. No prior Spindle experience required.

# Tutorial What you'll learn Time
01 Hello Spindle Install, generate your first dataset, verify FK integrity 10 min
02 Explore All Domains Survey all 13 domains, compare schemas and table counts 15 min
03 Custom Schemas Build a .spindle.json schema from scratch 20 min
04 Output Formats Export to CSV, Parquet, SQL INSERT, Excel, CDM 15 min

Prerequisites: Python 3.10+, basic pandas familiarity

Intermediate Track

Real-world data engineering patterns. Requires completing the Beginner track (or equivalent experience).

# Tutorial What you'll learn Time
05 Star Schema Transform 3NF to dimensional model with surrogate keys 20 min
06 Streaming Emit events with rate limiting, anomalies, and burst windows 20 min
07 Chaos Engineering Inject realistic data quality issues to test pipeline resilience 20 min
08 Validation Gates Run quality checks and quarantine bad records 15 min
09 Composite Domains Generate multi-domain datasets with cross-domain FK relationships 20 min
18 Fidelity Reporting Profile real data, infer a schema, generate synthetic, and measure statistical fidelity 20 min

Prerequisites: Beginner track, familiarity with data pipelines

Fabric Track

Microsoft Fabric workflows. Write to Lakehouse, Warehouse, SQL Database, and Eventstream.

# Tutorial What you'll learn Time
10 Fabric Lakehouse Write Delta tables to Lakehouse, query with Spark SQL 20 min
11 Fabric Warehouse Generate DDL and load a Fabric Data Warehouse 15 min
12 Fabric Streaming Stream events to Fabric Eventstream with anomaly injection 20 min
13 Medallion Architecture Build a Bronze/Silver/Gold pipeline with chaos and validation 25 min
14 Lakehouse Profiling Profile Fabric Delta tables, infer schema, generate synthetic, measure fidelity 25 min

Prerequisites: Intermediate track, Fabric workspace with a Lakehouse

Advanced Track

Automation, declarative pipelines, and Day 2 operations.

# Tutorial What you'll learn Time
15 Scenario Packs Run pre-built YAML scenario packs for end-to-end workflows 20 min
16 GSL Specs Write declarative YAML specs that orchestrate generation, chaos, and validation 15 min
17 Day 2 Operations Incremental generation, time-travel snapshots, PII masking 25 min
18 CI Integration Automate data generation in CI/CD pipelines 15 min

Prerequisites: Intermediate track, CI/CD experience (for tutorial 18)


How Tutorials, Guides, and Examples Relate

Content type Purpose Where
Tutorials (you're here) Learn concepts step by step docs/tutorials/
Guides Reference for a specific feature docs/guides/
Example scripts Runnable Python code examples/scenarios/
Notebooks Interactive Jupyter notebooks examples/notebooks/

Tutorials reference guides for deep dives and link to notebooks/scripts for hands-on practice.