// what I do
Areas of focus
Click any area to see the related projects.
Pipelines & Orchestration
Airflow and Kedro pipelines, containerized with Docker for reproducible runs.
see airflow projects
Lakehouse & Modeling
Databricks, Delta Lake, Unity Catalog governance, and dimensional data modeling.
see databricks projects
Spark & Declarative ETL
Apache Spark transforms and declarative pipeline patterns for batch workloads.
see pyspark projects
MLOps
End-to-end MLflow tracking, model registry, and deployment workflows.
see mlflow projects
Streamlit Apps
Internal tools, dashboards, and data products shipped as Streamlit apps.
see streamlit projects
Data Platforms
Building meshpatato — a sandbox for modern data engineering patterns.
see kedro projects