projects

Selected work

all repos on GitHub

Waste Reduction Hub

Streamlit app that turns messy retail operations data into clear waste signals — hotspots by stream, weekly trends, and downloadable summaries for the ops team.

Fuse and Forge

Streamlit utility that fuses scattered CSV/Parquet drops into a single typed dataset — schema inference, dedupe, one-click export — plus a playground for forging quick data prototypes.

marvel-characters

End-to-end MLOps pipeline for Marvel character data — ingestion, training, tracking, and deployment.

Spark Declarative Pipelines

Experiments with declarative pipeline patterns on Apache Spark, captured in notebooks.

bytemaster · Databricks App

Databricks-native application exploring app patterns on the lakehouse platform.

unity_catalog

Hands-on work with Databricks Unity Catalog — governance, lineage, and access patterns.

kedro_databricks

Running Kedro pipelines on Databricks — packaging, deployment, and orchestration notes.

data_engineering_with_kedro

Reference Kedro project demonstrating modular data engineering pipelines.

data_modeling

Notebooks exploring dimensional and analytical data modeling techniques.

docker-airflow-master

Dockerised Apache Airflow setup for local development of orchestrated pipelines.

Resume · Streamlit

Interactive résumé built as a Streamlit app.

MLFlow

Notebooks demonstrating MLflow tracking, projects, and model registry.

Flask

Building web APIs with Flask for serving data and model endpoints.

chatbot

Python chatbot experiment.

streamlit_app

Streamlit application sandbox for rapid data app prototyping.

Streamlit

Collection of Streamlit experiments and component patterns.

de_case_study

Data engineering case study — modeling, transformations, and pipeline design.

inventory

Inventory management prototype exploring data flow and storage patterns.

bytemaster_stocks

Stocks data exploration under the bytemaster umbrella.

stocks

Stock market data experiments and analysis scripts.

MLOps

MLOps testing ground — CI, packaging, and deployment experiments.

meshpatato

Source for this portfolio site.

retail_store_app

Retail Store POS Streamlit app — Fuse & Forge: cashier, customer lookup, product catalog, tax, and checkout.

neon_database

Neon Postgres setup powering the Retail Data Intelligence Ecosystem.

Data Ingestion · Lakeflow Declarative Pipeline

Declarative Lakeflow ingestion pipeline for the retail data ecosystem.