Project Index

Projects

Portfolio projects from data engineering and AI delivery, each linked to verified public sources when available.

Built for measurable production outcomes

Nov 2025 - Dec 2025

Automated Job Market Data Pipeline

Data Engineering (GCP)

  • Built automated ingestion with GCP Cloud Functions and Cloud Scheduler.
  • Implemented raw landing zone in Cloud Storage and incremental ELT in BigQuery.
  • Designed analytics-ready datasets with partitioning, clustering, and data quality checks.
GCP Cloud FunctionsCloud SchedulerCloud StorageBigQueryELT

Oct 2024 - Dec 2024

Advanced RAG with MLflow

MLOps / AI

  • Developed a RAG pipeline for document upload, embedding storage, and contextual querying.
  • Integrated MLflow for experiment tracking and pipeline deployment.
  • Optimized ChromaDB ingestion/retrieval for speed and persistence tradeoffs.
LangflowLlamaIndexLangChainReactFlaskChromaDBOllamaMLflow

Jun 2024 - Oct 2024

Chat with your Resume

AI Engineering

  • Built a CPU-friendly RAG system to chat with resume content in a web interface.
  • Applied caching and fast inference patterns for constrained hardware.
  • Combined retrieval, reranking, and LLM serving across open model providers.
LangChainPythonReactFlaskChromaDBFAISSHuggingFaceCohereRedisGroq

Apr 2023 - May 2024

Solar Energy Prediction

Machine Learning

  • Built a weekly solar energy forecasting model with reported 97% accuracy.
  • Used an LSTM architecture combined with attention mechanisms.
  • Processed 50hertz API historical data (2010-2019) for training and evaluation.
LSTMAttentionScikit-learnNumPyPandasMatplotlib