Project Index
Projects
Portfolio projects from data engineering and AI delivery, each linked to verified public sources when available.
Nov 2025 - Dec 2025
Automated Job Market Data Pipeline
Data Engineering (GCP)
- Built automated ingestion with GCP Cloud Functions and Cloud Scheduler.
- Implemented raw landing zone in Cloud Storage and incremental ELT in BigQuery.
- Designed analytics-ready datasets with partitioning, clustering, and data quality checks.
GCP Cloud FunctionsCloud SchedulerCloud StorageBigQueryELT
Oct 2024 - Dec 2024
Advanced RAG with MLflow
MLOps / AI
- Developed a RAG pipeline for document upload, embedding storage, and contextual querying.
- Integrated MLflow for experiment tracking and pipeline deployment.
- Optimized ChromaDB ingestion/retrieval for speed and persistence tradeoffs.
LangflowLlamaIndexLangChainReactFlaskChromaDBOllamaMLflow
Jun 2024 - Oct 2024
Chat with your Resume
AI Engineering
- Built a CPU-friendly RAG system to chat with resume content in a web interface.
- Applied caching and fast inference patterns for constrained hardware.
- Combined retrieval, reranking, and LLM serving across open model providers.
LangChainPythonReactFlaskChromaDBFAISSHuggingFaceCohereRedisGroq
Apr 2023 - May 2024
Solar Energy Prediction
Machine Learning
- Built a weekly solar energy forecasting model with reported 97% accuracy.
- Used an LSTM architecture combined with attention mechanisms.
- Processed 50hertz API historical data (2010-2019) for training and evaluation.
LSTMAttentionScikit-learnNumPyPandasMatplotlib