Welcome to my GitHub profile! I'm a Data Science enthusiast passionate about turning data into meaningful insights. I enjoy solving real-world problems using core techniques in Statistics and Machine Learning, advanced methods in Natural Language Processing, Deep Learning, and Generative AI, and modern AI Frameworks such as LangChain, LangGraph, and Agentic AI. I focus on building intelligent systems that can reason, plan, and interact autonomously.
- Languages: Python, SQL, R
- ML & DS Tools: Scikit-learn, Pandas, NumPy, Matplotlib, Seaborn, NLTK, Statsmodels
- ML Techniques: Regression, Classification, Clustering, Causal Inference, NLP
- Deep Learning & Generative AI: PyTorch, Keras, LLMs, RAG systems
- AI Frameworks: LangChain, LangGraph, Agentic AI
- Web & Deployment: Streamlit, Flask, Pickle, Git/GitHub
- Other Tools: Jupyter, VS Code, Google Colab, Excel, Tableau
Interactive app that answers questions about YouTube videos using their transcripts, powered by a RAG pipeline with FAISS embeddings and Google Gemini LLM.
Causal inference using OLS regression to estimate impact of skilled management interventions.
Email spam classifier using Multinomial Naive Bayes, deployed with Streamlit.
Content-based movie recommendation engine using NLTK lemmatization and cosine similarity.
Unsupervised learning on Mauna Loa gas data using PCA and K-Means clustering.
- Deep Learning with PyTorch
- Time Series Forecasting
- Docker for ML model deployment
Thanks for stopping by! ๐