I'm a Data Scientist with expertise in Machine Learning, NLP, Generative AI, and Large Language Models. Currently, I'm pursuing my Masterβs in Data Analytics at San Jose State University. I'm passionate about solving complex problems using data and creating impactful solutions for the future.
My journey into Data Science and AI has been fueled by curiosity, problem-solving, and a drive to transform data into meaningful insights. With a strong foundation in Machine Learning, NLP, and Generative AI, I have worked on predictive modeling, large-scale data analysis, and AI-powered automation.
Currently, Iβm pursuing a Masterβs in Data Analytics at San Jose State University (Graduating May 2025). My experience spans across startups (Emigrait, Varidus & Sensegrass) and industry leaders like Samsung, where I have developed end-to-end ML pipelines, optimized models for business applications, and worked on LLMs, LangChain, and AI-driven search systems.
I specialize in Machine Learning (Supervised, Unsupervised, XGBoost, Random Forest), NLP (Transformers, RAG, Prompt Engineering), Deep Learning (CNNs, RNNs, LSTMs), and Large Language Models (OpenAI, Gemini, Hugging Face, LangChain). My skill set includes Python, SQL, MongoDB, TensorFlow, PyTorch, Data Visualization (Tableau, Power BI), and Cloud (AWS, GCP, Azure).
I am passionate about applying AI to solve real-world problems, optimizing predictive models, and enhancing explainability in ML systems. I thrive on building scalable AI solutions, improving LLM consistency, and integrating cutting-edge AI techniques into business strategies.
Letβs connect! Iβm always open to collaborations, discussions, and innovative AI-driven projects. Check out my GitHub for my latest work, or reach out via LinkedIn or via email [email protected].
- Statistical Programming Language:
Python
Numpy
Pandas
SciPy
,R
- Machine Learning and Deep Learning Frameworks:
Scikit-Learn
PyTorch
Tensorflow
- Cloud and Databases:
AWS
Google GCP
Microsoft Azure
SQL Server
MySQL
BigQuery
- Data Visualization Tools:
Matplotlib
Seaborn
Tableau
PowerBI
Looker
Google Analytics
- AI Prowess:
Large Language Models (LLMs)
Retrieval Augmented Systems (RAGs)
Fine-Tuning
Generative AI
Agentic AI
Transformers
LangChain
GPTs
HuggingFace
Gemini
Llama
WhisperAI
BERT
RoBERTa
BART
LangGraph
Groq
OpenAI
Diffusion Models
GANs
CGANs
StyleGANs
- Machine Learning:
Supervised Learning
Unsupervised Learning
Regression
Classification
Random Forest
XGBoost
Ensemble Learning
- Natural Language Processing (NLP):
NLTK
SpaCy
Embedding Models
- Deep Learning:
ANNs
CNNs
RNNs
LSTMs
GRUs
Project Name | Description | Languages/ Tools Used | Repository Link |
---|---|---|---|
Multimodal AI Agent for Web Search & Stock Analysis![]() |
This project demonstrates a multimodal AI agent setup that integrates different AI models and tools for web searching and stock analysis. The AI agents use the Groq model (powered by Llama-3.3-70b) to gather and analyze information from web search engines (DuckDuckGo) and financial data sources (Yahoo Finance) in a highly collaborative manner. | LLaMA-3.3-70b (LLM), DuckDuckGo API, YFinanceTools, Groq Model API | Project |
Multimodal AI Agent - News Summarization & Sentiment Analysis |
This project presents the design and implementation of a personalized news aggregator built using multi-modal AI techniques. The system collects news articles, generates summaries using language models, and performs sentiment analysis to deliver relevant and customized content to users. The agent leverages state-of-the-art AI models and frameworks, demonstrating how intelligent automation can be applied to enhance user experiences. | NLTK, Requests, NewsAPI, OpenAI GPT-3.5 Turbo (LLM) | Project |
Text Generation Web Application![]() |
This project showcases an AI-Powered Text Generator built using Flask, the Google Gemini model, and the LangChain framework. The app allows users to input text and receive AI-generated content instantly, providing a powerful tool for content creators and innovators in the AI space. | LangChain, Google Gemini 1.5 Pro (LLM), Flask (Web Development) | Project |
Querying PDFs with AI![]() |
In this project, an AI-powered system is built that intelligently queries and extracts answers from PDF documents. By leveraging tools like LangChain, FAISS, and OpenAI, we transformed raw text into searchable data, enabling the automatic retrieval of relevant information. This project serves as a practical introduction to using AI for document analysis, demonstrating how to automate the process of finding answers within large PDF files efficiently and effectively. | LangChain, FAISS, OpenAI API | Project |
Fashion Recommendation System Using Image Features![]() |
This project demonstrates the process of building a Fashion Recommendation System using image features. By leveraging computer vision and pre-trained deep learning models, this system analyzes the visual characteristics of fashion items (e.g., color, texture, style) and recommends similar or complementary products. | Tensorflow, VGG16 | Project |
- Advanced deep learning techniques
- Real-time data streaming and processing
- Deployment of AI models using Docker & Kubernetes
- Building efficient NLP pipelines with Hugging Face and LangChain
Feel free to reach out to me through via email or any of the following platforms:
- Email: [email protected]
- LinkedIn: Ravjot Singh
- GitHub: Ravjot Singh
- Medium: Ravjot's Blog
Thank you for visiting my GitHub! π