Hi there, I am

Sameer Baig.

AL/ML Engineer đź’» Chatbots | Gen & Local AI Modeling

Check my work

Services

icon-3

AI Chatbots & Virtual Assistants

I build production-grade chatbots with RAG pipelines, fine-tuning, and domain-specific LLMs for precise, context-aware responses. Integrations include help desks, CRMs, and internal tools—shipped with auth, rate limits, analytics, and fallbacks for reliability.

icon-3

Generative AI Modeling & Local AI

I deliver custom and composable gen-AI features (images, text, audio, code). Where privacy/costs matter, I deploy local or VPC-isolated models to keep data in-house. Credit metering, caching, and observability ensure predictable performance and spend.

icon-3

Natural Language Processing (NLP)

From embeddings to classic ML, I implement sentiment analysis, summarization, topic modeling, and recommendation systems. Data pipelines, evaluation harnesses, and A/B testing are standard so models stay accurate and useful over time.

icon-3

Computer Vision

I build CV workloads like detection, OCR, and segmentation using YOLO/R-CNN/OpenCV. Deployed as scalable APIs with proper batching, GPU utilization, and latency budgets—plus guardrails and QA datasets for consistent real-world accuracy.

icon-3

Predictive Modeling & Time Series

I implement forecasting, anomaly detection, and causal features for KPIs and operations. Expect clean feature stores, backtests, and drift monitoring so predictions remain trustworthy as data and seasonality shift.

icon-3

Reinforcement Learning

For sequential decision problems, I prototype and productionize RL agents with safe reward design, simulation loops, and offline evaluation. Telemetry, replay buffers, and guardrails keep learning stable and auditable.

Experiences

Feb 2023 - Apr 2025

AI/ML Engineer

@DataNova Analytics

Designed and deployed ML solutions across finance, healthcare, and retail.

Built custom NLP pipelines for sentiment analysis and entity recognition.

Developed time-series forecasting models (ARIMA, LSTM, Prophet).

Used TensorFlow, PyTorch, and Scikit-learn for deep learning projects.

Led model deployments using AWS EC2 and Docker.

Collaborated cross-functionally to deliver data-driven products.

Dec 2021 - Jan 2023

Machine Learning Research Associate

@VisionTech Labs

Built object detection systems using YOLOv4 and R-CNN.

Experimented with GANs for synthetic image generation.

Published internal docs on model optimization and transfer learning.

Worked with PhDs and research engineers on deep learning experiments.

Assisted in deploying models on embedded and edge AI hardware.

Oct 2020 - Nov 2021

Junior Data Scientist

@ByteShift Solutions

Cleaned and structured large datasets using Pandas and NumPy.

Built regression and classification models for internal projects.

Assisted senior data scientists with feature engineering and evaluation.

Worked on early NLP tasks like text classification.

Created data visualization dashboards with Matplotlib and Seaborn.

Skills

Programming Languages

logo-Python

Python

logo-TypeScript

TypeScript

logo-JavaScript

JavaScript

AI/ML Frameworks

logo-PyTorch

PyTorch

logo-TensorFlow

TensorFlow

logo-scikit-learn

scikit-learn

logo-Keras

Keras

logo-Hugging Face

Hugging Face

NLP & Computer Vision

logo-spaCy

spaCy

logo-OpenAI

OpenAI

logo-OpenCV

OpenCV

Data Science

logo-NumPy

NumPy

logo-Pandas

Pandas

logo-Jupyter

Jupyter

logo-SciPy

SciPy

logo-Plotly

Plotly

Databases

logo-PostgreSQL

PostgreSQL

logo-MongoDB

MongoDB

logo-MySQL

MySQL

logo-Redis

Redis

logo-SQLite

SQLite

DevOps & MLOps

logo-Docker

Docker

logo-Kubernetes

Kubernetes

logo-Terraform

Terraform

logo-MLflow

MLflow

logo-Apache Airflow

Apache Airflow

Cloud & Hosting

logo-Google Cloud

Google Cloud

logo-Vercel

Vercel

logo-Netlify

Netlify

APIs & Developer Tools

logo-Node.js

Node.js

logo-Express

Express

logo-Postman

Postman

logo-Git

Git

logo-GitHub

GitHub

Recent Works

project-PrepwiseBot — Voice Interview Coach

PrepwiseBot — Voice Interview Coach

Public

Voice-first mock interview system that plans topics from candidate intake, runs a live VAPI session, and outputs rubric-scored feedback with a study plan (Gemni orchestration).

Gemni

VAPI

project-ImaginePro — Generative Image Editing

ImaginePro — Generative Image Editing

Public

SaaS for object removal, recolor, background change, generative fill, and restore. Deterministic params, NSFW guardrails, and credit billing with Stripe; Cloudinary Generative under the hood.

Cloudinary

Redis

BullMQ

project-PandocAI — Platform-Grounded Healthcare Assistant

PandocAI — Platform-Grounded Healthcare Assistant

Public

RAG-first chatbot that answers policy & doctor-finder queries with citations, tool calling for slot peek & prefilled booking links, and strict non-diagnostic guardrails.

RAG

OpenAI

Qdrant

project-OCR-SOL — Confidence-Aware OCR Pipeline

OCR-SOL — Confidence-Aware OCR Pipeline

Public

CV-first pipeline with OpenCV preprocessing, engine routing (Tesseract/Paddle), confidence scoring, regex/dictionary post-correction, and optional LLM span cleanup.

OpenCV

Tesseract

PaddleOCR

LLM

project-MyPDFBuddy — Streaming RAG Chat for PDFs

MyPDFBuddy — Streaming RAG Chat for PDFs

Public

Uploads & parses PDFs, indexes chunks with page anchors, and answers with streaming responses and page-level citations; table extraction and optional OCR for scans.

RAG

OpenAI

Qdrant

SSE

project-FaceSight — Real-Time Face Recognition

FaceSight — Real-Time Face Recognition

Public

Webcam capture, Haar cascade detection, dataset builder, and LBPH recognition with live labels; privacy-first local processing.

OpenCV

LBPH

Haar

Computer Vision

Get in Touch

I'm available for freelancing.