Ayush Sharma Portfolio

CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS • CAUTION DO NOT CROSS •

SUSPECT PROFILE

CLASSIFIED

6 5 4

SHARMA, AYUSH

SUBJECT INFORMATION

NAME:Ayush Sharma

ALIAS:AI Engineer

LAST SEEN:Boston, MA

CURRENT AFFILIATION:ReSupply (AI Engineer) since 06/2025

CLEARANCE:M.S. in Artificial Intelligence, Boston University

KNOWN FOR:Building end-to-end AI systems for real-world workflows like pricing, logistics, HR, voice/chat support, etc. to improve efficiency and business outcomes, with prior research experience in computer vision and LLM systems.

KNOWN COMMUNICATION CHANNELS

✉ ayushsharmacorp@gmail.com EMAIL in linkedin.com/in/ayush025 LINKEDIN </> github.com/iamcalledayush GITHUB

VERIFIED EMPLOYMENT RECORDS • PROFESSIONAL HISTORY LOGGED • VERIFIED EMPLOYMENT RECORDS • PROFESSIONAL HISTORY LOGGED • VERIFIED EMPLOYMENT RECORDS • PROFESSIONAL HISTORY LOGGED • VERIFIED EMPLOYMENT RECORDS • PROFESSIONAL HISTORY LOGGED • VERIFIED EMPLOYMENT RECORDS • PROFESSIONAL HISTORY LOGGED • VERIFIED EMPLOYMENT RECORDS • PROFESSIONAL HISTORY LOGGED •

EVIDENCE BOARD

EXPERIENCE FILES

Subject's Work Experience — Click any case file to examine further

#001

THE RESUPPLY FILE

AI Engineer @ ReSupply | Boston, MA | June 2025 - Present

ONGOING

THE RESUPPLY FILE

AI Engineer @ ReSupply | Boston, MA | June 2025 - Present

AI Voice Agents

$60K/mo Saved, AI Handles 20K+ Monthly Calls

Built and deployed multi-model voice agents with live safety guardrails to fully automate customer and driver support calls.

RAG + Eval Pipeline

Custom retrieval system powering voice agents' knowledge, with automated call quality grading via dual-model evaluation.

Intent Classification

7K+ Automatable Tickets, 1K+ Misroutes - 30% Workload Cut using LLM Intent Tagging and Clustering

Clustered 25K+ monthly support conversations to identify which tickets AI agents could handle, eliminating the need for additional hires.

Resume-Ranking Tool

Improved ReSupply’s HR screening process by building an internal resume-ranking tool with Breezy ATS automation, Playwright, and LLM-based evaluation pipeline for resume retrieval, location filters, and custom JD-based comparative candidate ranking.

Geospatial Optimization

Automated dispatch routing and zone recommendations for haulers using MapBox travel-time isochrones and ZIP code patterns.

Pricing Model

Improved company's pricing algorithm by combining ML models with economic modeling using demand elasticity, Producer Price Indexing, and rigorous A/B testing.

Semantic Item DB

0.82 Top-1, 0.79 Weighted Top-5 similarity, 0.017s Latency

Built a semantic search database for item retrieval that powers recommendations, autocomplete, and query handling across user-facing features.

TOOLS AT THE SCENE:

Pipecat, Twilio, Deepgram, ElevenLabs, OpenAI API (GPT 4, o3), Llama Guard 3, Transformer Embeddings (GTE-large, E5-Large-v2), FAISS, LLM-as-a-judge Evaluation, Google Gemini API (2.5 family), Intent Classification, UMAP, HDBSCAN

#002

THE NSF RESEARCH FILE

AI Researcher @ National Science Foundation, Boston University | Boston, MA | Oct 2023 - May 2025

UNDER REVIEW

THE NSF RESEARCH FILE

AI Researcher @ National Science Foundation, Boston University | Boston, MA | Oct 2023 - May 2025

DNS Domain Names Prediction

ACM SIGCOMM 2026 Submission | Advisor: Mark Crovella

Fine-tuned LLMs and trained ML models to predict unknown DNS domain names using reinforcement learning with a structured reward schema optimizing for novelty and out-of-distribution generalization.

MiniARC AGI SOTA

Advisor: Iddo Drori

Surpassed the previous MiniARC benchmark (33.1%) on abstract reasoning by combining re-ranking, meta-learning, and multimodal approaches to advance human-level AI reasoning, achieving 41.81% accuracy.

TOOLS AT THE SCENE:

Open Source GPT models, Ollama, Reinforcement Learning, VERL framework, CUDA, Multiprocessing, Data Augmentation, Meta Learning, Multimodal Reasoning, AlphaCode, Prompting Techniques

#003

THE SCHNEIDER ELECTRIC FILE

ML Intern @ Schneider Electric | Bangalore | Jan 2023 - Jul 2023

CLOSED

THE SCHNEIDER ELECTRIC FILE

ML Intern @ Schneider Electric | Bangalore | Jan 2023 - Jul 2023

Global Data Housekeeping Project

Built automated data-cleaning and validation pipelines that cut cloud migration time by 80% across 11.25 million records.

Ticket Classification

Automated the classification of incoming complaint and request tickets using deep learning NLP models.

TOOLS AT THE SCENE:

NumPy, Pandas, SQL, Transformer Models (BERT and RoBERTa Embeddings), LSTM Models

#004

THE JP MORGAN FILE

SWE Intern @ JP Morgan | Bangalore | Jul 2022 - Aug 2022

CLOSED

THE JP MORGAN FILE

SWE Intern @ JP Morgan | Bangalore | Jul 2022 - Aug 2022

Stock Visualization

Built an interactive visualization page for analyzing stock market data across 50,000+ data points.

Prediction Model

Improved stock price prediction accuracy from 78.5% to 83.8% by applying recurrent neural network and LSTM models to processed market data.

TOOLS AT THE SCENE:

Python, NumPy, Pandas, Matplotlib, Sequence Models (Recurrent Neural Networks, LSTM models)

#005

THE PUNJAB NATIONAL BANK FILE

ML Intern @ PNB | India | May 2021 - Aug 2021

CLOSED

THE PUNJAB NATIONAL BANK FILE

ML Intern @ Punjab National Bank | Jhansi | May 2021 - Aug 2021

Customer Segmentation

Clustered banking customers by transaction patterns and demographics to optimize personalized loan and credit card offers.

Fraud Detection

Trained a gradient-boosted model (XGBoost) to flag anomalous transactions, improving fraud detection rate by 20%.

TOOLS AT THE SCENE:

Python, Pandas, NumPy, Scikit-learn, XGBoost, K-Means Clustering, Customer Segmentation, Model Evaluation

AUTHORIZED PERSONNEL ONLY • TECHNICAL INVENTORY • AUTHORIZED PERSONNEL ONLY • TECHNICAL INVENTORY • AUTHORIZED PERSONNEL ONLY • TECHNICAL INVENTORY • AUTHORIZED PERSONNEL ONLY • TECHNICAL INVENTORY • AUTHORIZED PERSONNEL ONLY • TECHNICAL INVENTORY • AUTHORIZED PERSONNEL ONLY • TECHNICAL INVENTORY •

WEAPONS INVENTORY

CONFISCATED

Subject's Weapons and Tools — On Record

AI SYSTEMS

LLM APIs and SDKs (OpenRouter, Anthropic Claude, OpenAI, Google Gemini) RAG AI Guardrails and Safety Transformers and Embedding Models Multimodal Systems Fine-Tuning Reinforcement Learning Techniques Data Synthesis Prompt Engineering Techniques Diffusion Models

FRAMEWORKS AND LIBRARIES

LangChain (certificate) LangGraph Pipecat Deepgram ElevenLabs Ollama PyTorch TensorFlow Scikit-Learn Hugging Face AutoGen for Multi-Agent Systems (certificate) NumPy Pandas Matplotlib FAISS

LANGUAGES AND PRODUCTION TOOLS

Python SQL FastAPI REST APIs Docker Git AWS Playwright Streamlit Render Gradio

SUBJECT OPERATIONS • CLASSIFIED PROJECT FILES • SUBJECT OPERATIONS • CLASSIFIED PROJECT FILES • SUBJECT OPERATIONS • CLASSIFIED PROJECT FILES • SUBJECT OPERATIONS • CLASSIFIED PROJECT FILES • SUBJECT OPERATIONS • CLASSIFIED PROJECT FILES • SUBJECT OPERATIONS • CLASSIFIED PROJECT FILES •

KNOWN OPERATIONS

EXECUTED

Subject's Personal Projects

OP-001

Logical Reasoning Evaluation of LLMs

Tech Stack: Python, PyTorch, Hugging Face, T5, GPT-2, GPT-Neo, GPT-3.5, CUDA Parallel Computing, Cloud GPUs

Built a 105-task syllogism reasoning dataset called LogiSphere. Fine-tuned and adversarially tested multiple LLMs, exposing common failures in pattern learning, hallucinations, and logical reasoning.
Results - GPT-3.5: 63% • GPT-Neo: 47% • GPT-2 XL: 24%, GPT-2 L: 19%, GPT-2 S: 10%, T5: 7%.

GitHub ↗

OP-002

AI Research Companion

Tech Stack: Python, OpenAI GPT-4o-mini, LangChain, FAISS Vector DB, MPNet Sentence Transformer Embeddings, Hugging Face, RAG, Streamlit, PyPDF2

Built and deployed an AI assistant using GPT-4o-mini, FAISS, and Hugging Face all-mpnet-base-v2 embeddings to let users upload and query multiple research papers and arXiv links through conversational RAG, contextual memory, and LaTeX Math rendering.

GitHub ↗ Live Demo ↗

OP-003

Debt Collection Analytics for WGBH (Team Lead)

Debt Collection Analytics for WGBH

Tech Stack: Python, SQL, PyMySQL, Pandas, NumPy, Matplotlib, Seaborn, Statistical Data Analysis, Data Visualization

Led a team project for WGBH, a public radio station in Boston, MA, analyzing Massachusetts debt-collection court data using SQL and Python to uncover trends across 177,248+ debt-collector court cases, 233,074 Capias warrant cases, and 2,912 wage-garnishment cases. Identified major debt-collection entities, litigation patterns, mortgage-servicing leaders, and state/country-wise debt-collector distributions through large-scale statistical analysis and visualization workflows.

GitHub ↗

OP-004

Text-Guided Video Generation with Diffusion Models

Tech Stack: Python, PyTorch, Stable Diffusion, ControlNet, LoRA, OpenCV, Gradio, Streamlit, CLIP, ResNet-18, Optical Flow, Ebsynth, CUDA

Built a text-guided video-to-video generation pipeline using diffusion models to transform input videos into stylized, temporally consistent outputs. Extended the Rerender-A-Video framework with adaptive key-frame sampling, optical-flow/frame-difference analysis, CLIP-based prompt-image evaluation, cross-frame attention, FreeU, and Ebsynth-based frame propagation to reduce flickering and improve visual coherence across generated videos.

GitHub ↗ Detailed Report ↗

OP-005

Autonomous Driving Car Detection System

Tech Stack: Python, YOLO Object Detection, OpenCV, TensorFlow, Deep Learning, Computer Vision, CNNs, Image Processing, Autonomous Driving Perception, Bounding Box Detection, Real-Time Detection Pipelines

Built an autonomous driving perception system using YOLO-based object detection and computer vision techniques to identify vehicles and road objects in real time from driving footage, enabling accurate bounding-box localization and multi-object traffic scene understanding for self-driving applications.

GitHub ↗

OP-006

Animal Image Generation with GANs

Tech Stack: Python, PyTorch, GANs, BigGAN, Normalization Techniques, Self-Attention Mechanisms, Residual Networks (ResNet Blocks)

Built and optimized GAN and BigGAN architectures in PyTorch to generate realistic dog images using the Stanford Dogs dataset, implementing techniques such as spectral normalization, self-attention, conditional batch normalization, and residual networks to reduce mode collapse and image artifacts. Improved image fidelity significantly across training iterations under constrained Kaggle GPU resources.

GitHub ↗ Detailed Report ↗

OP-007

Style Transfer Art Generator

Tech Stack: Python, TensorFlow, Keras, VGG-19, Streamlit, NumPy, Pillow, Matplotlib, CNNs, Transfer Learning, Neural Style Transfer

Built a Neural Style Transfer model that takes in a content image and a style image, then generates a new artwork by preserving the content structure while transferring artistic style features using a VGG-19 CNN. Implemented content loss, style loss with Gram matrices, multi-layer style extraction, and iterative image optimization over 500 epochs to produce stylized generated images.

GitHub ↗

OP-008

Jazz Music Generator

Tech Stack: Python, Keras, TensorFlow, LSTM Networks, RNNs, NumPy, Music21, MIDI Processing, Sequence Modeling, Deep Learning

Built a jazz music generation system using an LSTM-based sequence model trained on a corpus of jazz music to learn note/chord patterns and generate new MIDI-style jazz compositions that sound like a full-band performance.

GitHub ↗

ACADEMIC RECORDS • VERIFIED TRAINING HISTORY • ACADEMIC RECORDS • VERIFIED TRAINING HISTORY • ACADEMIC RECORDS • VERIFIED TRAINING HISTORY • ACADEMIC RECORDS • VERIFIED TRAINING HISTORY • ACADEMIC RECORDS • VERIFIED TRAINING HISTORY • ACADEMIC RECORDS • VERIFIED TRAINING HISTORY •

TRAINING DOSSIER

VERIFIED

Where was the subject trained?

Boston University

M.S. Artificial Intelligence with a Master's Thesis | 2023 – 2025

Coursework: Machine Learning, Deep Learning, Natural Language Processing, Computer Vision, Artificial Intelligence, Data Science, Metrics and Evaluation in NLP and LLMs, Directed Reasearch Study

Shri Mata Vaishno Devi University

B.Tech Computer Science | 2019 – 2023

Coursework: Neural Networks and Fuzzy Sets, Artificial Intelligence, Machine Learning, Soft Computing, Digital Image Processing, Nature Inspired Algorithms, High Performance Computing, Engineering and Discrete Mathematics

ACTIVE COMMUNICATION CHANNELS • SUBJECT REMAINS AVAILABLE FOR CONTACT • ACTIVE COMMUNICATION CHANNELS • SUBJECT REMAINS AVAILABLE FOR CONTACT • ACTIVE COMMUNICATION CHANNELS • SUBJECT REMAINS AVAILABLE FOR CONTACT • ACTIVE COMMUNICATION CHANNELS • SUBJECT REMAINS AVAILABLE FOR CONTACT • ACTIVE COMMUNICATION CHANNELS • SUBJECT REMAINS AVAILABLE FOR CONTACT •

WANTED

AYUSH SHARMA

For building AI systems that make manual repetitive processes obsolete

To establish communication with this subject:

✉ in </>

END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE • END OF DOSSIER • SUBJECT STILL ACTIVE •