ADVEX AI, Founding ML Engineer, Dec '24 - Mar '25, San Francisco, CA
Generated high-quality synthetic data (images / masks) using diffusion models to solve low-data-regime / out-of-distribution computer vision tasks.
- Owned engineering and research initiatives across the entire pipeline, from better LoRA training for SDXL to automated captioning with VLMs to deployment infrastructure to full-stack development of the self-serve platform
- Delivered self-serve and bespoke offerings to clients, including multibillion-dollar companies in logistics and manufacturing in the US and abroad
THERAP SERVICES, Member of Technical Staff, AI Research, Jul '24 - Nov '24, Remote
Directed AI strategy and led two research initiatives at a leading electronic health record (EHR) provider serving individuals with intellectual and developmental disabilities (I/DD), working directly with the C-suite.
- Architected the pipeline, trained initial models, and validated the scalability of an LLM-based tool that classifies and extracts data from natural-language logs written by caretakers on the Therap platform, using Llama and BERT to handle 30 requests/second on average. Patented and featured at the Therap National Conference
- Prototyped on-device VLM-based event log generation for cameras placed in the homes of individuals with I/DD to improve monitoring and safety while maintaining individual privacy, generating text-based logs to ship off device (also patented)
WASHINGTON UNIVERSITY, Head Teaching Assistant, Jan '22 - May '24, St. Louis, MO
Led TA efforts for Advanced Machine Learning (SP24), Theory of Machine Learning (FL23-SP24), and Data Science (FL21-FL22).
- Held office hours, managed a team of ~30 TAs for a ~225-student class, reworked the curriculum, and occasionally lectured; received a departmental award for excellence
SQUARE, Software Engineering Intern, Jun '23 - Aug '23, San Francisco, CA
Designed and revitalized high-throughput, critical infrastructure for Square Banking.
- Pioneered the effort to deconstruct the legacy service responsible for all balance tracking at Square; designed a new architecture and data model
- Led an intern hack week team to develop an internal AI service-specific research agent, using entirely local LLMs (Nous-Hermes-Llama2-13b) and RAG with a hand-rolled vector DB
WINTICS, ML Engineering Intern, Feb '23 - Apr '23, Paris, France
Trained on-camera computer vision models to generate analytics for Smart Cities.
- Mask R-CNN / ConvNeXt for real-time boundary detection / classification of pedestrians, cyclists, etc. in frame, as well as clothing type and coloration. Deployed to embedded devices on client cameras (PyTorch, ONNX, TensorRT), including use at the Paris Olympics
- All work done in French
MICROSOFT, ML Engineering Intern, Jun '22 - Aug '22, Seattle, WA
Researched and implemented deep learning methods for anomaly detection in high-dimensional time series data generated by large-scale distributed systems.
- Pioneered ML work for the Athena team to diagnose anomalies within the Azure cloud, detecting outages before they occur and finding the root cause (patented)
- Trained a novel graph attention model to process 10,000+ time series emitted per team (0.95 F1), and deployed an end-to-end pipeline in Azure ML to client teams
- First place team at the annual intern Puzzle Hunt (à la MIT Mystery Hunt) out of 1300 entrants
4GIVING, Software Development Intern, Jun '21 - Aug '21, Minneapolis, MN
Created an AI agent to generate first fundraisers and automate profile creation for client non-profits.
- Fine-tuned GPT-3 with existing platform data to create sample fundraisers (titles and descriptions), using web scraping to prompt the model with non-profit info (e.g. mission statement, social media)
- Developed a full-stack web app to package the experience and make onboarding seamless