Work

ADVEX AI

, Founding ML Engineer

, Dec '24 - Mar '25, San Francisco, CA

Generated high quality synthetic data (images / masks) using diffusion models to solve low data regime / out-of-distribution computer vision tasks.

  • Owned engineering and research initiatives across the entire pipeline, from better LoRA training for SDXL to automated captioning with VLMs to deployment infrastructure to full stack development of the self-serve platform
  • Successfully delivered self-serve and bespoke offerings to clients, including multi billion dollar companies in logistics and manufacturing in the US and abroad

THERAP SERVICES

, Member of Technical Staff, AI Research

, Jul '24 - Nov '24, Remote

Directed AI strategy and led two research initiatives at a leading electronic health record (EHR) provider serving individuals with intellectual and developmental disabilities (I/DD), working directly with the C suite.

  • Architected pipeline, trained initial models, and validated scalability of a LLM-based tool to classify and extract data from natural language logs from caretakers using the Therap platform, using Llama and BERT to handle 30 requests/second on average. Patented and featured at the Therap National Conference
  • Prototyped an on-device VLM-based event log generation for cameras placed in homes of I/DD individuals to improve monitoring and safety while maintaining individual privacy, generating text-based logs to ship off device (also patented)

WASHINGTON UNIVERSITY

, Head Teaching Assistant

, Jan '22 - May '24, St. Louis, MO

Led TA efforts for Advanced Machine Learning (SP24), Theory of Machine Learning (FL23-SP24), and Data Science (FL21-FL22).

  • Held office hours, managed team of ~30 TAs for a ~225 student class, reworked curriculum, occassionally lectured; received departmental award for excellence

SQUARE

, Software Engineering Intern

, June '23 - Aug '23, San Francisco, CA

Designed and revitalized high throughput critical infrastructure for Square Banking.

  • Pioneered effort to deconstruct legacy service responsible for all balance tracking at Square, designed new architecture and data model
  • Led an intern hack week team to develop an internal AI service-specific research agent, using entirely local LLMs (Nous-Hermes-Llama2-13b) and RAG with a handrolled vector DB

WINTICS

, ML Engineering Intern

, February '23 - Apr '23, Paris, France

Trained on-camera computer vision models to generate analytics for Smart Cities.

  • MaskRCNN / ConvNeXT for real-time boundary detection / classification of pedestrians, cyclists, etc in frame as well as clothing type and coloration. Deployed to embedded devices on client cameras (PyTorch, ONNX, TensorRT), including use at the Paris Olympics
  • All work done in French

MICROSOFT

, ML Engineering Intern

, June '22 - Aug '22, Seattle, WA

Researched and implemented deep learning methods for anomaly detection in high-dimensional time series data generated by large scale distributed systems.

  • Pioneered ML work for the Athena team to diagnose anomalies within the Azure cloud, detecting outages before they occur and finding the root cause (patented)
  • Trained novel graph attention model to process 10,000+ time-series emitted per team (.95 F1), and deployed end-to-end pipeline in Azure ML to client teams
  • First place team at the annual intern Puzzle Hunt (à la MIT Mystery Hunt) out of 1300 entrants

4GIVING

, Software Development Intern

, June '21 - Aug '21, Minneapolis, MN

Created an AI agent to generate first fundraisers and automate profile creation for client non-profits.

  • Fine-tuned GPT3 with existing platform data to create sample fundraisers (titles and descriptions), using web-scraping to effectively prompt the model with non-profit info (e.g. mission statement, social media, etc)
  • Developed full stack webapp to package the experience and make onboarding seamless