About — Aevyra

Why I built Aevyra

Over the past year, I kept running into the same problem across different parts of the stack — evaluation, serving, prompt optimization, agent tracing. The models are good. The tooling around using them well in production isn't. Aevyra is my attempt to build what's missing.

Experience

Applied AI Engineer

Meta · 2022 – Mar 2026

Started on TorchServe — #4 all-time contributor, 177+ merged PRs. Led integration of TensorRT-LLM, vLLM, torch.compile, and torch.export. Architected a 75% cost reduction in multi-model GPU inference. Established TorchServe as the industry standard for PyTorch serving at scale (4.4K+ stars).

Moved to the Enterprise Llama team deploying custom Llama solutions across organizations in different verticals — fine-tuning on proprietary datasets, data and model distillation, architectural customizations. Led technical strategy for Llama adoption across Meta's enterprise and partner ecosystems. Speaker at Google Cloud Next '25. Active contributor to Llama Cookbook (18.2K+ stars).

Computer Vision Engineer

LG Electronics · 2018 – 2021

Prototyped production computer vision systems — instance segmentation, object detection and tracking, pose estimation, gesture recognition. Built the applied ML competency that translated directly into LLM inference work at Meta.

Software Engineer

Cisco Systems · 2013 – 2018

Layer 2 forwarding in Cisco's Nexus 7000 and 9000 data center switches. Built container telemetry infrastructure using Docker, Contiv, and Kibana. Early work at the intersection of systems engineering and machine learning for network serviceability.

Design Engineer

Texas Instruments · 2006 – 2011

RTL verification and design for image processors and graphics processors in a 3G modem SoC. The hardware foundation that informs how I think about performance and systems today.

Core skills

Llama · LLM Fine-tuning · Model Distillation · PyTorch · torch.compile · torch.export · TorchServe · Inference Optimization · TensorRT-LLM · vLLM · RAG · Production ML Systems · Agent Tracing · LLM Evaluation · Prompt Optimization · Open Source

Get in touch

If you're deploying agents in production and hitting walls — with tracing, evaluation, debugging, or knowing which model is actually right for your task — I'd like to hear about it.

LinkedIn → agunapal [at] aevyra.ai
