0
Years in production AI/ML engineering
Application — CERN IT-CD-PI-2026-63-LD · Machine Learning Engineer
5+ years shipping production AI/ML services — model training, serving, and LLM/agent systems — with a prior CERN collaboration on Rucio. Ready to bring enterprise-scale MLOps and open-source depth to CERN's next-generation ML platform.
0
Years in production AI/ML engineering
0
Users impacted by shipped AI tooling
0
Cost reduction on optimized inference
0
Models fine-tuned & quantized for production
About
My career bridges two worlds: I've built patent-pending AI systems at JPMorganChase under strict enterprise constraints, while also contributing to CERN's open-source ecosystem through Google Summer of Code (Rucio, 2022). That dual perspective — shipping at scale and collaborating in open-source scientific communities — is exactly what this role demands.
Full ownership from model training and HPO through quantization (ONNX/AWQ/GPTQ) to GPU-optimized inference at 100+ tok/s.
Kubernetes-native deployments, CI/CD pipelines, observability dashboards, and release governance for 250K+ user workloads.
Enterprise RAG/CAG architectures, agentic frameworks, evaluation suites, and drift controls — patent-pending work.
Reference architectures, unified documentation (Google Season of Docs), and cross-team collaboration with researchers and engineers.
Why CERN, Why Me
Featured Work
Re-architected the Rucio WebUI with CERN collaborators to improve scientific data management usability for ATLAS and CMS experiments, handling petabyte-scale data movement and long-lived operational workflows.
Built a patent-pending multimodal language agent framework with LLM/RAG/CAG patterns, secure multi-tenant access, and productionized services for internal workflows.
Fine-tuned and quantized multiple LLM/SLM models with ONNX, AWQ, and GPTQ, deploying GPU-optimized inference services for large-context, high-throughput usage.
Delivered an evaluation dashboard with LLM-as-a-Critic, self-reflection, and black-box methods for adaptive thresholding, quality tracking, and lifecycle governance.
Technical Skills
Tools & Technologies
Experience
Progression from core engineering to senior AI platform ownership:
Led delivery of multimodal agentic AI systems, LLM evaluation frameworks, and GPU-optimized inference pipelines, including serving reliability, lifecycle observability, and release controls.
Contributed core features to the Rucio WebUI — the scientific data management system used by LHC experiments including ATLAS and CMS. Collaborated directly with CERN scientists and engineers on re-architecting web interface components for petabyte-scale data workflows. Worked within CERN's open-source development model, participating in code reviews, community discussions, and upstream contribution practices.
Authored and consolidated developer documentation across multiple open-source repositories (Vocabulary, Vue-Vocabulary, Fonts), improving onboarding and contributor productivity with a unified documentation site.
Education
M.S. Computer Science
Georgia Institute of Technology
2025
B.E. Information Science & Engineering
Ramaiah Institute of Technology
2017–2021 · CGPA 9.6/10 · Best Outgoing Student
Languages
Publications, Talks & Credentials
Book chapter published in the "Nature Inspired Computing for Data Science" volume series.
ACM CCS Conference — Studying third-agent intervention to reduce harmful influence in LLM-based agent conversations.
IEEE CCEM Poster — Intelligent clinical note generation system reducing physician documentation burden.
Authored implementation guides and unified docs for Vocabulary, Vue-Vocabulary, and Fonts.
Validated cloud and deployment foundations for building scalable, production AI/ML systems.
Awarded for excellence in delivery and leadership while shipping high-impact AI capabilities.