R
I
P
U
N
J
A
Y

S
I
N
G
H

I Don't Build
Wrappers. I Build
Infrastructure.

Production multi-agent systems. Custom LoRA-tuned LLMs. Distributed inference pipelines processing 10,000+ daily requests. I architect the AI infrastructure that enterprise clients trust to ship.

▹Multi-Agent Systems

LangGraph, LangChain, LlamaIndex

▹Custom LLM Training

LoRA, vLLM, KServe, MLflow

▹Cloud & DevOps

AWS, Kubernetes, Terraform, Docker

▹Production Backend

FastAPI, Neo4j, Redis, Celery

The Problem

Your AI Is Held Together With
API Calls & Prayers.

Most “AI products” are thin wrappers around OpenAI. One rate limit, one policy change, one outage — and your entire system folds. I build the opposite: custom-trained models you own, multi-agent orchestration that self-heals, and infrastructure that scales without a single vendor lock-in.

0%Validation Accuracy

-0%Inference Cost

0K+Daily API Requests

0%Faster Inference

❌ What most do

openai.chat.completions.create()

Vendor-locked. No fallback. No ownership.

↓

✓ What I build

LoRA → vLLM → KServe → K8s → Prometheus

Custom models. Your data. Your infrastructure. Zero lock-in.

Your stack vs. mine

Proof of Work

Built. Shipped. In Production.

Not concept demos. Not hackathon prototypes. These systems process real data, serve real users, and run 24/7 without supervision.

LIVE — processing documents

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

[Agent: DocProcessor] → Extracting fields...

[Agent: Normalizer] → Standardizing format...

[Agent: Validator] → Cross-referencing rules...

[Agent: RepairBot] → Auto-correcting entry...

[Agent: Reporter] → Generating audit trail...

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

✓ Document validated — 95% confidence

150+ reports generated today

Autonomous Document Validation

Visa2fly — Production System

LangGraph-powered multi-agent system that autonomously validates complex visa documents across global immigration rules. Seven specialized AI agents work in concert — processing, normalizing, validating, repairing, and reporting — without human intervention.

› 95% validation accuracy
› 100+ concurrent requests daily
› 7 orchestrated AI agents
› 150+ audit reports per day

LangGraphFastAPINeo4jRedisCeleryMLX-VLM

Custom LLM Training & Serving

Inference Cost Elimination

Stop burning API credits. Domain-specific language models fine-tuned with LoRA on proprietary travel and visa datasets, served on an optimized vLLM/KServe stack with intelligent caching via LMCache. Your model, your data, your infrastructure.

-70%Inference Cost

+40%Faster Response

LoRAvLLMKServeMLflowLMCacheKubernetes

Before: OpenAI API$12,000/mo

After: Custom vLLM$3,600/mo

$8,400saved per month

Query

Rules

Docs

Context

Response

Context-Aware RAG System

Neo4j Knowledge Graph

Retrieval-Augmented Generation pipeline powered by a Neo4j knowledge graph storing 10,000+ validation rules and document relationships. Graph-based context retrieval delivers precise, regulation-compliant answers — not hallucinations.

› 10,000+ validation rules indexed
› Graph-based semantic retrieval
› Vector + Knowledge Graph hybrid search
› Regulatory compliance guaranteed

Neo4jQdrantLangChainFastAPIRedis

Production Infrastructure

Ship It.
Keep It Alive.

Building is 10% of the work. The other 90% is keeping it alive under load. Automated CI/CD, real-time observability, auto-scaling infrastructure, and zero-downtime deployments — all battle-tested at scale.

production.log

> Initializing production cluster...> Connecting to observability stack..._

Daily Requests10K+

Latency P9942ms

Uptime99.9%

Active Agents7

Celery Workers8/8

Daily Reports150+

DockerKubernetesNginxJenkinsTerraformPrometheusGrafanaFlowerSupervisordAWS

The Journey

Ripunjay Singh

AI Engineer & Systems Architect

Experience

Visa2fly·AI Engineer

Dec 2024 → Present

Multi-Agent Systems, Custom LLMs, Production ML

Visa2fly·Backend Engineer

Dec 2023 → Nov 2024

SpringBoot, Microservices, CI/CD

Visa2fly·SpringBoot Intern

Jun 2023 → Nov 2023

API Development, Agile

Education

Bennett University·B.Tech Computer Science

2022 → 2026

8.54 CGPA

Recognition

MLH Hackathon Winner — Asia-Pacific

AWS Certified Cloud Practitioner

2 Research Publications in Distributed AI

Microsoft Learn Student Ambassador

GitHub LinkedIn Email

Lets work together

Let's Build Something
That Ships.

You've seen the infrastructure. I architect custom, production-grade AI systems — multi-agent pipelines, fine-tuned LLMs, and scalable cloud infrastructure. Open to relocation and remote opportunities worldwide.

Let's Talk Download Resume

GitHub LinkedIn X / Twitter singhripunjay09@gmail.com

0K+Daily API Requests

0AI Agents in Prod

AWSCertified

ZeroVendor Lock-in

RIPUNJAY SINGH

I Don't Build Wrappers. I Build Infrastructure.