Available for US startups · Remote

Parbhat Kapila · Full-Stack Engineer

Building
Production AI
Systems

Full-stack engineer focused on internal AI tools, RAG infrastructure, and data-heavy SaaS. I ship from zero to live product, then keep it fast, cheap, and reliable under real traffic.

10K+
Documents
99.9%
Uptime
3+
Years
Parbhat Kapila - Full Stack Engineer
Full-Stack Engineer — Featured Work

Production AI Systems

Full-stack & AI projects serving real users, with measurable business impact.

Pipeline Intelligence· Know which deals are already slipping away

Sentinel

Detects deals that are starting to stall before it's visible in a CRM. It models time decay, stage velocity, and engagement signals from live pipeline data. Fast, explainable, and designed for real integration load.

<250ms
Query latency
Live sync
CRM & calendar sync
Predictive
Explainable risk scoring
99.9%
Continuous uptime
Next.jsTypeScriptPostgreSQLPrismaRedisOpenRouterWebhooks
Engineering Infrastructure· 75% reduction in onboarding time for engineering teams

RepoDocs

Automated code documentation system processing 200+ repositories and 100K+ LOC. Reduced onboarding time by 75% with 92% relevance accuracy, serving engineering teams at scale.

<1s
Query latency
TOP-5
deterministic retrieval
~75%
Onboarding time saved
100%
Citation-backed answers
Next.jsTypeScriptPostgreSQLpgvectorGeminiOpenRouterGitHub APIStripe
Enterprise AI Platform· 50–80% AI cost savings in production

Visura

Knowledge operations system processing 10k+ documents with 94%+ accuracy. Reduced processing costs significantly through intelligent architecture, achieving 50–80% AI cost savings via chunk reuse.

94%+
Accuracy
10k+
Documents
50-80%
AI cost saved
SUB-3s
Processing (P50)
Next.jsTypeScriptLangChainGPT-4pgvectorPostgreSQLRedis
Tech Stack

Technologies & Tools

Frontend (Product UI)

TypeScript, React, Next.js (App Router), Tailwind CSS

Backend & APIs

Node.js, Python, FastAPI, Express.js, REST APIs, WebSockets

AI Systems (Production)

OpenAI / GPT-4, LangChain, RAG pipelines, pgvector, embedding search, LLM orchestration

Data & Infrastructure

PostgreSQL, Redis, Object Storage (S3), queues / async processing

Cloud & Deployment

AWS (EC2, S3, RDS), Docker, Vercel, CI/CD (GitHub Actions)

Architecture & Practices

Multi-tenant SaaS, distributed systems, event-driven design, system design, performance optimization

About Parbhat Kapila

Full-Stack & AI Expertise & Impact

System Architecture

Designing scalable, production-grade systems from the ground up. Built multi-tenant SaaS architectures with isolated data models, auto-scaling infrastructure for real production workloads, and cost-optimized deployments driven by architectural tradeoffs, reducing infrastructure spend by 95% without sacrificing reliability.

AI/ML Production

Productionized LLM and RAG systems backed by vector databases at scale. Built retrieval pipelines processing 10,000+ documents with 94%+ accuracy, optimized pgvector queries for sub-200ms latency, and implemented multi-provider LLM orchestration with GPT-4 and resilient fallback strategies.

Performance & Optimization

Driving measurable business impact through technical optimization. Reduced per-document processing costs from $5.00 to $0.05 through architectural changes and chunk reuse, achieved sub-200ms semantic search latency under load, and maintained 99.9% uptime across live production systems.

Full-Stack Ownership

Building and operating production systems used by real users daily. Independently responsible for technical decisions, feature delivery, deployments, monitoring, and post-launch reliability across TypeScript, Next.js, Python, PostgreSQL, Redis, AWS, and Vercel. Owning systems from first commit through live operation.

I'm an AI-focused full-stack engineer building production systems used by real teams. Over the past three years, I've shipped and operated live products handling large data volumes and reduced operational costs by 95%.

I specialize in turning complex AI pipelines into reliable software retrieval, vector storage, and model orchestration optimized for low latency (sub-200ms), high accuracy (94%+), and real production constraints.

I'm seeking a full-time role at a startup where execution matters and engineers are trusted to ship systems that deliver measurable business value.

Professional Journey

Full-Stack & AI Engineer Experience

May 2022 - Present

AI Full-Stack Developer | Product Builder

Independent Product Development

Built, shipped, and operated multiple production AI products used by real teams. Took full responsibility for system design, feature delivery, reliability, and iteration driven by live usage across independently run SaaS applications.

Owned backend services, data stores, AI pipelines, and deployment infrastructure, including authentication, payments, and third-party integrations. Debugged production incidents, performance bottlenecks, and scaling limits while shipping improvements continuously without breaking live systems.

Next.jsTypeScriptPythonPostgreSQLRedisOpenAIpgvectorDockerAWS
Contact

Let's Work Together

Open to full-time remote engineering roles at US startups building production AI systems. Best fit for teams that value ownership, speed, and engineers who ship and maintain what they build. Comfortable aligning with US time zones and working directly with founders in fast-moving environments.

Let's build together

Available for roles & projects

Book