Stop building demos.
Start shipping agents.
A 12-week live cohort that takes you from “I’ve never shipped an LLM app” to building, evaluating, and deploying production agents — taught by a Fortune-500 Principal AI Architect who ships them for a living.
Early-bird from ₹39,999 · no-cost EMI · 7-day refund · waitlist gets the best price.
The 2026 stack you’ll master
Not theory. Receipts.
Taught by someone who actually ships agents in production — at Fortune-500 scale.
Three things every other course skips
Cheap recorded courses are 12 months out of date. ₹1L “masterminds” put 400 people in a room. This is built around the parts that actually get you hired.
Demos are easy. Production is the job.
Anyone can wire up a demo in a weekend. We teach the 80% nobody else covers: evals so you know it works, observability so you know when it breaks, guardrails so it doesn’t get jailbroken, and cost engineering so it doesn’t bankrupt you.
2026 stack, not 2024 reruns.
Most courses are 12+ months behind. We teach the Nov-2025 MCP spec, the Claude Agent SDK & Skills, A2A v1.0 (now under the Linux Foundation), LangGraph 1.x, deep-research & computer-use agents, and reasoning models in agent loops — version-pinned and refreshed every cohort.
Every session ends with code shipped.
No 3-hour lecture marathons. Every live session ends with you committing working code. You don’t watch agents get built — you build them, break them, and fix them, with a Principal Architect reviewing your repo.
12 weeks, week by week
Every week ends with shippable code. Three portfolio projects build across the cohort.
01LLM Fundamentals & Context EngineeringP1▾
Call multiple providers reliably, get structured outputs, engineer context like a 2026 engineer.
- The 2026 model landscape — Claude Opus 4.7/Sonnet 4.6, GPT-5.5, open models — choosing by cost, latency & capability.
- Context engineering > prompt engineering — System prompts, 1M context windows, what to put in context vs retrieve vs tool.
- Structured outputs — JSON schema / Pydantic, tool-calling-as-extraction, why “respond in JSON” fails.
- Prompt caching & token economics — Anthropic cache breakpoints, when caching pays off, measuring it.
- Reasoning & extended thinking — When to turn it on, effort levels, and the cost trade-offs.
Lab → Build a provider-agnostic LLM client + structured-extraction pipeline with a cost report.
02Tool Use & Your First Agent LoopP1▾
Understand what an agent really is — a loop + tools + state — and build one from scratch, then in LangGraph.
- Tool / function calling — Schemas, parallel tool calls, and the agent loop demystified — built in raw Python first.
- Intro to LangGraph 1.x — Nodes, edges, state, and why graphs beat chains.
- ReAct vs plan-then-act — Tool design: idempotency, error surfaces, descriptions as prompts.
- Guarding the loop — Max steps, timeouts, and tool-error handling.
Lab → Give your assistant 3 real tools with step limits and error handling.
03RAG, Vector DBs & Retrieval That WorksP1▾
Build production RAG: hybrid search + reranking + agentic retrieval on Qdrant.
- Chunking & embeddings — Qwen3 embeddings and a sane indexing strategy.
- Qdrant in production — Collections, metadata filtering, native hybrid search + Reciprocal Rank Fusion.
- Reranking — Cross-encoder reranking and actually measuring retrieval quality.
- Agentic RAG — Let the agent decide whether/what to retrieve, multi-hop; when GraphRAG earns its keep.
Lab → Index a real corpus; hybrid + rerank; expose retrieval as an agent tool.
04Memory, Evals & Deploy → Project 1 shipsP1▾
Add memory, prove it works with evals, deploy it as a real API.
- Agent memory — Short-term state vs long-term (LangGraph store / Mem0 / Letta) — and when you actually need it.
- Evals — Why “it looks good” isn’t an answer; Ragas for RAG + DeepEval (pytest-for-LLMs).
- LLM-as-judge — The basics, plus building a small golden dataset.
- Deploying an agent — FastAPI service, streaming, env/secrets, and a Dockerfile.
Lab → Ship Project 1: deployed URL, eval suite in CI, demo video, blog post.
05LangGraph Deep Dive + Framework LandscapeP2▾
Master durable LangGraph (HITL, checkpointing) and place it among CrewAI / MAF / OpenAI SDK / ADK.
- State, checkpointing & durability — Reducers, durable execution, and replay.
- Human-in-the-loop — Interrupts, time-travel debugging, and LangGraph Studio.
- Streaming & subgraphs — Fan-out / fan-in patterns.
- Framework comparison — CrewAI, Microsoft Agent Framework, OpenAI Agents SDK, Google ADK — and why we standardize on LangGraph.
Lab → Refactor a fragile agent into a durable graph with checkpointing + human approval.
06MCP & the Claude Agent SDKP2▾
Build your own MCP server, consume MCP tools, and use the Claude Agent SDK + Skills.
- MCP spec 2025-11-25 — Tools, resources, prompts; client/server architecture; the MCP Registry.
- Build an MCP server — FastMCP in Python — expose your retrieval + a real API as MCP tools.
- New spec capabilities — Elicitation (form + OAuth/URL mode), sampling, async tasks; OAuth 2.1 for remote servers.
- Claude Agent SDK + Skills — Package instructions + scripts + resources as auto-invoked Skills.
Lab → Convert your project’s tools into an MCP server other agents can call.
07Multi-Agent Orchestration, Sub-Agents & AsyncP2▾
Design multi-agent systems that actually work — supervisor, hierarchical, delegation, long-running async.
- Supervisor / router patterns — When multi-agent beats one good agent — and when it’s over-engineering.
- Hierarchical & plan-and-execute — The deep-research-agent pattern (LangChain Deep Agents).
- Sub-agents & delegation — Context isolation per sub-agent.
- Long-running / async agents — Background tasks, task IDs, start/check/cancel; durability for hour-long runs.
Lab → Build a supervisor over 3 sub-agents with delegation + an async research task.
08A2A Protocol + Observability & AgentOps → Project 2 shipsP2▾
Make agents interoperate via A2A and instrument everything with tracing + online/offline evals.
- A2A v1.0 — Agent Cards (incl. signed cards), discovery, task lifecycle; MCP + A2A together.
- A2A interop — Expose one agent over A2A; consume an external A2A agent.
- Observability — Langfuse / LangSmith / Phoenix — trace every node, cost & latency per run.
- AgentOps — Online vs offline evals, CI regression suites, alerting on quality drift.
Lab → Ship Project 2: multi-agent system, custom MCP server, A2A interop, full observability.
09Fine-Tuning & Model Customization (full week)P3▾
Decide when to fine-tune, then do it: LoRA/QLoRA with Unsloth, quantize, serve on vLLM.
- When to fine-tune — Prompt → RAG → tools → then fine-tune; for style/format/latency/cost, not fresh knowledge.
- PEFT / LoRA / QLoRA — How they work; 2026 defaults (r=16, α=16, all-linear, DoRA); dataset construction.
- Fine-tune with Unsloth — A small open model (Llama 4 / Qwen3) on Colab/RunPod — no local GPU needed.
- Quantize & serve — GGUF / AWQ / GPTQ, then serve the adapter on vLLM; eval tuned vs base.
Lab → Fine-tune + serve a model; produce a base-vs-tuned eval report.
10Production Deployment & LLMOpsP3▾
Take an agent from notebook to a cost- and latency-bounded production service.
- Packaging — FastAPI + Docker; async, streaming, concurrency, structured logging.
- Orchestration — Kubernetes basics for LLM services — health checks, autoscaling, secrets.
- Cost engineering — Prompt caching, batching, model routing, token budgets, cost dashboards.
- SLOs — Latency budgets, P95, fallbacks/timeouts/circuit breakers — Vivek’s 50% cost-cut playbook.
Lab → Deploy a containerized agent with caching, model routing, an SLO, and a fallback.
11Voice & Multimodal + AI Safety + Frontier PatternsP3▾
Add voice/vision/document capabilities, harden against attacks, survey the frontier.
- Voice / speech-to-speech — OpenAI Realtime 2 (GPT-Realtime-2) with MCP + image input + SIP.
- Vision & document AI — Multimodal Claude/GPT-5.5; Claude Skills for Excel/PDF/Docx workflows.
- AI safety — Prompt-injection defenses, output guardrails, basic red-teaming, PII handling.
- Frontier patterns — Deep-research, computer-use & browser agents, reasoning in loops — real vs hype.
Lab → Build a voice agent calling an MCP tool; red-team a deployed agent and add guardrails.
12Capstone Ship + Demo Day + CareerP3▾
Ship Project 3, present it, and convert the cohort into a job.
- Capstone polish — README, demo video, blog post, deploy, and an eval report.
- Demo Day — Each learner presents (5 min) to the cohort + invited hiring contacts.
- The AI engineer interview — Agent system design, evals, cost, debugging; resume + LinkedIn positioning.
- Certification & next steps — Earn the credential; lifetime Discord; cohort-2 alumni perks.
Lab → Ship your capstone. Present at Demo Day. Earn the certification (if you pass).
You leave with 3 deployed projects
Not toy notebooks — portfolio-grade systems with live URLs, GitHub repos, demo videos, and blog posts. The exact thing recruiters open first.
“Smart Assistant”
A single-agent, tool-using assistant with production RAG (Qdrant hybrid + rerank), memory, an eval suite, deployed as a streaming FastAPI service.
“Agent Mesh”
A multi-agent system — supervisor + specialist sub-agents — with a custom MCP server you build, A2A interop, and full Langfuse observability.
Capstone
A production-grade agentic application in your chosen domain (e.g. GST query agent, KYC agent, multilingual support) with evals, guardrails, cost SLOs, and deployment.
Pass all three rubrics and the capstone bar to earn the certification + LinkedIn badge. It’s earned, not handed out — that’s what makes it worth something.
Vivek Tripathi
Principal AI Architect · Thomson Reuters
- ~12 years across Thomson Reuters, Deloitte & Accenture
- Architect of ACES (100+ agents) & SolutionIQ (−75% sales effort)
- $91M+ in total value delivered with AI/ML systems
- CTO Excellence Award · MCP & LangGraph certified
Why learn from Vivek
Most “AI gurus” have never shipped an agent to a single user.
Vivek builds agents that real businesses depend on. He architected ACES — 100+ specialized LangGraph agents serving 3,000+ enterprise users across APAC, EMEA, and North America — and built LLM systems that uncovered $46M+ in hidden revenue across 200+ multinational customers. He’s cut agent costs by 50% in production, won a CTO Excellence Award, and mentored 500+ engineers.
Most “AI gurus” have never shipped an agent to a single paying user. Vivek has shipped to thousands.
More than lectures
A small cohort means real access. Here’s what comes with your seat.
Earned certification
Not handed out. You must ship all 3 projects, pass the rubrics, and clear the capstone bar to earn the credential + LinkedIn badge.
Mock interviews
2 mock interviews per learner in the final weeks — agent system design, evals, cost, debugging — with feedback.
Portfolio reviews
At least one 1:1 portfolio and resume review, plus async feedback after Project 1.
Guest industry lectures
Live sessions from a LangGraph engineer, an AgentOps/eval expert, and an AI hiring manager.
Lifetime community
Permanent access to the Ship Agents Discord — alumni network, job leads, future cohort perks.
Office hours
Weekly group office hours every week of the cohort, plus 1:1 time depending on your tier.
One cohort. Two ways in.
Early-bird opens 6 July 2026 · No-cost EMI (3 & 6 months) via Razorpay · prices incl. 18% GST
Builder
Everything you need to build the portfolio.
early-bird · then ₹49,999 · $649 for international
- ✓All 12 live sessions + recordings
- ✓3 portfolio projects + rubric feedback
- ✓Weekly group office hours
- ✓Lifetime Discord community
- ✓Earned certification + LinkedIn badge
- ✓1 × 20-min 1:1 with Vivek
- ✓2 mock interviews (panel)
- ✓1 portfolio/resume review
Career
For the job-switchers who want access + career help.
early-bird · then ₹74,999 · $899 for international
- ✓Everything in Builder, plus:
- ✓3 × 30-min 1:1s with Vivek
- ✓1 extra mock interview with Vivek
- ✓Deep portfolio review + LinkedIn rewrite
- ✓Small-group mentorship pod
- ✓Priority referral intro to network (on Distinction)
7-day refund from cohort start, no questions asked. Women-in-tech & need-based scholarships available. Early-bird, EMI & scholarship seats are non-refundable.
Questions, answered
Who is this for?+
Software engineers with ~3–5 years’ experience who are comfortable in Python but have little or no prior LLM-app experience. If you can build a REST API and use Git, you’re ready.
What are the prerequisites?+
2+ years writing Python (classes, async, type hints), comfort with Git/GitHub and the command line, having built at least one REST API, and basic Docker literacy. No JS/TS, no heavy maths, no GPU required.
What’s the time commitment?+
5–6 hours/week for 12 weeks: one 2-hour live session on Sundays (7–9pm IST), optional Wednesday office hours, and ~3 hours on the week’s hands-on lab.
Are sessions recorded?+
Yes. Every live session and office hour is recorded, so you’re covered if you can’t attend live or you’re in a US timezone.
What will I have at the end?+
Three deployed, portfolio-grade agentic systems (live URLs + GitHub repos + demo videos + blog posts), an earned certification with a LinkedIn badge, and the vocabulary to interview for Agentic AI Engineer roles.
Is there a job guarantee?+
No — we don’t make promises we can’t keep. You get a real portfolio, mock interviews, a resume/LinkedIn review, and referral intros for top performers. The work gets you hired; we set you up to do the work.
What’s the refund policy?+
Full refund within 7 days of cohort start, no questions asked, processed within 30 working days. After Day 7, no refunds. Early-bird, EMI, and scholarship seats are non-refundable.
Do you offer EMI or scholarships?+
Yes — no-cost EMI (3 & 6 months) via Razorpay, plus women-in-tech and need-based scholarships. Join the waitlist to be notified when applications open.
Cohort 1 · 50 seats · starts 23 August 2026
Join the waitlist
Early-bird opens 6 July 2026. Waitlist members get the best price and 24 hours of early access before public enrolment. Seats are capped at 50.