Rohit Bhardwaj

Director of Architecture, Expert in cloud-native solutions

Rohit Bhardwaj is a Director of Architecture working at Salesforce. Rohit has extensive experience architecting multi-tenant cloud-native solutions in Resilient Microservices Service-Oriented architectures using AWS Stack. In addition, Rohit has a proven ability in designing solutions and executing and delivering transformational programs that reduce costs and increase efficiencies.

As a trusted advisor, leader, and collaborator, Rohit applies problem resolution, analytical, and operational skills to all initiatives and develops strategic requirements and solution analysis through all stages of the project life cycle and product readiness to execution.
Rohit excels in designing scalable cloud microservice architectures using Spring Boot and Netflix OSS technologies using AWS and Google clouds. As a Security Ninja, Rohit looks for ways to resolve application security vulnerabilities using ethical hacking and threat modeling. Rohit is excited about architecting cloud technologies using Dockers, REDIS, NGINX, RightScale, RabbitMQ, Apigee, Azul Zing, Actuate BIRT reporting, Chef, Splunk, Rest-Assured, SoapUI, Dynatrace, and EnterpriseDB. In addition, Rohit has developed lambda architecture solutions using Apache Spark, Cassandra, and Camel for real-time analytics and integration projects.

Rohit has done MBA from Babson College in Corporate Entrepreneurship, Masters in Computer Science from Boston University and Harvard University. Rohit is a regular speaker at No Fluff Just Stuff, UberConf, RichWeb, GIDS, and other international conferences.

Rohit loves to connect on http://www.productivecloudinnovation.com.
http://linkedin.com/in/rohit-bhardwaj-cloud or using Twitter at rbhardwaj1.

Presentations

AI-Era System Design & Architecture Mastery Katas

Monday, 9:00 AM EST

Classic system design teaches you how to scale requests. AI-era architecture teaches you how to scale reasoning, retrieval, tokens, tools, trust, and cost.

In the AI era, the best architects do not just draw boxes. They design authority, evidence, fallback, observability, and cost controls into every system.

Modern system design has entered a new era. It’s no longer enough to optimize for uptime and latency — today’s systems must also be AI-ready, token-efficient, trustworthy, and resilient. Whether building global-scale apps, powering recommendation engines, or integrating GenAI agents, architects need new skills and playbooks to design for scale, speed, and reliability.

This full-day workshop blends classic distributed systems knowledge with AI-native thinking. Through case studies, frameworks, and hands-on design sessions, you’ll learn to design systems that balance performance, cost, resilience, and truthfulness — and walk away with reusable templates you can apply to interviews and real-world architectures.

Learning Outcomes

By the end of this workshop, participants will be able to:

Apply a 7-step AI-era system design framework to both interviews and real-world architecture reviews.
Design systems that scale across requests, users, regions, tokens, retrieval calls, model latency, and inference cost.
Architect production-grade RAG 2.0 pipelines using chunking, embeddings, hybrid search, reranking, GraphRAG, semantic caching, and freshness controls.
Design agentic architectures with bounded tools, permission scopes, human escalation, audit trails, and safe degradation.
Build multi-provider resilience strategies using model routing, fallback models, circuit breakers, budget caps, and graceful degradation ladders.
Apply AI security patterns for prompt injection, indirect prompt injection, data exfiltration, unsafe tool use, and authorization bypass.
Define AI observability and SLOs for hallucination rate, grounding quality, retrieval precision, drift, latency, token cost, and escalation rate.
Practice architecture katas for e-commerce, ride-sharing, search, fraud detection, social feed, video streaming, and AI assistants.
Defend trade-offs under interview pressure using clear diagrams, capacity estimates, failure-mode reasoning, and cost-aware decisions.
Leave with reusable templates for AI System Design Canvas, RAG Checklist, Agent Safety Checklist, Token Capacity Planner, AI SLO Dashboard, and Chaos Runbook.

AI Inference at Scale: Reliability, Observability, Cost & Sustainability

Tuesday, 8:30 AM EST

AI inference is no longer a simple model call—it is a multi-hop DAG of planners, retrievers, vector searches, large models, tools, and agent loops. With this complexity comes new failure modes: tail-latency blowups, silent retry storms, vector store cold partitions, GPU queue saturation, exponential cost curves, and unmeasured carbon impact.

In this talk, we unveil ROCS-Loop, a practical architecture designed to close the four critical loops of enterprise AI:
•Reliability (Predictable latency, controlled queues, resilient routing)
•Observability (Full DAG tracing, prompt spans, vector metrics, GPU queue depth)
•Cost-Awareness (Token budgets, model tiering, cost attribution, spot/preemptible strategies)
•Sustainability (SCI metrics, carbon-aware routing, efficient hardware, eliminating unnecessary work)

KEY TAKEAWAYS
•Understand the four forces behind AI outages (latency, visibility, cost, carbon).
•Learn the ROCS-Loop framework for enterprise-grade AI reliability.
•Apply 19 practical patterns to reduce P99, prevent retry storms, and control GPU spend.
•Gain a clear view of vector store + agent observability and GPU queue metrics.
•Learn how ROCS-Loop maps to GCP, Azure, Databricks, FinOps & SCI.
•Leave with a 30-day action plan to stabilize your AI workloads.

⸻

AGENDA
1.The Quiet Outage: Why AI inference fails
2.X-Ray of the inference pipeline (RAG, agents, vector, GPUs)
3.Introducing the ROCS-Loop framework
4.19 patterns for Reliability, Observability, FinOps & GreenOps
5.Cross-cloud mapping (GCP, Azure, Databricks)
6.Hands-on: Diagnose an outage with ROCS
7.Your 30-day ROCS stabilization plan
8.Closing: Becoming a ROCS AI Architect

Architecting Microservices for Agentic AI Integration

Tuesday, 10:30 AM EST

Autonomous LLM agents don’t just call APIs — they plan, retry, chain, and orchestrate across multiple services. That fundamentally changes how we architect microservices, define boundaries, and operate distributed systems.
This session delivers a practical architecture playbook for Agentic AI integration — showing how to evolve from simple request/response designs to resilient, event-driven systems. You’ll learn how to handle retry storms, contain failures with circuit breakers and bulkheads, implement sagas and outbox patterns for correctness, and version APIs safely for long-lived agents.
You’ll leave with reference patterns, guardrails, and operational KPIs to integrate agents confidently—without breaking production systems.

Problems Solved

Microservices collapse under agent retries or fan-out behavior
Lack of event logs or compensations breaks agent re-planning
Failures cascade due to missing bulkheads or circuit breakers
Non-deterministic APIs cause unpredictable agent actions
Ops teams can’t separate or monitor agent vs human traffic

Why Now

Agentic frameworks (Agentforce, LangGraph, CrewAI) are entering production.
Traditional microservices assume human or synchronous clients — not autonomous retriers.
Reliability, determinism, and observability must now be built into API contracts.
Agent traffic adds new stress patterns and compliance visibility requirements.

What Is Agentic AI in Microservices

Agents plan, retry, and chain service calls — requiring deterministic, idempotent APIs.
Services must be tool-callable (stable operationId, strict input/output schemas).
Systems must survive retry storms, fan-out, and long-lived sessions.

Agenda
Opening: The Shift to Agent-Driven Systems How autonomous agents change microservice assumptions. Why request/response architectures fail when faced with planning, chaining, and self-healing agents.

Pattern 1: Event-Driven Flows Use events, queues, and replay-safe designs to decouple agents from synchronous APIs. Patterns: pub/sub, event sourcing, and replay-idempotency.

Pattern 2: Saga and Outbox Patterns Manage long workflows with compensations. Ensure atomicity and reliability between DB and event bus. Outbox → reliable publish; Saga → rollback on failure.

Pattern 3: Circuit Breakers and Bulkheads Contain agent-triggered failure storms. Apply timeout, retry, and fallback policies per domain. Prevent blast-radius amplification across services.

Pattern 4: Service Boundary Design Shape services around tasks and domains — not low-level entities. Example: ReserveInventory, ScheduleAppointment, SubmitClaim. Responses must return reason codes + next actions for agent clarity. Avoid polymorphic or shape-shifting payloads.

Pattern 5: Integrating Agent Frameworks Connect LLM frameworks (Agentforce, LangGraph) safely to services. Use operationId as the agent tool name; enforce strict schemas. Supervisor/planner checks between steps. Asynchronous jobs: job IDs, progress endpoints, webhooks.

Pattern 6: Infrastructure and Operations

Observability: Tag agent runs (x-agent-run-id), trace retries, success/failure.
Versioning: Use SemVer, deprecation headers, and multi-version gateways.
Resilience: Autoscale on retry rate, degrade gracefully, and run failover drills.

Wrap-Up: KPIs and Guardrails for Production Key metrics: retry rate, success ratio, agent throughput, event replay lag. Lifecycle governance: monitoring, versioning, deprecation, and sunset plans.

Key Framework References

Salesforce Agentforce – agentic orchestration and guardrail templates
LangGraph / CrewAI – multi-agent planning and coordination patterns
Cloud Native Patterns: Saga, Outbox, Circuit Breaker, Bulkhead, Event-Driven Architecture
OpenTelemetry + Prometheus: Observability for agent vs human traffic
OWASP LLM Top-10: Guardrails for safe function calling and data handling

Takeaways

Blueprint for agent-friendly microservices architecture
Patterns for event-driven, saga, and outbox consistency
Guardrails: circuit breakers, bulkheads, least privilege APIs
Framework integration checklist (Agentforce, LangGraph, etc.)
Ops playbook for observability, versioning, and resilience
KPIs to measure readiness: retry rate, grounding accuracy, and agent success ratio

Beyond APIs: Orchestration Patterns & MCP for Multi-Agent Systems

Tuesday, 1:00 PM EST

Enterprises are moving from single AI agents to networks of agents that trigger thousands of API calls, retries, and tool-chains per prompt. Without orchestration discipline and APIs built for AI-scale, systems buckle under bursty load, retry storms, cache-miss spikes, inconsistent decisions, and runaway costs.

This talk shows how to combine MCP (Model Context Protocol) with proven inter-agent orchestration patterns — Supervisor, Pub/Sub, Blackboard, Capability Router — and how to harden APIs for autonomous traffic using rate limits, dedupe, backpressure, async workflows, resilient caching, and autoscaling without bill shock.

You’ll also learn the AIRLOCK Framework for governing multi-agent behavior with access boundaries, identity checks, rate controls, least-privilege routing, observability, compliance filters, and kill-switches.

You will walk away with a practical blueprint for building multi-agent systems that are fast, safe, reliable, and cost-predictable.

KEY TAKEAWAYS
Pattern Literacy: When to use Orchestrator, Pub/Sub, Blackboard, Router

MCP Fluency: Standardize agent↔tool integration

API Scaling: Rate limits, dedupe, backpressure, async, caching

Resilience: Bulkheads, jitter, circuit breakers, autoscaling guardrails

Observability: Trace chain-ID/tool-ID across agents & tools

AIRLOCK Governance: Access boundaries, identity, rate controls, least-privilege routing, compliance, kill-switches

AGENDA

Why AI Changes Load Patterns
Bursty workloads · fan-out · retry amplification · cost spikes
MCP 101
Standardized agent→tool access · hot-swappable tools
Orchestration Patterns
Supervisor · Pub/Sub · Blackboard · Capability Router
Architecting APIs for AI Traffic
Multi-dimensional rate limits · dedupe · backpressure · SWR caching · async
Resilience & Autoscaling
Circuit breakers · bulkheads · kill-switches · budget caps
Observability & Governance
Chain-ID tracing · anomaly detection · AIRLOCK boundaries

Run an AWS Generative-AI Well-Architected Review

Tuesday, 3:00 PM EST

A live, end-to-end walkthrough of an AWS Well-Architected Review for a GenAI app. You’ll learn how to apply the AWS Generative AI Lens across the six pillars, then add Bedrock Guardrails and Knowledge Bases (RAG) to raise reliability, safety, and accuracy. You’ll leave with a reusable checklist and a prioritized remediation plan.

Who it’s for & why

Cloud architects, AI/ML engineers, SRE/Platform teams, and security leads
Moving from PoCs → production with need for a repeatable WA review process

What you’ll learn

How to apply Generative AI Lens questions per pillar
Spot hot spots: data flow, prompts, vector DBs, GPU scaling
Map risks → fixes with Guardrails + RAG patterns

What you’ll take away

WA Review scorecard & template
Pillar-by-pillar remediation backlog
Sample Guardrail policies (safety, PII, toxic output)
RAG/Knowledge Base reference architecture

Claude Certified Architect Readiness for Developers

Tuesday, 5:00 PM EST

Certification-readiness talk with architecture scenarios, exam-domain mapping, practical examples, and production-design guidance.
Claude is no longer just a chatbot for writing answers. It is becoming part of how developers design, build, review, and automate software. Claude Code can help developers work across repositories, Claude Code GitHub Actions can respond to issues and pull requests, MCP can connect Claude to external tools and systems, and the Claude Agent SDK enables developers to build custom agentic workflows. This creates a new skill requirement for architects: knowing how to design Claude-powered systems that are safe, measurable, governable, and production-ready.
This talk provides a practical readiness roadmap for developers and architects preparing for Claude architecture work and Claude certification-style expectations. We will cover Claude platform fundamentals, Claude Code workflows, MCP/tool governance, Agent SDK patterns, API design, RAG, evals, observability, security, and enterprise deployment concerns. Participants will also work through certification-style scenarios that test architectural judgment, not memorization.

The goal is simple: do not just learn Claude. Learn how to architect with Claude.
Claude's certification should not be treated as a badge. It should be treated as proof that an architect can design safe, production-ready Claude-powered systems.
Main audience promise
By the end of the talk, participants will understand what they need to study, practice, and demonstrate to become Claude architecture-ready.
They will leave with:

A Claude architecture domain map
A certification-readiness roadmap
A practical Claude Code workflow model
An MCP/tool governance model
Production architecture patterns for Claude-powered systems
Scenario-style questions for self-assessment
A 30-day preparation plan
Primary focus
This is not a generic Claude intro. It is a Claude architecture readiness talk.
It should cover:
Claude platform landscape
Claude Code workflow
Agent SDK
MCP/tool use
Claude API patterns
Claude-powered RAG
agentic workflows
context management
prompt and instruction design
evals
observability
security
data boundaries
cost awareness
deployment and governance
certification-style scenarios

GraphRAG & Explainable AI: Building Trustworthy LLM Outputs

Wednesday, 9:00 AM EST

Most enterprise LLM failures aren’t technical — they’re trust failures. Models hallucinate, drift from source truth, or produce outputs with no provenance. For regulated industries, that’s unacceptable. This session introduces GraphRAG — a breakthrough approach combining knowledge graphs (Neo4j) with retrieval-augmented generation to deliver traceable, explainable, and auditable AI outputs. You’ll learn how to design, evaluate, and deploy GraphRAG architectures aligned with the EU AI Act, NIST AI Risk Management Framework, and enterprise AI governance standards.

Problems Solved

LLM answers without evidence or traceability
Stale or inconsistent retrieval data
Non-compliance with transparency and provenance regulations
Lack of explainability for model outputs
Low confidence from regulators, auditors, and executives

Why Now

Enterprise AI adoption slowed by lack of trust and explainability
Regulations (EU AI Act, NIST AI RMF) now require provenance and model transparency
Executives demand evidence-based reasoning, not black-box answers

What GraphRAG Is

Combines knowledge graphs (Neo4j) with retrieval-augmented generation
Returns answers with structured evidence paths — connecting entities → relationships → source documents → LLM response
Goes beyond flat vector search to capture contextual meaning, hierarchy, and causality

Where It Applies

Insurance: Claims approvals and denials with transparent justification
Healthcare: Patient summaries with provenance and compliance
Finance: Audit trails, credit-risk reasoning, regulatory reporting
Policy & Legal: Regulatory interpretation and case law summaries

Why It’s Valuable

Establishes trust with executives, auditors, and regulators
Improves faithfulness, groundedness, and transparency of model outputs
Reduces disputes, compliance risks, and hallucination-related rework
Creates structured AI reasoning pipelines aligned with governance frameworks

Agenda
Opening & Problem Context Why trust is the bottleneck for enterprise AI. Examples of LLMs failing in regulated use cases — what breaks when outputs lack provenance.
Pattern 1: Anatomy of GraphRAG Understanding how GraphRAG extends RAG with Neo4j graphs. Schema design for entities, relationships, and evidence paths. Structured retrieval from graph → vector → generator.

Pattern 2: Architecture & Data Flow End-to-end GraphRAG blueprint: Ingestion → Entity extraction → Graph population → Retrieval orchestration → Response grounding. Contrast with plain RAG and vector-only approaches.

Pattern 3: Explainability & Evaluation Metrics for evaluating explainability: Faithfulness, groundedness, and coverage. How to trace model answers back to graph nodes and documents. Integration with AI observability platforms (PromptLayer, Arize, etc.).

Pattern 4: Compliance & Governance Alignment Connecting GraphRAG design to regulatory frameworks:

EU AI Act: Transparency, traceability, human oversight
NIST AI RMF: Trustworthiness and accountability
ISO 42001: AI Management Systems Implementing provenance tags and explainability layers as compliance enablers.

Pattern 5: Real-World Scenarios Industry case patterns:

“Why was this insurance claim denied?”
“Which regulation does this contract violate?”
“Which patient data contributed to this summary?” Each example maps relationships, evidence, and trace paths through Neo4j.

Wrap-Up & Discussion Recap of GraphRAG architecture and design patterns. Checklist for adoption: schema templates, metrics, and governance integration. Q/A and enterprise discussion on explainable AI roadmaps.

Key Framework References

Microsoft GraphRAG: Open-source structured hierarchical retrieval pattern
Neo4j Graph Data Science & LLM Integration Guide
EU AI Act & NIST AI RMF: Provenance, explainability, and risk transparency
ISO/IEC 42001: AI governance and management principles
Gartner & Forrester: Trust and transparency as core adoption barriers

Takeaways

GraphRAG design blueprint (schema + ingestion + retriever)
Evaluation metrics: faithfulness, groundedness, coverage
Reference architecture diagrams for Neo4j + RAG + LLM stack
Playbook for integrating explainability with compliance frameworks

From Brute Force to Brilliance: Algorithmic Thinking in the Age of AI

Wednesday, 11:00 AM EST

Coding interviews and production systems share the same challenge: transforming vague problems into correct, efficient, and explainable solutions. This talk introduces a 7-step algorithmic thinking framework that begins with a brute-force baseline and evolves toward an optimized, production-grade solution—using AI assistants like ChatGPT and GitHub Copilot to accelerate ideation, edge-case discovery, and documentation, without sacrificing rigor.
Whether you’re solving array or graph problems, optimizing data pipelines, or refactoring legacy logic, this framework builds the discipline of clarity before optimization—and shows how to use AI responsibly as a thinking partner, not a shortcut.

Why This Talk Now (in the AI Era)

AI is already in your workflow: 51% of professional developers use AI tools daily; 84% plan to adopt. (Stack Overflow Developer Survey)
AI boosts productivity, but needs structure: Controlled studies show developers complete tasks ~56% faster with GitHub Copilot—but correctness still requires disciplined reasoning. (arXiv)
Engineering leaders demand ROI + rigor: 71% of organizations report regular GenAI use, but need trustworthy frameworks to reduce “hallucination debt.” (McKinsey)
Interviews still test DS&A: Problem-solving frameworks outperform memorization. (Google Tech Dev Guide)

Problems Solved

Unclear or incomplete problem statements
Over-reliance on AI code suggestions without validation
Jumping to optimization before correctness
Failing to reason about time/space complexity
Difficulty communicating trade-offs in reviews or interviews

The 7-Step Algorithmic Thinking Playbook

Clarify – Define inputs, outputs, and constraints precisely.
Baseline – Write the simplest brute-force solution for correctness.
Measure – Analyze time and space complexity; identify bottlenecks.
Map Patterns – Recognize the family (array, tree, graph, DP, greedy).
Refactor – Apply the optimal pattern or data structure.
Validate – Test edge cases and boundary conditions automatically.
Explain – Communicate trade-offs, scalability, and readability.

Learning Outcomes

Apply a repeatable, 7-step problem-solving framework for any coding challenge.
Know when brute force is acceptable—and when optimization matters.
Confidently compare greedy vs. DP or iterative vs. recursive strategies.
Use AI tools responsibly for ideation, validation, and refactoring.
Communicate algorithmic reasoning clearly in code reviews and interviews.

Agenda
Opening: The AI-Accelerated Engineer How AI is reshaping developer workflows—and why algorithmic clarity matters more than ever. Examples of AI code that’s correct syntactically but wrong logically.

Pattern 1: Clarify and Baseline Turning vague questions into crisp specifications. Why starting with brute force improves correctness and confidence.

Pattern 2: Measure and Map Patterns How to quickly estimate complexity and identify known solution families. Mapping problems to arrays, graphs, or DP templates.

Pattern 3: Refactor with AI as a Partner Using Copilot or ChatGPT to suggest refactors, not replace reasoning. Prompt patterns for safe collaboration (“generate + verify + explain”). Spotting hallucinated optimizations.

Pattern 4: Validate and Explain Building automated test scaffolds and benchmark harnesses. AI-assisted edge-case discovery. How to articulate trade-offs in interviews or design docs.

Pattern 5: Framework in Action Live problem walkthrough: From brute-force substring search → optimized sliding window solution → complexity and trade-off explanation. Demonstrate where AI adds value and where human logic rules.

Pattern 6: Guardrails for AI-Assisted Coding Version control hygiene, reproducibility, test coverage. Ensuring deterministic, reviewable AI suggestions. Avoiding “hallucination debt” in production codebases.

Wrap-Up: From Algorithms to Systems Thinking How this framework extends from whiteboard problems to microservices, pipelines, and data workflows. Checklist for using AI as a disciplined amplifier of human reasoning.

Key Framework References

Stack Overflow Developer Survey (2024) – AI adoption statistics
GitHub Copilot Research – Productivity vs correctness studies
McKinsey State of AI Report – ROI benchmarks in engineering teams
Google Tech Dev Guide – Problem-solving and DS&A frameworks
IEEE/ACM Ethical AI Practices – Human-in-the-loop coding

Takeaways

7-Step Algorithmic Thinking Framework — printable reference card
AI Guardrails Checklist for safe Copilot/ChatGPT use in code and reviews
Prompt Templates for structured ideation, verification, and documentation
Live Case Study Walkthrough for clarity, optimization, and explanation
A mindset shift: from memorizing algorithms → to designing reasoning systems

Claude Code Architecture: From Vibe Coding to Governed Engineering Systems

Wednesday, 3:15 PM EST

Claude Code is not just a coding assistant. Used casually, it can create fast prototypes. Used architecturally, it can become a powerful engineering accelerator for discovery, refactoring, test generation, documentation, architecture reviews, and modernization.

This talk teaches architects, tech leads, and senior developers how to use Claude Code as part of a governed software delivery system. We will explore how to structure repositories, write effective CLAUDE.md guidance, create architecture guardrails, generate tests, review AI-produced code, and use Claude Code without turning your codebase into an ungoverned “vibe coding” experiment.

The core message is simple: Claude Code should not replace architecture judgment. It should amplify it.

Anthropic’s own Claude documentation emphasizes prompting clarity, examples, structured guidance, and agentic workflows, which makes architecture-level instructions especially important when using Claude in engineering systems.

Learning Outcomes

Participants will learn how to:

Use Claude Code as an architecture assistant, not just a code generator.
Create a strong CLAUDE.md file for repo-specific engineering guidance.
Apply architecture guardrails for SOLID, API design, testing, security, and maintainability.
Use Claude Code for refactoring, modernization, test generation, and documentation.
Review AI-generated code using architecture quality gates.
Avoid common risks: over-generation, hidden coupling, insecure code, weak tests, and design drift.

Agenda

Why Claude Code Changes Software Architecture
From pair programmer to repo-aware engineering agent.
Claude Code Is Not Your Architect
What Claude can automate, and what humans must still decide.
The Architecture Control File: CLAUDE.md
Repo context, coding standards, architecture rules, test expectations, and anti-patterns.
Architecture Katas with Claude Code
Refactor a service, add tests, improve API boundaries, document architecture decisions.
Quality Gates for AI-Generated Code
SOLID, security, performance, observability, testability, and maintainability checks.
Modernization and Refactoring Workflows
Using Claude Code for legacy analysis, decomposition, migration planning, and safe incremental change.
Security, Governance, and Team Adoption
Permissions, secrets, prompt injection, review workflows, and responsible usage.
Final Playbook
How to introduce Claude Code into an engineering team without losing architecture discipline.

From APIs to MCPs: Making Microservices Agent-Ready with Semantic Context

Wednesday, 5:00 PM EST

LLM agents don’t just fetch data—they decide and act. To support planning and chaining, microservices must expose not only endpoints but also semantic context: what entities mean, which states are valid, which actions come next, and why decisions were made. This talk shows how to evolve from data-only APIs to MCP-aware, semantically rich services using JSON-LD/Schema.org, Hydra-style affordances, domain events, and OpenAPI metadata. You’ll learn retrofit vs greenfield paths, see cross-industry demos, and leave with a migration checklist that makes your services truly agent-ready.

Agenda

Why MCPs matter: the “USB-C for AI apps.”
Gaps in data-only APIs: no semantics, state, or lineage.
Adding semantics: JSON-LD, Hydra, enriched OpenAPI, domain events.
How agents benefit: safer chaining, fewer errors, explainability.
Implementation paths: retrofit vs MCP-native.
Use cases & demos: retail discounts, manufacturing sensors, insurance risk, healthcare consent.

Takeaways

MCPs transform APIs into agent-collaborators, not just data pipes.
Semantic context = better planning, safer automation, fewer hallucinations.
Practical roadmap: enrich existing APIs or design new ones MCP-native.

Securing LLMs in Production: From OWASP Top-10 to Guardrails that Work

Thursday, 9:00 AM EST

Large Language Models unlock new capabilities—and expose brand-new attack surfaces. From prompt injection and data exfiltration to model denial-of-service and insecure plugin calls, adversaries are exploiting weaknesses traditional AppSec never anticipated.
The new OWASP LLM Top-10 provides a shared vocabulary for AI risks; this session turns that list into actionable engineering practice. You’ll learn how to threat-model LLM endpoints, design guardrails that actually block malicious behavior, sandbox tools and plug-ins with least privilege, and align your mitigations to the NIST AI Risk Management Framework for audit-ready governance.

Problems Solved

Unprotected LLM endpoints vulnerable to prompt injection or jailbreaks
Lack of policy filters or content-moderation layers
Plugins/tools running with excessive privileges
Poor data isolation leading to tenant cross-leakage in RAG systems
Missing visibility into misuse, drift, and attack attempts

Why Now

OWASP LLM Top-10 (2024–2025) defines a standard taxonomy of AI risks—widely adopted across the industry.
AI platforms are moving to production faster than security controls can catch up.
Regulators and auditors require demonstrable alignment with frameworks like NIST AI RMF and ISO 42001.
Attackers have begun weaponizing generative models through indirect prompt injection and model-driven DoS.

What You’ll Learn

Threat-modeling methodology for LLM endpoints and agentic flows
Input/output guardrail design: policy filters, allow-lists, block-lists, and content classifiers
Sandboxing tools, plug-ins, and function calls with least privilege and egress control
Sensitive-data redaction and tenancy-aware retrieval for secure RAG pipelines
Red-team drills mapped to OWASP categories and mitigation validation
How to map each control to NIST AI RMF (governance, risk, assurance)

Agenda
Opening: The New AI Attack Surface How LLMs change the threat model. Examples of real-world attacks: prompt injections, indirect injections, model DoS, and exfiltration via vector stores.

Pattern 1: Threat Modeling LLM Endpoints Identify assets, trust boundaries, and high-risk flows. Apply STRIDE-inspired analysis to prompts, context windows, retrieval layers, and plugin calls.

Pattern 2: Designing Input/Output Guardrails Policy filtering, schema validation, and content moderation. Runtime vs compile-time guardrails—what actually works in production. Enforcing determinism and fail-safe defaults.

Pattern 3: Sandboxing and Least Privilege Plugins Secure function calling: scoped IAM, network egress rules, per-plugin secrets, and API key vaulting. Container isolation and ephemeral agent sandboxes.

Pattern 4: Data Protection and Tenancy in RAG Redacting sensitive data before embedding. Segregating tenant vectors and access policies. Auditing data lineage and evidence paths.

Pattern 5: Red Team & Evaluation Frameworks Running adversarial simulations aligned with OWASP LLM Top-10. Common exploits and how to detect them. Integrating automated red-team tests into CI/CD pipelines.

Pattern 6: Governance & Framework Mapping Mapping mitigations to NIST AI RMF (categories RA, MA, ME). Building dashboards and executive summaries for risk reporting.

Wrap-Up & Action Plan Summarize practical controls that can be implemented within 30 days. Introduce the Guardrail Policy Starter Kit + Red-Team Runbook templates. Live checklist review for readiness maturity.

Key Framework References

OWASP LLM Top-10 (2024–2025) – Prompt Injection, Data Exfiltration, DoS, Insecure Plugins
NIST AI RMF (2023) – Governance + Risk + Assurance Categories
ISO/IEC 42001 – AI Management System Standard
MITRE ATLAS – Adversarial Tactics for AI Systems
OWASP SAMM + ASVS – Integrating AI security into AppSec programs

Takeaways

Clear understanding of OWASP LLM Top-10 risks in plain language
Guardrail Policy Starter Kit (template YAML + reference policies)
Sandboxing Playbook for tools and plugins (scoped IAM, network controls)
Red-Team Runbook for testing and validation
NIST AI RMF Mapping Guide for executive and audit reporting
A practical 30-day roadmap to move from reactive patching → resilient AI security