GenAI System • 2026RAGLLM EvaluationVector SearchGuardrailsDeployment

Portfolio RAG: Grounded Recruiter Q&A (Citations + Evals)

RAG system that answers recruiter questions about my projects and resume with source citations and evaluation scoring.

Golden-set Q/A

30+ labeled questions

Faithfulness

Target: ≥ 90% grounded

Avg latency

Target: < 3.0s

Cost / query

Tracked (tokens + retrieval)

RAG system with citations and evaluation dashboard

Problem

Recruiters and hiring managers want fast, trustworthy answers about experience and impact — but typical chatbots hallucinate. The goal was a grounded QA system that only answers using my portfolio + resume sources, with citations and measurable reliability.

Approach

Ingested portfolio case studies + resume PDFs into a document pipeline (chunking + metadata).
Embedded chunks into a vector index and added a retrieval layer (top-k + optional rerank).
Generated answers with strict grounding: citations required; refuse when evidence is missing.
Built an evaluation harness (golden Q/A) to score answer relevance + citation coverage + faithfulness.
Instrumented usage telemetry to monitor latency, failure rates, and cost per query.

Impact

Demonstrates production-style RAG thinking: grounding, evals, guardrails, and monitoring — not just a demo chatbot.
Turns a portfolio into an interactive, verifiable knowledge base recruiters can trust.
Shows AI PM skill set: system design tradeoffs (quality vs latency vs cost) and measurable reliability.

Deployment & Monitoring

Deployed as a web app with API routes for retrieval + generation.
Guardrails: ‘answer only from sources’ + refusal fallback + max context limits.
Monitoring: latency, cost/query, top queries, retrieval hit-rate, and eval regression checks after changes.
Iteration loop: add new golden questions whenever new portfolio content ships.

Risks & Tradeoffs

Over-retrieval: too many chunks increases cost and can dilute grounding — tuned k and chunk size.
Missing evidence: system must refuse instead of guessing — enforced citation requirement.
Stale index: portfolio updates require re-embedding — added a lightweight re-index workflow.

Tools

Next.jsTypeScriptOpenAI API (or equivalent)Vector DBEvaluation Harness

Deep Dive

Live Demo GitHub Repo PRD (1-page)

← LLM Evaluation & Responsible AI Suite AI PM Ops Copilot: JD → Fit-Gap → Tailored Assets (Human-in-the-loop) →