GenAI Engineering Interview Prep
5 min read Quiz at the end
Key GenAI engineering interview topics: RAG, embeddings, agents, hallucinations, fine-tuning, guardrails, cost.
GenAI Engineering Interview Topics
- RAG vs Fine-tuning — RAG: retrieval at runtime, dynamic, no training; Fine-tuning: static knowledge baked into weights
- Embeddings — dense vectors capturing semantic meaning; cosine similarity measures relatedness
- Vector database — stores embeddings, enables fast approximate nearest-neighbour (ANN) search
- Hallucination — LLM generates confident but incorrect content; prevent with RAG, citations, low temp
- Agent vs chain — chain is a predefined sequence; agent decides dynamically which tools to use
- Tool use / function calling — LLM selects external functions with structured input schemas
- Context window — max tokens processed at once; manage with sliding history and summarisation
- Semantic cache — reuse responses for similar queries by embedding similarity threshold
- LLM-as-judge — use an LLM to rate output quality on a rubric (accuracy, clarity, hallucination)
- Chunking — split docs into pieces for indexing; size and overlap affect retrieval precision
- LoRA — train small adapter weights instead of full model; 100x fewer trainable parameters
- Guardrails — input validation + output checking to prevent harmful or off-topic LLM responses
- Streaming — deliver tokens as generated for better perceived performance
- Prompt caching — Anthropic caches repeated system prompts for up to 90% token cost reduction
Topic Quiz · 1 questions
Test your understanding before moving on
1. What is the key difference between an LLM chain and an LLM agent?
💡 A chain is a fixed sequence of steps; an agent uses an LLM to reason about which tools to call and in what order.