📡 You're offline — showing cached content
New version available!
Quick Access
Tutorials Generative AI Engineering GenAI Engineering Interview Prep

GenAI Engineering Interview Prep

5 min read Quiz at the end
Key GenAI engineering interview topics: RAG, embeddings, agents, hallucinations, fine-tuning, guardrails, cost.

GenAI Engineering Interview Topics

  • RAG vs Fine-tuning — RAG: retrieval at runtime, dynamic, no training; Fine-tuning: static knowledge baked into weights
  • Embeddings — dense vectors capturing semantic meaning; cosine similarity measures relatedness
  • Vector database — stores embeddings, enables fast approximate nearest-neighbour (ANN) search
  • Hallucination — LLM generates confident but incorrect content; prevent with RAG, citations, low temp
  • Agent vs chain — chain is a predefined sequence; agent decides dynamically which tools to use
  • Tool use / function calling — LLM selects external functions with structured input schemas
  • Context window — max tokens processed at once; manage with sliding history and summarisation
  • Semantic cache — reuse responses for similar queries by embedding similarity threshold
  • LLM-as-judge — use an LLM to rate output quality on a rubric (accuracy, clarity, hallucination)
  • Chunking — split docs into pieces for indexing; size and overlap affect retrieval precision
  • LoRA — train small adapter weights instead of full model; 100x fewer trainable parameters
  • Guardrails — input validation + output checking to prevent harmful or off-topic LLM responses
  • Streaming — deliver tokens as generated for better perceived performance
  • Prompt caching — Anthropic caches repeated system prompts for up to 90% token cost reduction
Topic Quiz · 1 questions

Test your understanding before moving on

1. What is the key difference between an LLM chain and an LLM agent?
💡 A chain is a fixed sequence of steps; an agent uses an LLM to reason about which tools to call and in what order.