Main Concept

A quick mental guide to when to use or not use a certain model available in Bedrock.

Context

This kind of questions are included on the certification exam.

DEFINITIVE CHART: Amazon Bedrock Models - When to Use What

USE CASEMODEL TO USEWHYKEYWORD CLUES IN QUESTION
SEARCH & RETRIEVAL
Semantic searchTitan Text Embeddings V2Converts text to vectors, finds similar”search by meaning”, “semantic search”, “find similar”
Image search (text + image)Titan Multimodal EmbeddingsHandles text AND images in same space”search products by photo OR description”, “image similarity”
Personalized recommendationsTitan Text Embeddings V2Embeddings capture preferences”recommend similar items”, “personalization”
CHATBOTS & Q&A
Enterprise chatbot with internal docsTitan Text Premier + Bedrock Knowledge BasesOptimized for RAG, native integration”internal knowledge base”, “RAG”, “cite sources”
Simple chatbot, low costTitan Text LiteCheap, sufficient for basic tasks”budget-constrained”, “simple Q&A”, “high volume”
Chatbot with complex reasoningClaude 3 (Sonnet/Opus)Better reasoning, deep analysis”complex reasoning”, “nuanced”, “analysis”
DOCUMENTS & TEXT
Very long documents (100K+ tokens)Claude 3200K context window (largest)“long documents”, “200+ pages”, “entire book”
Contract/legal analysisClaude 3 Sonnet/OpusPrecise reasoning, long context”contract analysis”, “legal documents”, “compliance”
Short document summarizationTitan Text ExpressPrice/performance balance”summarize”, “extract key points” (short docs)
CODE
Code generationTitan Text Premier / Claude 3Both support code generation”generate code”, “write script”, “programming”
Code explanationClaude 3Better for detailed explanations”explain this code”, “debug”, “code review”
IMAGES
Generate images with READABLE textTitan Image GeneratorOnly one that does readable text well”readable text in image”, “signs”, “price tags”, “posters”
Generate artistic imagesStable Diffusion XLMore flexible for art/creativity”artistic”, “creative”, “illustration”
Search images by contentTitan Multimodal EmbeddingsFor search, NOT generation”search images”, “find similar photos”
Analyze/describe imagesClaude 3 (with vision)Understands images, does NOT generate”what’s in this image”, “describe”, “analyze photo”
Detect if image is AI-generatedTitan Image Generator (watermark feature)Invisible watermarking + DetectGeneratedContent API”verify authenticity”, “detect AI-generated”
SPECIFIC TASKS
Simple classification (high volume)Titan Text LiteCheapest for simple tasks”classify into 5 categories”, “tag”, “millions of items”
Structured data extractionTitan Text Express/PremierTable creation, data formatting”extract data”, “create table”, “structure information”
TranslationTitan Text Express (100+ languages)Broad multilingual support”translate”, “multilingual”
Autonomous agents (multi-step)Titan Text Premier + Bedrock AgentsNative integration with Agents”autonomous agent”, “multi-step workflow”, “tool use”
SPECIAL CASES
Open-source / Custom fine-tuningLlama 2/3Open-source model, customizable”open-source”, “fine-tune with our data”, “self-hosted option”
Maximum accuracy (cost doesn’t matter)Claude 3 OpusMost capable (and most expensive)“highest accuracy”, “most capable”, “budget flexible”

KEY FEATURES TABLE (For Quick Comparison):

MODELCONTEXT WINDOWBEST FORPRICEDON’T USE WHEN
Titan Text Lite4K tokensSimple tasks, high volume💰 Cheapest❌ Complex reasoning
Titan Text Express8K tokensGeneral use, balanced💰💰 Medium❌ Very long docs
Titan Text Premier32K tokensRAG, Agents, enterprise💰💰💰 Medium-high❌ Don’t need Bedrock integration
Claude 3 Haiku200K tokensFast, economical💰💰 Medium❌ Maximum capability needed
Claude 3 Sonnet200K tokensCapability/cost balance💰💰💰 Medium-high❌ Very tight budget
Claude 3 Opus200K tokensMaximum capability💰💰💰💰 Most expensive❌ Simple tasks (overkill)
Llama 2 70B4K tokensOpen-source, customizable💰💰 Medium❌ Long docs, need support
Stable Diffusion XLN/A (image)Artistic images💰💰💰 Per image❌ Readable text in images
Titan Image GeneratorN/A (image)Readable text, watermarks💰💰💰 Per image❌ Very creative art
Titan Text Embeddings V28K tokensSearch, RAG, vectors💰 Cheap❌ Doesn’t generate text
Titan Multimodal EmbeddingsN/AMultimodal search💰💰 Medium❌ Doesn’t generate anything

GOLDEN RULES FOR THE EXAM:

✅ ALWAYS CHOOSE THESE IF


  1. Question mentions “Bedrock Knowledge Bases” or “Bedrock Agents” → Titan Text Premier (optimized for this)

  2. Question says “semantic search”, “find similar”, “RAG” → Titan Text Embeddings V2

  3. Question says “long documents”, “200+ pages”, “entire book” → Claude 3 (200K context)

  4. Question says “budget-constrained”, “high volume”, “simple task” → Titan Text Lite

  5. Question says “readable text in image”, “signs”, “posters” → Titan Image Generator

  6. Question mentions “search images by photo or text” → Titan Multimodal Embeddings

❌ NEVER CHOOSE THESE:

  1. GPT-4, ChatGPT, DALL-E → Not in Bedrock
  2. Embeddings model to generate text → Embeddings DON’T generate
  3. Titan Text Lite for complex analysis → Too simple
  4. Claude Opus for simple classification → Overkill (too expensive)

QUICK DECISION DIAGRAM:

What do you need to do?
         |
         ├─ Generate TEXT
         |      ├─ Long documents? → Claude 3
         |      ├─ Bedrock Agents/KB integration? → Titan Text Premier
         |      ├─ Simple task + cheap? → Titan Text Lite
         |      └─ General use? → Titan Text Express
         |
         ├─ SEARCH for similar things
         |      ├─ Text only? → Titan Text Embeddings V2
         |      └─ Text + images? → Titan Multimodal Embeddings
         |
         ├─ Generate IMAGES
         |      ├─ With readable text? → Titan Image Generator
         |      └─ Art/creativity? → Stable Diffusion XL
         |
         └─ ANALYZE images → Claude 3 (vision)

PRACTICE: Which Model Would You Use?

Scenario 1: “E-commerce with 500K products. Customers search by typing ‘comfortable red shoes for walking’.”

  • Answer: Titan Text Embeddings V2 - Semantic search in large catalog

Scenario 2: “Generate promotional posters with readable prices and product names.”

  • Answer: Titan Image Generator - Only one that does readable text well

Scenario 3: “Analyze 150-page contracts to identify risk clauses.”

  • Answer: Claude 3 Sonnet/Opus - 200K context + complex reasoning

Scenario 4: “Classify 1 million emails into spam/not-spam. Limited budget.”

  • Answer: Titan Text Lite - Simple task + high volume + cheap