Main Concept
A quick mental guide to when to use or not use a certain model available in Bedrock.
Context
This kind of questions are included on the certification exam.
DEFINITIVE CHART: Amazon Bedrock Models - When to Use What
| USE CASE | MODEL TO USE | WHY | KEYWORD CLUES IN QUESTION |
|---|---|---|---|
| SEARCH & RETRIEVAL | |||
| Semantic search | Titan Text Embeddings V2 | Converts text to vectors, finds similar | âsearch by meaningâ, âsemantic searchâ, âfind similarâ |
| Image search (text + image) | Titan Multimodal Embeddings | Handles text AND images in same space | âsearch products by photo OR descriptionâ, âimage similarityâ |
| Personalized recommendations | Titan Text Embeddings V2 | Embeddings capture preferences | ârecommend similar itemsâ, âpersonalizationâ |
| CHATBOTS & Q&A | |||
| Enterprise chatbot with internal docs | Titan Text Premier + Bedrock Knowledge Bases | Optimized for RAG, native integration | âinternal knowledge baseâ, âRAGâ, âcite sourcesâ |
| Simple chatbot, low cost | Titan Text Lite | Cheap, sufficient for basic tasks | âbudget-constrainedâ, âsimple Q&Aâ, âhigh volumeâ |
| Chatbot with complex reasoning | Claude 3 (Sonnet/Opus) | Better reasoning, deep analysis | âcomplex reasoningâ, ânuancedâ, âanalysisâ |
| DOCUMENTS & TEXT | |||
| Very long documents (100K+ tokens) | Claude 3 | 200K context window (largest) | âlong documentsâ, â200+ pagesâ, âentire bookâ |
| Contract/legal analysis | Claude 3 Sonnet/Opus | Precise reasoning, long context | âcontract analysisâ, âlegal documentsâ, âcomplianceâ |
| Short document summarization | Titan Text Express | Price/performance balance | âsummarizeâ, âextract key pointsâ (short docs) |
| CODE | |||
| Code generation | Titan Text Premier / Claude 3 | Both support code generation | âgenerate codeâ, âwrite scriptâ, âprogrammingâ |
| Code explanation | Claude 3 | Better for detailed explanations | âexplain this codeâ, âdebugâ, âcode reviewâ |
| IMAGES | |||
| Generate images with READABLE text | Titan Image Generator | Only one that does readable text well | âreadable text in imageâ, âsignsâ, âprice tagsâ, âpostersâ |
| Generate artistic images | Stable Diffusion XL | More flexible for art/creativity | âartisticâ, âcreativeâ, âillustrationâ |
| Search images by content | Titan Multimodal Embeddings | For search, NOT generation | âsearch imagesâ, âfind similar photosâ |
| Analyze/describe images | Claude 3 (with vision) | Understands images, does NOT generate | âwhatâs in this imageâ, âdescribeâ, âanalyze photoâ |
| Detect if image is AI-generated | Titan Image Generator (watermark feature) | Invisible watermarking + DetectGeneratedContent API | âverify authenticityâ, âdetect AI-generatedâ |
| SPECIFIC TASKS | |||
| Simple classification (high volume) | Titan Text Lite | Cheapest for simple tasks | âclassify into 5 categoriesâ, âtagâ, âmillions of itemsâ |
| Structured data extraction | Titan Text Express/Premier | Table creation, data formatting | âextract dataâ, âcreate tableâ, âstructure informationâ |
| Translation | Titan Text Express (100+ languages) | Broad multilingual support | âtranslateâ, âmultilingualâ |
| Autonomous agents (multi-step) | Titan Text Premier + Bedrock Agents | Native integration with Agents | âautonomous agentâ, âmulti-step workflowâ, âtool useâ |
| SPECIAL CASES | |||
| Open-source / Custom fine-tuning | Llama 2/3 | Open-source model, customizable | âopen-sourceâ, âfine-tune with our dataâ, âself-hosted optionâ |
| Maximum accuracy (cost doesnât matter) | Claude 3 Opus | Most capable (and most expensive) | âhighest accuracyâ, âmost capableâ, âbudget flexibleâ |
KEY FEATURES TABLE (For Quick Comparison):
| MODEL | CONTEXT WINDOW | BEST FOR | PRICE | DONâT USE WHEN |
|---|---|---|---|---|
| Titan Text Lite | 4K tokens | Simple tasks, high volume | đ° Cheapest | â Complex reasoning |
| Titan Text Express | 8K tokens | General use, balanced | đ°đ° Medium | â Very long docs |
| Titan Text Premier | 32K tokens | RAG, Agents, enterprise | đ°đ°đ° Medium-high | â Donât need Bedrock integration |
| Claude 3 Haiku | 200K tokens | Fast, economical | đ°đ° Medium | â Maximum capability needed |
| Claude 3 Sonnet | 200K tokens | Capability/cost balance | đ°đ°đ° Medium-high | â Very tight budget |
| Claude 3 Opus | 200K tokens | Maximum capability | đ°đ°đ°đ° Most expensive | â Simple tasks (overkill) |
| Llama 2 70B | 4K tokens | Open-source, customizable | đ°đ° Medium | â Long docs, need support |
| Stable Diffusion XL | N/A (image) | Artistic images | đ°đ°đ° Per image | â Readable text in images |
| Titan Image Generator | N/A (image) | Readable text, watermarks | đ°đ°đ° Per image | â Very creative art |
| Titan Text Embeddings V2 | 8K tokens | Search, RAG, vectors | đ° Cheap | â Doesnât generate text |
| Titan Multimodal Embeddings | N/A | Multimodal search | đ°đ° Medium | â Doesnât generate anything |
GOLDEN RULES FOR THE EXAM:
â ALWAYS CHOOSE THESE IFâŠ
-
Question mentions âBedrock Knowledge Basesâ or âBedrock Agentsâ â Titan Text Premier (optimized for this)
-
Question says âsemantic searchâ, âfind similarâ, âRAGâ â Titan Text Embeddings V2
-
Question says âlong documentsâ, â200+ pagesâ, âentire bookâ â Claude 3 (200K context)
-
Question says âbudget-constrainedâ, âhigh volumeâ, âsimple taskâ â Titan Text Lite
-
Question says âreadable text in imageâ, âsignsâ, âpostersâ â Titan Image Generator
-
Question mentions âsearch images by photo or textâ â Titan Multimodal Embeddings
â NEVER CHOOSE THESE:
- GPT-4, ChatGPT, DALL-E â Not in Bedrock
- Embeddings model to generate text â Embeddings DONâT generate
- Titan Text Lite for complex analysis â Too simple
- Claude Opus for simple classification â Overkill (too expensive)
QUICK DECISION DIAGRAM:
What do you need to do?
|
ââ Generate TEXT
| ââ Long documents? â Claude 3
| ââ Bedrock Agents/KB integration? â Titan Text Premier
| ââ Simple task + cheap? â Titan Text Lite
| ââ General use? â Titan Text Express
|
ââ SEARCH for similar things
| ââ Text only? â Titan Text Embeddings V2
| ââ Text + images? â Titan Multimodal Embeddings
|
ââ Generate IMAGES
| ââ With readable text? â Titan Image Generator
| ââ Art/creativity? â Stable Diffusion XL
|
ââ ANALYZE images â Claude 3 (vision)
PRACTICE: Which Model Would You Use?
Scenario 1: âE-commerce with 500K products. Customers search by typing âcomfortable red shoes for walkingâ.â
- Answer: Titan Text Embeddings V2 - Semantic search in large catalog
Scenario 2: âGenerate promotional posters with readable prices and product names.â
- Answer: Titan Image Generator - Only one that does readable text well
Scenario 3: âAnalyze 150-page contracts to identify risk clauses.â
- Answer: Claude 3 Sonnet/Opus - 200K context + complex reasoning
Scenario 4: âClassify 1 million emails into spam/not-spam. Limited budget.â
- Answer: Titan Text Lite - Simple task + high volume + cheap