AI Models and LLM Systems

AI Models and LLM Systems Graphics Coverage

Primary chapter graphic: LLM Glossary Map, Generative AI Learning Roadmap, Transformer Encoder Decoder Flow, AI Concepts Map, MCP Server Catalog, Frontier Model One-Pager, Generative AI Tech Stack, Reasoning Model Training Path, Open Model Inference Flow, RAG vs Fine-Tuning, Full Fine-Tuning vs LoRA vs RAG, Open Source AI Stack, How LLMs See Text, Visual Token Context Compression. Accepted graphics: 14. Reviewed non-signal pages: 5. Open graphics in review: 0. QA status lives in graphics audit and visual review ledger.

Corpus pages: p. 38, p. 43, p. 74, p. 131, p. 147, p. 158, p. 175, p. 194-195, p. 200-201, p. 230, p. 242, p. 266-267, p. 277, p. 326-327, p. 335-336, p. 343, p. 364, p. 373-374 Coverage: 24 pages; low-confidence extraction ranges: p. 364, p. 373-374

This chapter is part of Marius's owned architecture build corpus. The text routes decisions; durable implementation signal is carried by accepted graphics, reviewed non-signal decisions, and the linked QA audit.

Chapter Visuals

Accepted graphics carry the canonical design signal for this chapter. Each selected source page is either accepted as a graphic or explicitly marked non-signal in the source-faithful ledger. Review and QA state live in visual inventory, visual review ledger, and graphics audit.

LLM Glossary Map

source-page: p. 147
batch: 04
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0147-ai-models-and-llm-systems.json
svg: bbg-p0147-ai-models-and-llm-systems.svg

Generative AI Learning Roadmap

source-page: p. 77
batch: 05
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0077-architecture-source-map-architecture-source.json
svg: bbg-p0077-architecture-source-map-architecture-source.svg

Transformer Encoder Decoder Flow

source-page: p. 130
batch: 08
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0130-search-retrieval-and-rag-search.json
svg: bbg-p0130-search-retrieval-and-rag-search.svg

MCP Server Catalog

source-page: p. 158
batch: 09
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0158-ai-models-and-llm-systems.json
svg: bbg-p0158-ai-models-and-llm-systems.svg

AI Concepts Map

source-page: p. 194
batch: 09
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0194-ai-models-and-llm-systems.json
svg: bbg-p0194-ai-models-and-llm-systems.svg

source-page: p. 73
batch: 13
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0073-api-gateways-and-contracts-api.json
svg: bbg-p0073-api-gateways-and-contracts-api.svg

Generative AI Tech Stack

source-page: p. 222
batch: 13
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0222-cloud-infrastructure-and-iac-cloud.json
svg: bbg-p0222-cloud-infrastructure-and-iac-cloud.svg

Reasoning Model Training Path

source-page: p. 38
batch: 14
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0038-ai-models-and-llm-systems.json
svg: bbg-p0038-ai-models-and-llm-systems.svg

Open Model Inference Flow

source-page: p. 276
batch: 15
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0276-architecture-source-map-architecture-source.json
svg: bbg-p0276-architecture-source-map-architecture-source.svg

RAG vs Fine-Tuning

source-page: p. 366
batch: 15
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0366-platform-selection-and-tradeoffs-platform.json
svg: bbg-p0366-platform-selection-and-tradeoffs-platform.svg

Full Fine-Tuning vs LoRA vs RAG

source-page: p. 326
batch: 17
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0326-ai-models-and-llm-systems.json
svg: bbg-p0326-ai-models-and-llm-systems.svg

Open Source AI Stack

source-page: p. 24
batch: 23
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0024-deployment-release-and-versioning-deployment.json
svg: bbg-p0024-deployment-release-and-versioning-deployment.svg

How LLMs See Text

source-page: p. 364
batch: 25
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0364-ai-models-and-llm-systems.json
svg: bbg-p0364-ai-models-and-llm-systems.svg

Visual Token Context Compression

source-page: p. 373
batch: 26
status: accepted
reviewer-status: reviewed
fidelity-score: 0.9
spec: bbg-p0373-ai-models-and-llm-systems.json
svg: bbg-p0373-ai-models-and-llm-systems.svg

Open Review Queue

none

Reviewed Non-Signal Pages

AI Models And LLM Systems: LLM + Vector Search Map: source p. 195; batch 08; status non-signal/reviewed; ledger reason in visual-review-ledger.json
AI Models And LLM Systems: LLM + Tool Map: source p. 230; batch 08; status non-signal/reviewed; ledger reason in visual-review-ledger.json
AI Models And LLM Systems: LLM + Tool Map: source p. 242; batch 11; status non-signal/reviewed; ledger reason in visual-review-ledger.json
AI Models And LLM Systems: LLM + Embedding Map: source p. 277; batch 12; status non-signal/reviewed; ledger reason in visual-review-ledger.json
AI Models And LLM Systems: Embedding + Pattern Map: source p. 131; batch 16; status non-signal/reviewed; ledger reason in visual-review-ledger.json

Use When

A workflow needs language understanding, extraction, classification, summarization, or generation.

Avoid When

A deterministic rule is cheaper, clearer, and more reliable.

Core Model

Model systems combine prompt, context, tool constraints, evaluation, fallback, and cost controls.
Prefer explicit ownership over accidental coupling. Every boundary should say who owns correctness, cost, data, recovery, and change.
Use corpus page pointers for inspection, and keep the chapter notes focused on reusable design decisions.

Implementation Guidance

Define task type, acceptable error, source context, output schema, review threshold, and regression set.
Write the smallest useful design note: purpose, inputs, outputs, state, failure behavior, observability, and rollback.
Choose the first implementation that can be tested against the real workflow without hiding a known production risk.

Tradeoffs

Fine-tuning can specialize behavior; retrieval keeps knowledge fresh without changing weights.
Centralization reduces duplicated work but can become a bottleneck when every team needs exceptions.
Specialized infrastructure helps at scale, but it must earn its operational cost.

Failure Modes

A model is evaluated by vibes instead of test cases tied to business risk.
The diagram shows boxes but not ownership, retry behavior, data freshness, or user-visible failure.
The system has no proof path for the highest-risk assumption.

Decision Checklist

Measure accuracy, refusal behavior, hallucination rate, latency, cost, and human review load.
Name the owner, source of truth, timeout, retry policy, and evidence that the path works.
Add one regression check for the failure mode most likely to recur.

Neutral Automation Examples

A classifier suggests categories with confidence and routes low-confidence items to a reviewer.
A neutral internal automation starts with fixtures, then adds credentials, permissions, and production scheduling only after the boundary is tested.
A customer-facing workflow keeps irreversible actions behind explicit approval until metrics show it is safe to automate further.

AI Models and LLM Systems ​