Software Architecture for AI
By O’Reilly: Software Architecture Superstream: Architecture Patterns and Antipatterns for AI - O’Reilly Live Events
Emerging Patterns in Agentic AI Link to heading
by Bharani Subramaniam
Append only context Link to heading
- If you change the context after sending to the LLM, it will miss the cache and be slow and expensive.
Constrain tool selection Link to heading
- Too many tools can be overwhelming for the LLM.
- Constraint tool selection to the specific subset by using a prefix in each tool name to help the LLM; e.g. search_, code_, etc.
Mind Your Language Models: An Approach to Architecting Intelligent Systems Link to heading
by Nischal HP
Embedding Models Link to heading
Text Processing & Chunking Link to heading
- Why chunk text?
- Limit on context windows
- Longer the text, lesser the relevance, vaguer the answer
- How to chunk text?
- Content and Context are the drivers
- Semantic
- Sentence
- Sliding window
- Hierarchical
- Content and Context are the drivers
- How to evaluate chunking?
- Real world queries
- Evaluation of several models
- Ranked segments and documents
- Precision, Recall, nDCG
Retrieval Strategy Link to heading
- Improving retrieval
- Query augmentation
- Keyword search
- Vector Search
- Fusion and Reranking