Context Window

The maximum number of tokens a language model can process in a single operation, determining how much information it can consider at once

Retrieval-Augmented Generation

AI architecture that retrieves external information before generating responses to improve accuracy and reduce hallucinations