Retrieval Augmented Generation (RAG) has become an industry standard method for interfacing with large language models by making them 'context aware'. However, there are a number of situations where a text generation problem is not solved by interacting with large vector database containing many documents. These problems require context but where the context is not known before query time and is often unrelated to existing vector stores. Usually, they are questions about single documents where desirable behavior is to allow the document to be specified at runtime.
Read more...