Google Vertex AI Provides RAG Engine for Large Language Model Grounding

Vertex AI RAG Engine is a managed orchestration service designed to make it easier to connect large language models (LLMs) to external data sources, so that they stay more up to date, generate more relevant responses, and hallucinate less.

According to Google, the new RAG Engine is the “sweet spot” for developers using Vertex AI to build RAG-based LLM applications, balancing the ease of use of Vertex AI Search against the power of a custom RAG pipeline built on lower-level Vertex AI APIs such as the Text Embedding API and the Ranking API.

The overall workflow supported by Vertex AI RAG Engine includes distinct steps for data ingestion from a number of different sources; data transformation, such as splitting data into chunks before indexing; embedding, which provides a…
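To make the transformation step more concrete, here is a minimal sketch of splitting text into overlapping chunks before indexing. It is plain Python and does not call Vertex AI RAG Engine; the function name `chunk_text` and the `chunk_size` and `overlap` parameters are hypothetical, chosen only for illustration, and the word-based splitting is an assumption rather than a description of how the service chunks data.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 20) -> list[str]:
    """Split text into word-based chunks that share `overlap` words.

    Illustrative only: sizes are counted in whitespace-separated words,
    which is an assumption, not the behavior of any specific service.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    for chunk in chunk_text("a b c d e f g h", chunk_size=4, overlap=1):
        print(chunk)
```

The overlap keeps a little shared context between neighboring chunks, so a sentence that straddles a boundary is still partly visible in both; each chunk would then be passed on to the embedding step.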
