Environment Variables
This document outlines the configuration of key environment variables in Youtu-RAG.
Overview
Before running Youtu-RAG, you need to configure the necessary environment variables. A recommended practice is to start by copying the .env.example file from the project root:
Then, edit the .env file to fill in the required API Keys and other configurations.
Web UI
The Youtu-RAG web interface can be configured with the following settings:
LLM API Keys
The core capabilities of Youtu-RAG rely on Large Language Models (LLMs). You need to configure the appropriate models for different functional modules. This is the primary text generation LLM that the Agent relies on for its operations.
Tools
Some Toolkits require their own API Keys or specific configurations.
Search Toolkit (SearchToolkit)
The search toolkit integrates the following two services by default:
- Web Search: Uses the efficient Google Search API provided by Serper. You will need to register and obtain an API Key.
- Web Content Extraction: Uses the Jina AI Reader to convert web page content into an LLM-friendly Markdown format. This also requires registration and an API Key.
Configure them in your .env file as follows:
OCR Toolkit (Youtu-Parsing)
Youtu-RAG integrates an OCR service for extracting text from images and documents:
Embedding Toolkit (Youtu-Embedding)
Youtu-RAG supports two embedding service options:
Option 1: Local Youtu-Embedding Service (2048 dimensions)
Option 2: Other Embedding Service
Reranker Toolkit (Jina-Reranker)
The reranker service is used to improve retrieval accuracy by re-ranking search results:
Database
Vector Database
Youtu-RAG uses ChromaDB as the vector database for storing document embeddings:
Relational Database
A relational database is used for system administration, storing tracing data, and evaluation results:
You can also use other database systems by modifying the connection URL accordingly (e.g., PostgreSQL, MySQL).
MinIO
MinIO is used as object storage for managing RAG-related files:
Tracing & Monitoring
The framework integrates OpenTelemetry and Phoenix for tracing and monitoring the Agent's execution flow.
If you're using Phoenix locally, only the PHOENIX_ENDPOINT is required. For Phoenix Cloud service, you'll also need to configure the PHOENIX_API_KEY.
Memory
Youtu-RAG supports a memory feature for maintaining context across conversations:
Set to true to enable the memory feature, which allows the system to remember previous interactions.
Logging
Configure the logging level for the application:
Available log levels:
DEBUG: Detailed information for diagnosing problemsINFO: General informational messages (recommended for production)WARNING: Warning messages for potentially harmful situationsERROR: Error messages for serious problemsCRITICAL: Critical messages for very serious errors
