Deploying with TencentCloud Ti-one
Guide on deploying Youtu-RAG backend services on TencentCloud Ti-one platform.
This document provides instructions for deploying Youtu Embedding, HiChunk, and Youtu Parsing backend services on TencentCloud Ti-one platform.
Overview
TencentCloud Ti-one is a one-stop machine learning platform that provides model training, deployment, and inference services. Using Ti-one, you can deploy the Youtu-RAG backend services without managing your own GPU infrastructure.
Prerequisites
- A TencentCloud account with Ti-one and Tencent Container Registry (TCR) access
Deploying Services on Ti-one
Step 1: Prepare Container Images
- Build Docker images following the respective Docker deployment guides for each backend service
- Upload the Docker images to TencentCloud Container Registry (TCR). You can follow the TCR documentation for instructions on pushing images to TCR.
Step 2: Create a Model Service
- Log in to the TencentCloud Console and navigate to Ti-one
- Go to Model Services > Online Services
- Click Create Service
Step 3: Configure the Model
- Service Name: Enter a name for your service (e.g.,
youtu-embedding) - Deployment Method: Select Standard deployment
- Source of Machine: Choose Select from CVMs if you have existing instances, or Purchased on TI-ONE to create new instances
- Model Source: Select Container Images and choose the container image you just uploaded to TencentCloud Container Registry (TCR)
- Port: Default to 8501, or specify a custom port matching your container configuration
- Spec: Choose a GPU instance
Step 4: Update the Service Endpoints
- After creating the service, navigate to the service details page
- Find and copy the Call Address under Regular Service Calling in the Service Call section. This will be your service endpoint for Youtu-RAG integration.
Configuring Youtu-RAG
Once all services are deployed on Ti-one, update your .env file with the Ti-one service endpoints. You will also need to check your RAG configuration files as indicated in the local deployment guide to ensure compatibility.
