2.5 KiB
2.5 KiB
GRACE - Despliegue en RunPod
Estado Actual
| Componente | Estado |
|---|---|
| Endpoint ID | r00x4g3rrwkbyh |
| Template ID | 16zimxx2xj |
| Nombre | grace-gpu |
| GPU | RTX 4090 |
| Balance | $73+ |
| Imagen | PENDIENTE |
Problema
El template usa la imagen base runpod/pytorch:2.1.0-py3.10-cuda11.8.0-devel-ubuntu22.04 sin el handler de GRACE. Necesita imagen custom.
Solucion: Build y Push Docker Image
Opcion 1: Build Local
# En maquina con Docker
cd grace/runpod
# Build
docker build -t tzzrsystems/grace:latest .
# Login a DockerHub
docker login
# Push
docker push tzzrsystems/grace:latest
Opcion 2: GitHub Actions (Recomendado)
Crear .github/workflows/build-grace.yml:
name: Build GRACE Docker
on:
push:
branches: [main]
paths: ['runpod/**']
jobs:
build:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- name: Login to DockerHub
uses: docker/login-action@v3
with:
username: ${{ secrets.DOCKERHUB_USERNAME }}
password: ${{ secrets.DOCKERHUB_TOKEN }}
- name: Build and Push
uses: docker/build-push-action@v5
with:
context: ./runpod
push: true
tags: tzzrsystems/grace:latest
Actualizar Template en RunPod
Una vez la imagen este en DockerHub:
curl -X POST "https://api.runpod.io/graphql" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer YOUR_API_KEY" \
-d '{
"query": "mutation { saveTemplate(input: { id: \"16zimxx2xj\", imageName: \"tzzrsystems/grace:latest\" }) { id imageName } }"
}'
Verificar Funcionamiento
# Test EMBEDDINGS (mas rapido)
curl -X POST "https://api.runpod.ai/v2/r00x4g3rrwkbyh/runsync" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"input": {
"contract_version": "2.1",
"routing": {"module": "EMBEDDINGS"},
"payload": {"type": "text", "encoding": "utf-8", "content": "Test"},
"context": {}
}
}'
Modulos Disponibles
| Modulo | Uso | Modelo |
|---|---|---|
| ASR_ENGINE | Audio -> Texto | Whisper Large V3 |
| OCR_CORE | Imagen -> Texto | GOT-OCR 2.0 |
| TTS | Texto -> Audio | XTTS-v2 |
| EMBEDDINGS | Texto -> Vector | BGE-Large |
| FACE_VECTOR | Imagen -> Vector facial | InsightFace |
| AVATAR_GEN | Prompt -> Imagen | SDXL |
Estimacion de Costos
RTX 4090: ~$0.44/hora
- Cold start: 2-4 min
- Idle timeout: 10 seg
Con minWorkers=0 solo pagas por uso real.
Documentacion generada 2025-12-24