52 lines
873 B
Markdown
52 lines
873 B
Markdown
|
|
# GRACE - GPU Real-time Audio Conversion Engine
|
||
|
|
|
||
|
|
## Descripción
|
||
|
|
Servicio de conversión de audio en tiempo real usando GPU.
|
||
|
|
|
||
|
|
## Módulos
|
||
|
|
- **ASR_ENGINE**: Speech-to-Text (Whisper)
|
||
|
|
- **TTS_ENGINE**: Text-to-Speech (XTTS)
|
||
|
|
|
||
|
|
## Endpoint RunPod
|
||
|
|
- ID: `r00x4g3rrwkbyh`
|
||
|
|
- URL: `https://api.runpod.ai/v2/r00x4g3rrwkbyh/runsync`
|
||
|
|
- Workers: 2
|
||
|
|
- GPU: NVIDIA L4
|
||
|
|
|
||
|
|
## Código
|
||
|
|
Ubicación: `s3://architect/gpu-services/grace/code/handler.py`
|
||
|
|
|
||
|
|
## Uso
|
||
|
|
|
||
|
|
### ASR (Speech-to-Text)
|
||
|
|
```json
|
||
|
|
{
|
||
|
|
"input": {
|
||
|
|
"module": "ASR_ENGINE",
|
||
|
|
"data": {
|
||
|
|
"audio_base64": "...",
|
||
|
|
"language": "es"
|
||
|
|
}
|
||
|
|
}
|
||
|
|
}
|
||
|
|
```
|
||
|
|
|
||
|
|
### TTS (Text-to-Speech)
|
||
|
|
```json
|
||
|
|
{
|
||
|
|
"input": {
|
||
|
|
"module": "TTS_ENGINE",
|
||
|
|
"data": {
|
||
|
|
"text": "Hola mundo",
|
||
|
|
"language": "es",
|
||
|
|
"voice_id": "default"
|
||
|
|
}
|
||
|
|
}
|
||
|
|
}
|
||
|
|
```
|
||
|
|
|
||
|
|
## Arquitectura
|
||
|
|
```
|
||
|
|
Cliente → RunPod → GRACE Handler → Whisper/XTTS → Response
|
||
|
|
```
|