Skip to content

marion-santiago/local-voice-agent

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Local Voice Agent Example

This is a modification of the Cartesia Voice Agent using faster_whisper, ollama, and xtts for local inference.

The example includes a custom Next.js frontend, Python agent and a modification of xtts-streaming-server.

Running the example

Prerequisites

  • Node.js
  • Python 3.9-3.12
  • LiveKit Cloud account (or OSS LiveKit server)
  • Ollama (for LLM)

Frontend

Copy .env.example to .env.local and set the environment variables. Then run:

cd frontend
npm install
npm run dev

Agent

Copy .env.example to .env and set the environment variables. Then run:

cd agent
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python main.py dev

TTS Server

cd server
python3 -m venv venv
source venv/bin/activate
pip install --use-deprecated=legacy-resolver -r requirements.txt
python -m unidic download
python main.py

About

An example Voice Pipeline Agent using local models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 74.7%
  • Python 22.4%
  • CSS 2.6%
  • JavaScript 0.3%