Audio to Speech Translator(EN)

AI-powered tool for seamless speech-to-text, translation, and speech synthesis. Built with modern NLP and text-to-speech technologies.

Features

Transcription: Converts audio to text using Whisper.
Translation: Translates text to English.
Text-to-Speech: Converts translated text to natural speech using ElevenLabs.
Gradio Interface: Simple, intuitive UI.

Getting Started

Clone the Repo

git clone https://github.com/diegoruny/StS-translator
cd StS-translator

Install Dependencies

python -m venv .venv
.venv\Scripts\Activate
pip install -r requirements.txt

Set Up API Key

Create a .env file:

ELEVENLABS_API_KEY=your_elevenlabs_api_key_here

Usage

Run the App
```
python app.py
```
Use the Interface
- Access the app at http://127.0.0.1:7860/.
- Upload or record audio, receive translated speech.

Tech Stack

Whisper for transcription.
Translator for language conversion.
ElevenLabs for text-to-speech.
Gradio for the user interface.

Acnowledgements

This project was inspired by a YouTube based on a post and has been extended and customized to enhance user experience and functionality.

License

MIT License. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio to Speech Translator(EN)

Features

Getting Started

Usage

Tech Stack

Acnowledgements

License

About

Releases

Packages

Languages

diegoruny/StS-translator

Folders and files

Latest commit

History

Repository files navigation

Audio to Speech Translator(EN)

Features

Getting Started

Usage

Tech Stack

Acnowledgements

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages