Clank is a voice-controlled LED automation project that combines speech recognition, local AI models, and ESP32-controlled hardware. Built on top of the Moonshine speech recognition system, it enables voice-activated control of LED lights through natural language commands.
Demo screencast: `Screencast_20241123_181801.webm`
Clank allows users to control LED lights through simple spoken commands. The system flow is:
- User Speech: Audio is captured via the default microphone using sounddevice
- Voice Activity Detection: Using Silero VAD to detect speech segments
- Transcription: Speech is transcribed into text using Moonshine's speech recognition model
- AI Processing: The text is sent to a locally hosted LLM (running at `127.0.0.1:5000`) for interpretation (see the sketch below)
- LED Control: The LLM returns structured JSON output, which will be used to control LEDs via ESP32 GPIOs
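For reference, here is a minimal sketch of the interpretation step. It assumes the local server exposes an OpenAI-compatible `/v1/chat/completions` endpoint (text-generation-webui's default); the endpoint, prompt, and parsing below are illustrative, not the exact code in `voice_LED_control.py`:

```python
import json
import requests

LLM_URL = "http://127.0.0.1:5000/v1/chat/completions"  # assumed OpenAI-compatible endpoint

SYSTEM_PROMPT = (
    "You control LEDs. Reply only with JSON of the form "
    '{"action": "led_control", "parameters": {"color": ..., "state": ..., "brightness": ...}}'
)

def interpret(transcript: str) -> dict:
    """Send a transcribed command to the local LLM and parse its JSON reply."""
    payload = {
        "messages": [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": transcript},
        ],
        "temperature": 0.1,
    }
    response = requests.post(LLM_URL, json=payload, timeout=30)
    response.raise_for_status()
    reply = response.json()["choices"][0]["message"]["content"]
    return json.loads(reply)  # e.g. {"action": "led_control", ...}

if __name__ == "__main__":
    print(interpret("Computer set blue LED to 50%"))
```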
The project has achieved several key milestones:
- Speech Recognition: Successfully implemented using Moonshine's ONNX models
- Voice Activity Detection: Integrated Silero VAD for accurate speech detection
- Command Processing: LLM successfully generates structured JSON responses for LED control
- Example Response: `{"action": "led_control", "parameters": {"color": "blue", "state": "on", "brightness": 50}}`
- Audio Capture: Uses sounddevice for real-time audio input
- Speech Detection: Silero VAD for precise voice activity detection
- Speech Recognition: Moonshine-powered transcription
- Command Processing: Local LLM interpretation with structured JSON output
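The capture and detection stages above come down to a handful of library calls. As a rough illustration, here is a blocking-record version (the actual script processes audio in a streaming loop, and the recording length here is arbitrary):

```python
import sounddevice as sd
import torch

SAMPLE_RATE = 16_000  # Silero VAD and Moonshine both expect 16 kHz mono audio

# Record a few seconds from the default microphone
seconds = 4
audio = sd.rec(int(seconds * SAMPLE_RATE), samplerate=SAMPLE_RATE,
               channels=1, dtype="float32")
sd.wait()

# Load Silero VAD from torch.hub and find speech segments
vad_model, utils = torch.hub.load("snakers4/silero-vad", "silero_vad", trust_repo=True)
get_speech_timestamps = utils[0]

wav = torch.from_numpy(audio.squeeze())
segments = get_speech_timestamps(wav, vad_model, sampling_rate=SAMPLE_RATE)
print(segments)  # e.g. [{'start': 4320, 'end': 38240}] -> these samples go to Moonshine
```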
- ESP32 Integration: Development of firmware to receive and process LLM commands
- LED Control: GPIO management for LED state and brightness control
- Extended Hardware Control: Support for multiple LED arrays
- Advanced Voice Commands: More complex lighting patterns and scenes
- Web Interface: Configuration and monitoring dashboard
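Since the ESP32 integration above is still on the roadmap, the firmware below is only one possible shape for it, sketched in MicroPython with an assumed color-to-pin mapping; how the JSON reaches the board (serial, Wi-Fi, MQTT) is still an open question:

```python
# MicroPython sketch for an ESP32 (hypothetical wiring and pin mapping)
from machine import Pin, PWM

LED_PINS = {"red": 25, "green": 26, "blue": 27}  # assumed GPIO assignments
channels = {color: PWM(Pin(gpio), freq=1000) for color, gpio in LED_PINS.items()}

def apply_command(cmd):
    """Apply a parsed led_control command to the matching PWM channel."""
    params = cmd["parameters"]
    pwm = channels[params["color"]]
    if params.get("state") == "off":
        pwm.duty(0)
    else:
        # Map 0-100 % brightness onto the ESP32's 10-bit duty range (0-1023)
        pwm.duty(int(params.get("brightness", 100) * 1023 // 100))

apply_command({"action": "led_control",
               "parameters": {"color": "blue", "state": "on", "brightness": 50}})
```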
To get Clank running locally:

- Set the required environment variable: `export KERAS_BACKEND=torch`
- Clone the repository: `git clone https://github.com/cycloarcane/clank.git`, then `cd clank`
- Install Python dependencies: `pip install -r requirements.txt`
- Configure the local LLM: ensure your local LLM server is running at `127.0.0.1:5000` (a quick check is sketched below)
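One quick way to confirm the server is reachable before launching Clank, assuming it exposes an OpenAI-compatible API (as text-generation-webui does with its API extension):

```python
import requests

# List the models the local server reports; a JSON response means it is up.
print(requests.get("http://127.0.0.1:5000/v1/models", timeout=5).json())
```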
Run the voice control script: `python voice_LED_control.py`
Available voice commands:
- "Computer turn on red LED"
- "Computer set blue LED to 50%"
- "Computer turn off green LED"
Project layout:
- `voice_LED_control.py`: Main script for voice capture and processing
- `onnx_model.py`: Moonshine model wrapper for speech recognition
- `requirements.txt`: Python dependencies
- `README.md`: Project documentation
This project builds heavily on the Moonshine speech recognition system and its live_captions demo. Special thanks to the Moonshine team:
@misc{jeffries2024moonshinespeechrecognitionlive,
title={Moonshine: Speech Recognition for Live Transcription and Voice Commands},
author={Nat Jeffries and Evan King and Manjunath Kudlur and Guy Nicholson and James Wang and Pete Warden},
year={2024},
eprint={2410.15608},
archivePrefix={arXiv},
primaryClass={cs.SD},
url={https://arxiv.org/abs/2410.15608},
}
Contributions are welcome! If you'd like to help build Clank, please submit a pull request or open an issue for any feature requests or bug fixes.
For questions or support:
- Email: [email protected]
- GitHub: cycloarcane
This project is licensed under a modified, non-commercial GNU GPL v3.0 license.
Join us in building the future of voice-controlled lighting! 🎤💡