A minimal, native macOS app that transcribes YouTube videos and local video files with exceptional accuracy. Simple, fast, and elegant.
- Easy YouTube Integration: Just paste (⌘V) any YouTube URL
- Local Video Support: Drag & drop or click to upload local video files
- Smart Formatting: Automatically formats transcriptions into readable paragraphs
- Multiple Export Options: Copy or save transcriptions with a single click
- Progress Tracking: Real-time progress bars for both download and transcription
- Native Performance: Built with SwiftUI for optimal macOS integration
- Dark and Light Modes: Seamless integration with your system preferences
1. Download the App: Get the latest version from the releases page.
2. Install Prerequisites: Vocal requires Python, Homebrew, and yt-dlp for downloading and processing YouTube videos.
To install Python on macOS, open your Terminal and use the following command:
brew install python
If you don’t have Homebrew, you can install it by running:
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
Finally, install yt-dlp using Homebrew:
brew install yt-dlp
3. Launch Vocal: Once prerequisites are installed, open Vocal and transcribe your videos!
- Built with SwiftUI and AVFoundation
- Uses Apple's Speech Recognition framework for high-quality transcription
- Integrated with yt-dlp for reliable YouTube video downloading
- Native macOS window management and system integration
- Content Creation: Quickly transcribe video content for blogs or articles
- Research: Convert video interviews or lectures into searchable text
- Accessibility: Make video content accessible through text
- Note Taking: Transform video lessons into written notes
We welcome contributions! Here's how you can help:
- Clone the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
Please ensure your PR:
- Maintains the minimal, focused design philosophy
- Follows the existing code style
- Includes appropriate tests
- Updates documentation as needed
- Multi-language support
- Timestamp support
- Advanced export formats (PDF, SRT, VTT)
- Video mini-player while transcribing
- Quick edit mode for transcriptions
This project is licensed under the MIT License - see the LICENSE file for details.
- Website: Nuanc.me
- Report issues: GitHub Issues
- Follow updates: @Nuancedev
- yt-dlp for YouTube video downloading
- Apple's Speech Recognition framework for transcription
- The open source community for inspiration and support