bark.cpp

Roadmap / encodec.cpp / ggml

Inference of SunoAI's bark model in pure C/C++.

Description

With bark.cpp, our goal is to bring real-time realistic multilingual text-to-speech generation to the community.

Plain C/C++ implementation without dependencies
AVX, AVX2 and AVX512 for x86 architectures
CPU and GPU compatible backends
Mixed F16 / F32 precision
4-bit, 5-bit and 8-bit integer quantization
Metal and CUDA backends

Models supported

Models we want to implement! Please open a PR :)

Demo on Google Colab (#95)

Here is a typical run using bark.cpp:

make -j && ./main -p "This is an audio generated by bark.cpp"

   __               __
   / /_  ____ ______/ /__        _________  ____
  / __ \/ __ `/ ___/ //_/       / ___/ __ \/ __ \
 / /_/ / /_/ / /  / ,<    _    / /__/ /_/ / /_/ /
/_.___/\__,_/_/  /_/|_|  (_)   \___/ .___/ .___/
                                  /_/   /_/

bark_tokenize_input: prompt: 'This is an audio generated by bark.cpp'
bark_tokenize_input: number of tokens in prompt = 513, first 8 tokens: 20795 20172 20199 33733 58966 20203 28169 20222

Generating semantic tokens: [========>                                          ] (17%)

bark_print_statistics:   sample time =    10.98 ms / 138 tokens
bark_print_statistics:  predict time =   614.96 ms / 4.46 ms per token
bark_print_statistics:    total time =   633.54 ms

Generating coarse tokens: [==================================================>] (100%)

bark_print_statistics:   sample time =     3.75 ms / 410 tokens
bark_print_statistics:  predict time =  3263.17 ms / 7.96 ms per token
bark_print_statistics:    total time =  3274.00 ms

Generating fine tokens: [==================================================>] (100%)

bark_print_statistics:   sample time =    38.82 ms / 6144 tokens
bark_print_statistics:  predict time =  4729.86 ms / 0.77 ms per token
bark_print_statistics:    total time =  4772.92 ms

write_wav_on_disk: Number of frames written = 65600.

main:     load time =   324.14 ms
main:     eval time =  8806.57 ms
main:    total time =  9131.68 ms

Here are typical audio pieces generated by bark.cpp:

audio1.mp4

audio2.mp4

Usage

Here are the steps to use Bark.cpp

Get the code

git clone --recursive https://github.com/PABannier/bark.cpp.git
cd bark.cpp
git submodule update --init --recursive

Build

In order to build bark.cpp you must use CMake:

mkdir build
cd build
cmake ..
cmake --build . --config Release

Prepare data & Run

# Install Python dependencies
python3 -m pip install -r requirements.txt

# Download the Bark checkpoints and vocabulary
python3 download_weights.py --out-dir ./models --models bark-small bark

# Convert the model to ggml format
python3 convert.py --dir-model ./models/bark-small --use-f16

# run the inference
./build/examples/main/main -m ./models/bark-small/ggml_weights.bin -p "this is an audio generated by bark.cpp" -t 4

(Optional) Quantize weights

Weights can be quantized using the following strategy: q4_0, q4_1, q5_0, q5_1, q8_0.

Note that to preserve audio quality, we do not quantize the codec model. The bulk of the computation is in the forward pass of the GPT models.

./build/examples/quantize/quantize ./ggml_weights.bin ./ggml_weights_q4.bin q4_0

Seminal papers

Bark
- Text Prompted Generative Audio
Encodec
- High Fidelity Neural Audio Compression
GPT-3
- Language Models are Few-Shot Learners

Contributing

bark.cpp is a continuous endeavour that relies on the community efforts to last and evolve. Your contribution is welcome and highly valuable. It can be

bug report: you may encounter a bug while using bark.cpp. Don't hesitate to report it on the issue section.
feature request: you want to add a new model or support a new platform. You can use the issue section to make suggestions.
pull request: you may have fixed a bug, added a features, or even fixed a small typo in the documentation, ... you can submit a pull request and a reviewer will reach out to you.

Coding guidelines

Avoid adding third-party dependencies, extra files, extra headers, etc.
Always consider cross-compatibility with other operating systems and architectures

Name	Name	Last commit message	Last commit date
Latest commit przemoc ci : add building in MSYS2 environments (Windows) (PABannier#153 ) Apr 22, 2024 d8bd0fd · Apr 22, 2024 History 92 Commits
.github/workflows	.github/workflows	ci : add building in MSYS2 environments (Windows) (PABannier#153 )	Apr 22, 2024
.vscode	.vscode	sync: use `encodec`'s latest version as a submodule (PABannier#124 )	Feb 13, 2024
assets	assets	doc/mnt: update README with proper cloning instructions + banner (PAB…	Oct 29, 2023
encodec.cpp @ 68e1801	encodec.cpp @ 68e1801	model : supporting Bark small model (PABannier#149 )	Apr 20, 2024
examples	examples	cmake : fix server example building on MSYS2 environments (Windows) (P…	Apr 22, 2024
.gitignore	.gitignore	Minor improvements (PABannier#147 )	Apr 16, 2024
.gitmodules	.gitmodules	sync: use `encodec`'s latest version as a submodule (PABannier#124 )	Feb 13, 2024
CMakeLists.txt	CMakeLists.txt	dbg: fix the forward pass (PABannier#139 )	Apr 10, 2024
LICENSE	LICENSE	DOC Add license file (PABannier#25 )	Jul 30, 2023
README.md	README.md	model : supporting Bark small model (PABannier#149 )	Apr 20, 2024
bark.cpp	bark.cpp	model : supporting Bark small model (PABannier#149 )	Apr 20, 2024
bark.h	bark.h	model : supporting Bark small model (PABannier#149 )	Apr 20, 2024
convert.py	convert.py	model : supporting Bark small model (PABannier#149 )	Apr 20, 2024
download_weights.py	download_weights.py	model : supporting Bark small model (PABannier#149 )	Apr 20, 2024
requirements.txt	requirements.txt	mnt: add back requirements.txt (PABannier#141 )	Apr 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bark.cpp

Description

Usage

Get the code

Build

Prepare data & Run

(Optional) Quantize weights

Seminal papers

Contributing

Coding guidelines

About

Releases

Packages

Languages

License

ochafik/bark.cpp

Folders and files

Latest commit

History

Repository files navigation

bark.cpp

Description

Usage

Get the code

Build

Prepare data & Run

(Optional) Quantize weights

Seminal papers

Contributing

Coding guidelines

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages