SPTK 4.0 (Under Construction)

The Speech Signal Processing Toolkit (SPTK) is a suite of speech signal processing tools for UNIX environments.

Documentation

See this page for a reference manual.

Requirements

GCC 4.8+

Installation

The latest version can be built from source after cloning the repository with Git:

git clone https://github.com/sp-nitech/SPTK.git
cd SPTK
make -j 4  # Please change the number of jobs depending on your environment.

The SPTK commands can then be used by adding the SPTK/bin/ directory to the PATH environment variable. If you would like to use part of the SPTK library, please link the static library SPTK/lib/libsptk.a.
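As a minimal sketch of the PATH setup described above, assuming the repository was cloned into your home directory (the SPTK_ROOT variable is introduced here only for illustration):

```sh
# Assumption: SPTK was cloned and built in $HOME/SPTK; adjust SPTK_ROOT otherwise.
SPTK_ROOT="$HOME/SPTK"

# Put the SPTK commands ahead of other entries on PATH for this shell session.
export PATH="$SPTK_ROOT/bin:$PATH"

# Linking the static library might look like the line below. my_program.cc is a
# hypothetical source file, and the -std and -I options are assumptions about a
# typical build, not taken from this README.
# g++ -std=c++11 -I "$SPTK_ROOT/include" my_program.cc "$SPTK_ROOT/lib/libsptk.a" -o my_program
```

To make the PATH change permanent, the export line can be added to your shell startup file.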

Examples

SPTK provides some examples. Go to an example directory and execute run.sh, e.g.,

cd egs/analysis_synthesis/mgc
./run.sh

Below is a simple example that decreases the volume of the input audio by multiplying every sample by 0.5. You may need to install the sox command on your system.

sox -t wav input.wav -c 1 -t s16 -r 16000 - |
    x2x +sd | sopr -m 0.5 | x2x +ds -r |
    sox -c 1 -t s16 -r 16000 - -t wav output.wav
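The same pipeline could be wrapped in a small shell function so that the gain becomes a parameter. This is only a sketch: scale_volume is a hypothetical helper name, not an SPTK command, and it assumes sox, x2x, and sopr are on the PATH.

```sh
# Hypothetical helper (not part of SPTK): scale the volume of a WAV file.
# Usage: scale_volume input.wav output.wav [gain]   (gain defaults to 0.5)
scale_volume() {
    in_wav=$1
    out_wav=$2
    gain=${3:-0.5}

    # Fail early with status 127 if a required command is missing.
    for cmd in sox x2x sopr; do
        command -v "$cmd" >/dev/null 2>&1 || {
            echo "scale_volume: $cmd not found" >&2
            return 127
        }
    done

    # Decode to 16 kHz mono 16-bit samples, convert short to double,
    # multiply by the gain, convert back to short with rounding, re-encode.
    sox -t wav "$in_wav" -c 1 -t s16 -r 16000 - |
        x2x +sd | sopr -m "$gain" | x2x +ds -r |
        sox -c 1 -t s16 -r 16000 - -t wav "$out_wav"
}
```

For example, `scale_volume input.wav output.wav 0.25` would write a quieter copy than the default gain.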

If you would like to draw figures, please prepare a Python environment.

cd tools; make env; cd ..
. ./tools/venv/bin/activate
impulse -l 32 | gseries impulse.png
deactivate

Changes from SPTK3

Input and output types are changed to double from float
Drawing commands are implemented in Python
No memory leaks
New features:
- Provide signal processing classes written in C++
- Conversion from/to log area ratio (lar2par and par2lar)
- Entropy calculation (entropy)
- Huffman coding (huffman, huffman_encode, and huffman_decode)
- Mel-cepstrum postfilter (mcpf)
- Mel-filter-bank extraction (fbank)
- Nonrecursive MLPG (mlpg -R 1)
- Pitch extraction by DIO used in WORLD (pitch -a 3)
- Scalar quantization (quantize and dequantize)
- Stability check of LPC coefficients (lpccheck)
- Subband decomposition (pqmf and ipqmf)
Obsoleted commands:
- acep, agcep, and amcep -> amgcep
- bell
- c2sp -> mgc2sp
- cat2 and echo2
- da
- ds, us, us16, and uscd -> sox
- fig
- gc2gc -> mgc2mgc
- gcep, mcep, and uels -> mgcep
- glsadf, lmadf, and mlsadf -> mglsadf
- ivq and vq -> imsvq and msvq
- lsp2sp -> mglsp2sp
- mgc2mgclsp and mgclsp2mgc
- psgr and xgr
- raw2wav, wav2raw, wavjoin, and wavsplit -> sox
Separated commands:
- dtw -> dtw and dtw_merge
- mglsadf -> mglsadf and imglsadf
- train -> train and mseq
- ulaw -> ulaw and iulaw
- vstat -> vstat and median
Renamed commands:
- c2ir -> c2mpir
- mgclsp2sp -> mglsp2sp

Relationship at a glance

Authors

Keiichi Tokuda - Produce and Design - Nagoya Institute of Technology
Keiichiro Oura - Nagoya Institute of Technology
Takenori Yoshimura - Main Maintainer - Nagoya Institute of Technology
Takato Fujimoto - Nagoya Institute of Technology
Yoshihiko Nankaku - Nagoya Institute of Technology

Contributors

Cassia Valentini - The University of Edinburgh
- Calculated the coefficients of the 6th- and 7th-order modified Pade approximation.

License

This software is released under the Apache License 2.0.
