The Speech Signal Processing Toolkit (SPTK) is a software for speech signal processing tools for UNIX environments.
See this page for a reference manual.
- GCC 4.8+
The latest release can be installed through Git:
git clone https://github.com/sp-nitech/SPTK.git
cd SPTK
make -j 4 # Please change the number of jobs depending on your environment.
Then the SPTK commands can be used by adding SPTK/bin/
directory to the PATH
environment variable.
If you would like to use a part of the SPTK library, please link the static library SPTK/lib/libsptk.a
.
The SPTK provides some examples.
Go to an example directory and execute run.sh
, e.g.,
cd egs/analysis_synthesis/mgc
./run.sh
The below is a simple example that decreases the volume of input audio.
You may need to install sox
command on your system.
sox -t wav input.wav -c 1 -t s16 -r 16000 - |
x2x +sd | sopr -m 0.5 | x2x +ds -r |
sox -c 1 -t s16 -r 16000 - -t wav output.wav
If you would like to draw figures, please prepare a python environment.
cd tools; make env; cd ..
. ./tools/venv/bin/activate
impulse -l 32 | gseries impulse.png
deactivate
- Input and output types are changed to double from float
- Drawing commands are implemented in Python
- No memory leaks
- New features:
- Provide signal processing classes written in C++
- Conversion from/to log area ratio (
lar2par
andpar2lar
) - Entropy calculation (
entropy
) - Huffman coding (
huffman
,huffman_encode
, andhuffman_decode
) - Mel-cepstrum postfilter (
mcpf
) - Mel-filter-bank extraction (
fbank
) - Nonrecursive MLPG (
mlpg -R 1
) - Pitch extraction by DIO used in WORLD (
pitch -a 3
) - Scalar quantization (
quantize
anddequantize
) - Stability check of LPC coefficients (
lpccheck
) - Subband decomposition (
pqmf
andipqmf
)
- Obsoleted commands:
acep
,agcep
, andamcep
->amgcep
bell
c2sp
->mgc2sp
cat2
andecho2
da
ds
,us
,us16
, anduscd
->sox
fig
gc2gc
->mgc2mgc
gcep
,mcep
, anduels
->mgcep
glsadf
,lmadf
, andmlsadf
->mglsadf
ivq
andvq
->imsvq
andmsvq
lsp2sp
->mglsp2sp
mgc2mgclsp
andmgclsp2mgc
psgr
andxgr
raw2wav
,wav2raw
,wavjoin
, andwavsplit
->sox
- Separated commands:
dtw
->dtw
anddtw_merge
mglsadf
->mglsadf
andimglsadf
train
->train
andmseq
ulaw
->ulaw
andiulaw
vstat
->vstat
andmedian
- Renamed commands:
c2ir
->c2mpir
mgclsp2sp
->mglsp2sp
- Keiichi Tokuda - Produce and Design - Nagoya Institute of Technology
- Keiichiro Oura - Nagoya Institute of Technology
- Takenori Yoshimura - Main Maintainer - Nagoya Institute of Technology
- Takato Fujimoto - Nagoya Institute of Technology
- Yoshihiko Nankaku - Nagoya Institute of Technology
- Cassia Valentini - The University of Edinburgh
- Calculated the coefficients of the 6th- and 7th-order modified Pade approximation.
This software is released under the Apache License 2.0.