The Speech Signal Processing Toolkit (SPTK) is a software for speech signal processing tools for UNIX environments.
See this page for a reference manual.
- GCC 4.8+
The latest release can be installed through Git:
git clone https://github.com/sp-nitech/SPTK.git
cd SPTK
make
Then the SPTK commands can be used by adding SPTK/bin/
directory to the PATH
environment variable.
If you would like to use a part of the SPTK library, please link the static library SPTK/lib/libsptk.a
.
The SPTK provides some examples.
Go to an example directory and execute run.sh
, e.g.,
cd egs/analysis_synthesis/mgc
./run.sh
The below is a simple example that decreases the volume of input audio.
You may need to install sox
command on your system.
sox -t wav input.wav -c 1 -t s16 -r 16000 - | \
x2x +sd | sopr -m 0.5 | x2x +ds -r | \
sox -c 1 -t s16 -r 16000 - -t wav output.wav
- Input and output types are changed to double from float
- New features:
- Provide signal processing classes written in C++
- Support conversion from/to log area ratio (
lar2par
andpar2lar
) - Support nonrecursive MLPG (
mlpg
) - Support pitch extraction by DIO used in WORLD (
pitch
) - Entropy calculation (
entropy
) - Huffman coding (
huffman
,huffman_encode
, andhuffman_decode
) - Subband decomposition (
pqmf
andipqmf
) - Mel-filter-bank extraction (
fbank
)
- Obsoluted commands:
acep
,agcep
, andamcep
->amgcep
bell
->c2sp
->mgc2sp
cat2
andecho2
->da
->ds
,us
,us16
, anduscd
->sox
gc2gc
->mgc2mgc
gcep
,mcep
, anduels
->mgcep
glsadf
,lmadf
, andmlsadf
->mglsadf
ivq
->imsvq
lsp2sp
->mglsp2sp
mgc2mgclsp
andmgclsp2mgc
->mgc2mgc
raw2wav
,wav2raw
,wavjoin
, andwavsplit
->sox
smcep
->vq
->msvq
- Separated commands:
dtw
->dtw
anddtw_merge
mglsadf
->mglsadf
andimglsadf
train
->train
andmseq
ulaw
->ulaw
andiulaw
vstat
->vstat
andmedian
- Renamed commands:
c2ir
->c2mpir
mgclsp2sp
->mglsp2sp
- Keiichi Tokuda - Produce and Design - Nagoya Institute of Technology
- Keiichiro Oura - Nagoya Institute of Technology
- Takenori Yoshimura - Main Maintainer - Nagoya Institute of Technology
- Takato Fujimoto - Nagoya Institute of Technology
- Yoshihiko Nankaku - Nagoya Institute of Technology
- Cassia Valentini - The University of Edinburgh
- Calculated the coefficients of the 6th- and 7th-order modified Pade approximation.
This software is released under the Modified BSD license - see LICENSE file for details.