Audio::MFCC 0.0801 review

Download
by rbytes.net on

Audio::MFCC is a Perl module for computing mel-frequency cepstral coefficients. SYNOPSIS use Audio::MFCC; my $fe = Audio::

License: GPL (GNU General Public License)
File size: 88K
Developer: David Huggins-Daines
0 stars award from rbytes.net

Audio::MFCC is a Perl module for computing mel-frequency cepstral coefficients.

SYNOPSIS

use Audio::MFCC;
my $fe = Audio::MFCC->init(%params)
$fe->start_utt;
my @ceps = $fe->process_utt($rawdata, $nsamps);
my $leftover = $fe->end_utt;

This module provides an interface to the Sphinx-II feature extraction library which can be used to extract mel-frequency cepstral coefficients from data. These coefficients can then be passed to the Speech::Recognizer::SPX::uttproc_cepdata function.
You might find this useful if, for example, you wish to do the actual recognition on a different machine from the audio capture, and don't have the bandwidth to send a full stream of audio data over the network.

Currently, Sphinx-II also uses delta and double-delta cepstral vectors as input to its vector quantization module, but the calculation of these values is done inside the recognizer's utterance processing module.. In the future it may be possible to move the extraction of these features into the feature extraction library, or to use entirely different features as input (for example, LPC coefficients, though currently, mel-scale cepstra give the best recognition performance).

Requirements:
Perl

Audio::MFCC 0.0801 search tags