Audio::MFCC 0.0801 review
DownloadAudio::MFCC is a Perl module for computing mel-frequency cepstral coefficients. SYNOPSIS use Audio::MFCC; my $fe = Audio::
|
|
Audio::MFCC is a Perl module for computing mel-frequency cepstral coefficients.
SYNOPSIS
use Audio::MFCC;
my $fe = Audio::MFCC->init(%params)
$fe->start_utt;
my @ceps = $fe->process_utt($rawdata, $nsamps);
my $leftover = $fe->end_utt;
This module provides an interface to the Sphinx-II feature extraction library which can be used to extract mel-frequency cepstral coefficients from data. These coefficients can then be passed to the Speech::Recognizer::SPX::uttproc_cepdata function.
You might find this useful if, for example, you wish to do the actual recognition on a different machine from the audio capture, and don't have the bandwidth to send a full stream of audio data over the network.
Currently, Sphinx-II also uses delta and double-delta cepstral vectors as input to its vector quantization module, but the calculation of these values is done inside the recognizer's utterance processing module.. In the future it may be possible to move the extraction of these features into the feature extraction library, or to use entirely different features as input (for example, LPC coefficients, though currently, mel-scale cepstra give the best recognition performance).
Requirements:
Perl
Audio::MFCC 0.0801 search tags