Audio::MFCC 0.0801 review

Download

Audio::MFCC is a Perl module for computing mel-frequency cepstral coefficients. SYNOPSIS use Audio::MFCC; my $fe = Audio::

License:	GPL (GNU General Public License)
File size:	88K
Developer:	David Huggins-Daines

Audio::MFCC is a Perl module for computing mel-frequency cepstral coefficients.

SYNOPSIS

use Audio::MFCC;
my $fe = Audio::MFCC->init(%params)
$fe->start_utt;
my @ceps = $fe->process_utt($rawdata, $nsamps);
my $leftover = $fe->end_utt;

This module provides an interface to the Sphinx-II feature extraction library which can be used to extract mel-frequency cepstral coefficients from data. These coefficients can then be passed to the Speech::Recognizer::SPX::uttproc_cepdata function.
You might find this useful if, for example, you wish to do the actual recognition on a different machine from the audio capture, and don't have the bandwidth to send a full stream of audio data over the network.

Currently, Sphinx-II also uses delta and double-delta cepstral vectors as input to its vector quantization module, but the calculation of these values is done inside the recognizer's utterance processing module.. In the future it may be possible to move the extraction of these features into the feature extraction library, or to use entirely different features as input (for example, LPC coefficients, though currently, mel-scale cepstra give the best recognition performance).

Requirements:
Perl

Audio::MFCC 0.0801 screenshot
Zoom

Audio::MFCC 0.0801 search tags

Audio::MFCC 0.0801: Audio::MFCC is a Perl module for computing mel-frequency cepstral coefficients. SYNOPSIS use Audio::MFCC; my $fe = Audio::
Speech::Recognizer::SPX::Server 0.0801: Speech::Recognizer::SPX::Server is a Perl module for writing streaming audio speech recognition servers using Sphinx2. SYNOPSIS
Audio::SPX 0.0801: Audio::SPX is a Perl interface to the Sphinx-II audio library. SYNOPSIS use Audio::SPX; my $ad = Audio::SPX->open_sps(1600
Audio::ESD 0.02: Audio::ESD is a Perl extension for talking to the Enlightened Sound Daemon. SYNOPSIS use Audio::ESD; my $stream = Audio::E
Audio::M4P::QuickTime 0.30: Audio::M4P::QuickTime is a Perl module for m4p/mp4/m4a Quicktime audio files. Perl manipulation of Quicktime Audio files, includin
Audio::DB::Web 0.01: Audio::DB::Web is a Perl module that assists in web-based queries of an MP3 Database. SYNOPSIS use Audio::DB::Web; my $mp3
gnome-speech 0.4.7: GNOME Speech's purpose is to provide a simple general API for producing text-to-speech output. The GNOME Speech 1.0 API is cu

Audio::MFCC 0.0801 review

Alternative/similar