SuperCollider Forum
May 23, 2013, 04:26:35 AM *
Welcome, Guest. Please login or register.

Login with username, password and session length
News: The SuperCollider forum is currently experiencing a rash of spambot registrations. New user requests may not be approved quickly as a result -- and if your email address looks like spam, it might not be approved at all. We are working to improve the security of the registration process, and to provide alternate means to contact the administrators to get a new account. Thanks for your patience.
 
   Home   Help Search Calendar Login Register  
Pages: [1]
  Print  
Author Topic: best approach towards speech analysis?  (Read 3635 times)
0 Members and 1 Guest are viewing this topic.
linked_void
Newbie
*

Karma: 0
Posts: 7


View Profile
« on: July 30, 2007, 08:12:56 PM »

i've been asked to help on the audio front with a friend's project. the requirement is to have a program which accepts an audio file - in this case a speech, and have it return a directory of files, one for each word of the input file.  i'm wondering how others may have approached this task and what, if any, classes are out there to use. any info greatly appreciated, thanks all!
Logged
dewdrop_world
AdminGroup
Full Member
*

Karma: 9
Posts: 193



View Profile WWW
« Reply #1 on: January 20, 2008, 04:17:51 PM »

I don't remember any discussions on the mailing list about speech recognition. It's a very interesting topic for me -- because of chronic pain problems, I've used dictation software for years (in fact, I'm writing this using iListen from MacSpeech) -- but I haven't had time to explore it myself.

This thread on the electro-music.com forum has some useful pointers -- that guy is using chuck rather than SuperCollider, but the concepts are valid no matter what language you use.

http://electro-music.com/forum/viewtopic.php?t=21647

A lot of audio analysis techniques convert frequency data to the Mel scale, because raw FFT data over emphasize high frequency content. Recently the MFCC UGen was created to produce "mel-frequency cepstral coefficients" from a basic FFT analysis. You might find this very useful.

Be patient with yourself -- this is hard, hard stuff to wrap your head around. I hope these are useful pointers in any case.

hjh
Logged

Pages: [1]
  Print  
 
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.4 | SMF © 2006-2007, Simple Machines LLC Valid XHTML 1.0! Valid CSS!