[Asterisk-Dev] Voice detection

Brad Waite brad at wcubed.net
Fri Oct 3 06:41:02 MST 2003


Chris,

Have you stumbled across any VOX code that would be simpler to introduce to * 
than Sphinx?

Brad Waite

Chris Albertson wrote:

> The "magic words" to use for Google are "VOX DSP Algorithm"
> "VOX" means "Voice Operated Switch" and is very common in
> simplex radio.  It is a kind of automatic "push to talk switch"
> 
> Yes Spinx would have something like this some place in it but
> then so would any teliphony codec that did silence surpression.
> 
> The basic idea is a compute the power in the audio, apply a _very_
> low (on order of about 1Hz) pass filter and threshold the result.
> You may want two thresholds one for on one for off to provide
> stability. (Remember the idea of a dead band in servo systems?)
> Filter responce and the threshholds should be user adjustable
> 
> Yes, I've been looking a spinx for some time now.  I think it is
> "ready for prime time" for simple applcations like voice dialing
> a phone number or yes/no to an IVR system.  
> 
> --- Brad Waite <brad at wcubed.net> wrote:
> 
>>Steven Critchfield wrote:
>>
>>
>>>On Thu, 2003-10-02 at 21:36, Brad Waite wrote:
>>>
>>>
>>>>Does anyone know if there's public voice detection algorithms
>>
>>available?  I've 
>>
>>>>scoured the net for the last hour or so, and I can't come up with
>>
>>anything 
>>
>>>>except a few proprietary or embedded solutions.
>>>>
>>>>I know dsp.c uses goertzel algorithms for DTMF detection, but how
>>
>>does one 
>>
>>>>detect voice?
>>>>
>>>>I dunno, maybe detecting voice isn't the way to go.  I want to
>>
>>begin playback of 
>>
>>>>a file after a phone/answering machine has answered.
>>>
>>>
>>>Argghh telemarketing....
>>
>>Argghh voice mail notification...  (among other non-telemarketing
>>things)  :)
>>
>>
>>>shouldn't you detect silence and then start?
>>
>>Possibly, but I would think that between the last ring and the speech
>>there 
>>could be enough silence to return a false positive.
>>
>>As it turns out, it appears that Festival's cousin, Sphinx, might be
>>the ticket. 
>>  Sphinx not only detects voice, but is a full Open Source speech
>>recognition 
>>engine.  In my mind, that opens up worlds of possibilities with *. 
>>Anyone else 
>>up for it?
>>
>>BTW, more on Sphinx can be found at
>>http://fife.speech.cs.cmu.edu/sphinx/.
>>
>>Brad Waite





More information about the asterisk-dev mailing list