speechcontrol-devel team mailing list archive

Thread
Date

Re: Calling Developers to Their Stations

To: Bill Cox <waywardgeek@xxxxxxxxx>
From: Jacky Alcine <jackyalcine@xxxxxxxxx>
Date: Wed, 12 Jan 2011 19:58:09 -0500
Cc: speechcontrol-devel@xxxxxxxxxxxxxxxxxxx
In-reply-to: <AANLkTi==x+8F03CxJp7_SRkijjNQnC9w7u4PGX3DCN+f@mail.gmail.com>

I support that, Bill, if anything, keep us posted with your work. I'll look
at your work now.

On Wed, Jan 12, 2011 at 5:35 PM, Bill Cox <waywardgeek@xxxxxxxxx> wrote:

> I forgot to mention the new sonic home page:
>
> http://vinux-project.org/sonic
>
> The adoption rate seems quite strong.  It's going into TTS engines and
> Audio book applications, and devices for the blind.
>
> Bill
>
> On Wed, Jan 12, 2011 at 5:30 PM, Bill Cox <waywardgeek@xxxxxxxxx> wrote:
> > On Wed, Jan 12, 2011 at 3:37 PM, Jacky Alcine <jackyalcine@xxxxxxxxx>
> wrote:
> >> I need for pedro3005, webrsk, waywardgeek, m0hi, and bedahr to either
> take
> >> participation in the python-openmary project or the speechcontrol-daemon
> >> project.
> >
> > Hi, Jacky.  I personally feel the weak link in speech control is the
> > non-distributable nature of some speech recognition code, and the lack
> > of productization in Sphinx.  I may be wrong, but I believe I can
> > write a very good quality speech recognition engine that could make a
> > huge difference to open-source speech control.  If you don't mind, I'd
> > like to continue with this work.  To date, it's resulted in libsonic
> > for speeding up speech with low distortion.  The next big step will be
> > isolated word recognition.  I've done a ton of work on cleaning up
> > spectrograms, and I believe I have the best algorithms anywhere, other
> > than potential trade-secret algorithms.  Check out my web page on
> > generating spectrograms:
> >
> > http://vinux-project.org/time-aliased-hann/
> >
> > In addition to improved spectrograms, I believe I can write code to
> > fairly accurately annotate the speech stream with voice events:
> > glottal open, plosive open, stops, fricative begin/end, etc.  I think
> > I can combine evidence from both the time domain and frequency domain
> > to determine what kind of fricatives and plosives are present in the
> > sound stream.  I'm hopeful that the combination of improved spectral
> > analysis and time domain analysis will yield better results than we've
> > seen in any system to date.
> >
> > In short, I'd like to keep working full steam ahead on this.  I can do
> > debian packaging and such, but I'd like my big project to be the
> > speech recognition engine.
> >
> > Bill
> >
>
> _______________________________________________
> Mailing list: https://launchpad.net/~speechcontrol-devel
> Post to     : speechcontrol-devel@xxxxxxxxxxxxxxxxxxx
> Unsubscribe : https://launchpad.net/~speechcontrol-devel
> More help   : https://help.launchpad.net/ListHelp
>

Follow ups

Re: Calling Developers to Their Stations
From: Jacky Alcine, 2011-01-13

References

Calling Developers to Their Stations
From: Jacky Alcine, 2011-01-12
Re: Calling Developers to Their Stations
From: Bill Cox, 2011-01-12
Re: Calling Developers to Their Stations
From: Bill Cox, 2011-01-12