speechcontrol-devel team mailing list archive

Re: Calling Developers to Their Stations

To: Bill Cox <waywardgeek@xxxxxxxxx>
From: Jacky Alcine <jackyalcine@xxxxxxxxx>
Date: Thu, 13 Jan 2011 01:17:29 -0500
Cc: speechcontrol-devel@xxxxxxxxxxxxxxxxxxx
In-reply-to: <AANLkTikKKBP_Jb111AP85kt40bOVjCXiYdFJWHCfx6mN@mail.gmail.com>

"Sonic is still a new library, and has not yet been incorporated into Debian
or other major distros. For now, feel free to simply add sonic.c and sonic.h
to your application, but consider switching to -lsonic once the library is
available on your distro."

Bill, if need be, SpeechControl can build a DEB package of Sonic for
redistribution purposes.

On Wed, Jan 12, 2011 at 7:58 PM, Jacky Alcine <jackyalcine@xxxxxxxxx> wrote:

> I support that, Bill. If anything, keep us posted on your work. I'll look
> at it now.
>
> On Wed, Jan 12, 2011 at 5:35 PM, Bill Cox <waywardgeek@xxxxxxxxx> wrote:
>
>> I forgot to mention the new sonic home page:
>>
>> http://vinux-project.org/sonic
>>
>> The adoption rate seems quite strong.  It's going into TTS engines,
>> audiobook applications, and devices for the blind.
>>
>> Bill
>>
>> On Wed, Jan 12, 2011 at 5:30 PM, Bill Cox <waywardgeek@xxxxxxxxx> wrote:
>> > On Wed, Jan 12, 2011 at 3:37 PM, Jacky Alcine <jackyalcine@xxxxxxxxx> wrote:
>> >> I need pedro3005, webrsk, waywardgeek, m0hi, and bedahr to take part
>> >> in either the python-openmary project or the speechcontrol-daemon
>> >> project.
>> >
>> > Hi, Jacky.  I personally feel the weak link in speech control is the
>> > non-distributable nature of some speech recognition code, and the lack
>> > of productization in Sphinx.  I may be wrong, but I believe I can
>> > write a very good quality speech recognition engine that could make a
>> > huge difference to open-source speech control.  If you don't mind, I'd
>> > like to continue with this work.  To date, it's resulted in libsonic
>> > for speeding up speech with low distortion.  The next big step will be
>> > isolated word recognition.  I've done a ton of work on cleaning up
>> > spectrograms, and I believe I have the best algorithms anywhere, other
>> > than potential trade-secret algorithms.  Check out my web page on
>> > generating spectrograms:
>> >
>> > http://vinux-project.org/time-aliased-hann/
>> >
>> > In addition to improved spectrograms, I believe I can write code to
>> > fairly accurately annotate the speech stream with voice events:
>> > glottal open, plosive open, stops, fricative begin/end, etc.  I think
>> > I can combine evidence from both the time domain and frequency domain
>> > to determine what kind of fricatives and plosives are present in the
>> > sound stream.  I'm hopeful that the combination of improved spectral
>> > analysis and time domain analysis will yield better results than we've
>> > seen in any system to date.
>> >
>> > In short, I'd like to keep working full steam ahead on this.  I can do
>> > debian packaging and such, but I'd like my big project to be the
>> > speech recognition engine.
>> >
>> > Bill
>> >
>>
>> _______________________________________________
>> Mailing list: https://launchpad.net/~speechcontrol-devel
>> Post to     : speechcontrol-devel@xxxxxxxxxxxxxxxxxxx
>> Unsubscribe : https://launchpad.net/~speechcontrol-devel
>> More help   : https://help.launchpad.net/ListHelp
>>
>
>

References

Calling Developers to Their Stations
From: Jacky Alcine, 2011-01-12
Re: Calling Developers to Their Stations
From: Bill Cox, 2011-01-12
Re: Calling Developers to Their Stations
From: Bill Cox, 2011-01-12
Re: Calling Developers to Their Stations
From: Jacky Alcine, 2011-01-13