tamilspellchecker team mailing list archive
Message #00003
Re: code updated to launchpad and mit demo
Hello Elanjelian,
On Sat, Jan 31, 2009 at 9:32 PM, Elanjelian Venugopal <tamiliam@xxxxxxxxx> wrote:
> Dear Selvam,
>
> Just wondering ... there was already an attempt to develop a spellchecker
> using hunspell that sort of works in OOo 2.0. This appears like a new
> attempt. Any particular reason why?
Thank you for your response. I am a final-year engineering student (CSE).
I initially wanted to improve aspell's Tamil spell checking, but I did not
know how to implement Tamil grammar with aspell or add GUI support.
At that time I came to know about a Malayalam spell checker, a gedit
plugin developed by thottilingam in Python. So I decided to develop a
spell-checking engine (tamilspell.py) that can be reused with any editor.
As a first step I integrated it with gedit. Next will be OpenOffice.
>
> Anyway, about four years ago, I was involved in developing a Tamil
> spellchecker using aspell. We put together a list of almost 14,000 unique
> Tamil words, which is attached. The document could also be obtained here:
> http://wiki.services.openoffice.org/wiki/Dictionaries
Thank you,
But this list seems to be taken from aspell, and I have already extracted
the same words from aspell.
I am eager to know whether you have added any grammatical rules to aspell,
and I would welcome any hints on extending aspell for better Tamil spell
checking.
Note: We have collected nearly 30,000 Tamil words (including aspell's
14,000), extracted from Tamil web pages.
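
A minimal sketch of how Tamil words might be pulled out of page text, assuming the text has already been downloaded and decoded. The function name extract_tamil_words and the sample string are hypothetical; the only fixed fact used is that the Tamil Unicode block spans U+0B80 to U+0BFF. It does not judge whether a word is spelled correctly, so the filtering step still has to be done by hand.

```python
import re

# Runs of characters from the Tamil Unicode block (U+0B80..U+0BFF).
# Note: the block also contains Tamil digits and symbols, so some
# tokens may need to be discarded during manual review.
TAMIL_WORD = re.compile(r"[\u0B80-\u0BFF]+")


def extract_tamil_words(text):
    """Return the unique Tamil-script tokens in `text`, sorted by code point."""
    return sorted(set(TAMIL_WORD.findall(text)))


# Hypothetical sample: mixed Latin text, markup, digits and a repeated word.
sample = "Tamil: தமிழ் மொழி, தமிழ் 2009 <b>பள்ளி</b>"
print(extract_tamil_words(sample))
```

Duplicates are removed with a set, so the output can be merged directly into an existing word list before review.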
>
>
> Cheers,
> Ve. Elanjelian
>
> 2009/1/30 S.Selvam Siva <s.selvamsiva@xxxxxxxxx>
>
>> Hi all,
>>
>> I have explained the current status of our project, as follows:
>>
>> a) I have updated the code (revision 2) with the following modifications.
>>
>> 1) Added the Levenshtein Python module to build the list of suggested
>> words.
>>
>> The Levenshtein module compares two words character by character, and
>> I used Levenshtein.ratio(misspelled-word, dic-word), which returns a
>> similarity score between 0 and 1. Use "sudo apt-get install
>> python-levenshtein" to install it.
>>
>>
>> 2) Modified geditinterface.py to process all the Tamil words entered in
>> the current document (previously it was limited to 3 lines).
>>
>> geditinterface.py now sends the whole text to tamilspell.py.
>>
>> 3) Modified guicreator.py to mark the word needing attention.
>>
>> We need to mark the misspelled word, so I added that feature to
>> guicreator.py.
>>
>>
>> b)
>> http://bazaar.launchpad.net/~tamilspellchecker/tamilspellchecker/dev-tamilspellchecker/files
>>
>> The above is the link to all our source code files. If you want to
>> upload or download the code to/from Launchpad with Bazaar and don't know
>> how to do it, I will explain the process in the next mail.
>>
>> c) Tomorrow (31-01-2009), we are presenting our project at *MIT* for the
>> event *Carte Blanche project demo*. You are all welcome to come to MIT;
>> we can have a discussion there.
>>
>> d) Sathya Balan has collected 4000 unique Tamil words from websites. You
>> can also contribute by extracting Tamil words from the net and keeping
>> only the correctly spelled ones.
>> --
>> Yours,
>> S.Selvam
>>
>> _______________________________________________
>> Mailing list: https://launchpad.net/~tamilspellchecker
>> Post to : tamilspellchecker@xxxxxxxxxxxxxxxxxxx
>> Unsubscribe : https://launchpad.net/~tamilspellchecker
>> More help : https://help.launchpad.net/ListHelp
>>
>>
>
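
For reference, the suggestion step from item 1) of the quoted status mail could look roughly like this. It is only a sketch: suggest(), its limit and cutoff parameters, and the sample word list are hypothetical and not taken from tamilspell.py. When python-levenshtein is not installed, difflib from the standard library is used as a stand-in, and its scores may differ slightly from Levenshtein.ratio.

```python
try:
    # Provided by the python-levenshtein package mentioned in the mail.
    from Levenshtein import ratio
except ImportError:
    from difflib import SequenceMatcher

    def ratio(a, b):
        # Standard-library stand-in; an approximation of Levenshtein.ratio.
        return SequenceMatcher(None, a, b).ratio()


def suggest(word, dictionary, limit=5, cutoff=0.6):
    """Return up to `limit` dictionary words most similar to `word`."""
    scored = [(ratio(word, candidate), candidate) for candidate in dictionary]
    # Drop words below the (assumed) cutoff, then sort best score first;
    # ties are broken alphabetically so the order is deterministic.
    close = [pair for pair in scored if pair[0] >= cutoff]
    close.sort(key=lambda pair: (-pair[0], pair[1]))
    return [candidate for _, candidate in close[:limit]]


# Hypothetical mini dictionary; the real word list is far larger.
words = ["தமிழ்", "தமிழன்", "அம்மா", "பள்ளி"]
print(suggest("தமழ்", words))
```

Scoring every dictionary word is linear in the dictionary size, which should be acceptable for a list of a few tens of thousands of words but may need indexing for larger ones.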
--
Yours,
S.Selvam