GaelSpell Unix:
English summary

Kevin P. Scannell


Summary

This page has been provided as an aid to package maintainers who are unable to read the GaelSpell Unix home page which is entirely in Irish. This is in no sense a translation of the Irish page (which contains much more detailed descriptions).

Three packages are available from this site, providing Irish language support for the most widely used spellcheckers in the Open Source community: ispell-gaeilge, for Geoff Kuenning's International Ispell, aspell-gaeilge for Kevin Atkinson's Aspell, and hunspell-gaeilge for the OpenOffice.org spellchecker. The word lists are identical, just packaged differently.

Diarmaid Mac Mathúna has also repackaged the same underlying word list for use on Windows machines; it is available (under the GPL) from the primary GaelSpell site: www.gaelspell.com.


Features


Installation

There is no "configure" script -- after unpacking the tarball, try using "make" to build the hashed word list (gaeilge.hash for ispell and gaeilge for aspell). If you run into trouble you should edit the variables at the top of the Makefile to the appropriate directories and try again.

The ISO-639-1 code for Irish is "ga".


Alternate Models

The default hash table conforms strictly to standardized Irish spelling. You can generate either a "literary" or "dialect" model (ispell only) by changing the variable INSTALLATION at the top of the Makefile to gaeilgelit or gaeilgemor and using "make" as before.

The gaeilgelit model contains many obsolete or obscure (but standardly spelled) words which are probably best left out of any good Irish spellchecker. For instance, brúitíneach (a stumpy or stuffy person in Ó Dónaill) is a likely misspelling of the much more common word bruitíneach (the measles). Other typical "dangerous" word pairs: deirc for déirc. múid for muid, etc.

The gaeilgemor model, on the other hand, contains non-standard or dialect spellings (alongside the standard spellings) and accepts non-standard inflections of verbs. This greatly reduces its effectiveness as a spellchecking tool; indeed, anyone who uses non-standard forms so frequently that he or she finds the standard model inadequate will likely disagree with the very concept of an Irish spellchecker in the first place!

With all this in mind, I strongly urge installers to make the standard model the default on your system.


Contributing

Only a handful of people have corresponded with me about these packages and no one has contributed any words. Perhaps having an appeal in English will inspire some learners to contribute.

As noted above, I have the software infrastructure in place so that if someone sends me a list of "headwords" with grammatical information, all inflected forms of the word will be added automatically. This is a much more efficient (and less error-prone) way of building up a good word list than by adding forms one by one into a personal dictionary.

The word list already contains a large number of Irish personal names and placenames, but these were not added in any systematic way, and there are possibly quality issues. If there is anyone with a particular interest in this area that would like to help, please get in touch with me at kscanne at gmail dot com.


© Copyright 2002-2007 Kevin P. Scannell