# Copyright 2002 by Eric House (fixin@peak.org). All rights reserved. # # This program is free software; you can redistribute it and/or # modify it under the terms of the GNU General Public License # as published by the Free Software Foundation; either version 2 # of the License, or (at your option) any later version. # # This program is distributed in the hope that it will be useful, # but WITHOUT ANY WARRANTY; without even the implied warranty of # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the # GNU General Public License for more details. # # You should have received a copy of the GNU General Public License # along with this program; if not, write to the Free Software # Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA. LANGCODE=nl_NL # Need to do the IJ->Y transformation first because it changes # (reduces) the lengths. We need to respect lower bounds as well as # upper. # LANGFILTER_PRECLIP: tr [a-z] [A-Z] # LANGFILTER_PRECLIP: | sed LANGFILTER_POSTCLIP: | tr [a-z] [A-Z] LANGFILTER_POSTCLIP: | grep '^[A-Z]*$' LANGINFO:
Dutch uses the same letters as English, but there's a LANGINFO: twist: Anyplace that "IJ" appears in a Dutch word in a LANGINFO: single sylable you can use a "Y" (just one) instead. The LANGINFO: way to handle this for a Crosswords dictionary is to have LANGINFO: two versions of the word in the list, one with "IJ" and the LANGINFO: other with "Y". (Note: if "IJ" appeared in two places in a LANGINFO: word you'd need a total of four to represent all the LANGINFO: possiblities.
LANGINFO:That said, my 'sed' skills aren't up to the task of LANGINFO: expressing the above in a rule, so I'm going to leave it up LANGINFO: to my users -- you folks. If you're building a Dutch LANGINFO: dictionary you'll need to do the IJ->Y duplication yourself LANGINFO: in the wordlist file before sending it to me.
# High bit means "official". Next 7 bits are an enum where # Dutch==4. Low byte is padding XLOC_HEADER:0x8400