
Normation: Available classes

General

Normation classes are already implemented for several languages. To use one of these existing classes for your dictionary, set the property languageXNormationClassName in the file DictionaryForMIDs.properties. Read here about how to define Normation classes in that file.
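As an illustration of the property named above, the following minimal Java sketch parses a hypothetical properties excerpt and looks up the Normation class configured per language. It assumes that the X in languageXNormationClassName stands for the language number; the class name NormationPropertyDemo, the sample values and the use of java.util.Properties are assumptions made for this example and are not taken from the DictionaryForMIDs code. The exact value format expected (for example, whether a package prefix is required) is described in the properties documentation.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

final class NormationPropertyDemo {
    // Hypothetical excerpt of a DictionaryForMIDs.properties file.
    // The values are placeholders chosen for this sketch.
    static final String SAMPLE =
            "language1NormationClassName: NormationGer\n"
          + "language2NormationClassName: NormationEng\n";

    /** Parses properties text held in memory. */
    static Properties parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            // Not expected when reading from an in-memory string.
            throw new IllegalStateException(e);
        }
        return props;
    }

    /** Returns the class name configured for the given language number, or null. */
    static String normationClassFor(Properties props, int languageNumber) {
        return props.getProperty("language" + languageNumber + "NormationClassName");
    }

    public static void main(String[] args) {
        Properties props = parse(SAMPLE);
        System.out.println(normationClassFor(props, 1));
        System.out.println(normationClassFor(props, 2));
    }
}
```

A language without a configured property yields null in this sketch; how the real application behaves in that case is not covered here.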

If you intend to implement an additional Normation class, please read here.

List of existing Normation classes

Normation classes are implemented for the following languages:

Name of Normation class

Supported Language(s)

Description

NormationLat

Latin-based languages

This class provides a first level of normation for Latin-based languages, i.e. many European languages.

NormationLat strips diacritical marks from characters, so that, for example, "señora" is also found when searching for "senora".

NormationBul

Bulgarian

Allows you to search for words in both Cyrillic script and Latin transliteration. The transliteration is based on the official Bulgarian transliteration system (2006).

NormationCyr1

Russian, Ukrainian, Macedonian

Allows you to search for words in both Cyrillic script and Latin transliteration. The Russian and Ukrainian transliteration is based on GOST 16876 (1971). The Macedonian transliteration is based on the system used in passports of the Republic of Macedonia since 2008.

NormationCyr2

Russian, Ukrainian, Macedonian

Allows you to search for words in both Cyrillic script and Latin transliteration. The Russian and Ukrainian transliteration is based on GOST 7.79 System B (2002). The Macedonian transliteration is based on the system used in passports of the Republic of Macedonia since 2008.

NormationEng

English

Currently NormationEng only provides basic general normation, such as ignoring upper and lower case and ignoring punctuation characters. A really good Normation class for English still needs to be implemented.

NormationEpo

Esperanto

NormationEpo allows the user to ignore the diacritics on ĉ, ĥ, ĵ, ĝ, ŝ and ŭ and simply type c, h, j, g, s and u. The letter x is also ignored altogether in searches, because users often write 'cxapelo' for 'ĉapelo'.

NormationGer

German

Allows you to search for umlauts (ä, ü, ö) by typing ae, ue, oe, and for ß by typing ss.

NormationFil

Tagalog, Cebuano and other Philippine languages

Treats the following character pairs as equal: v and b, f and p, i and e, o and u.

NormationJpn

Japanese

Handling of Romaji/Hiragana/Katakana.

NormationNor

Norwegian

Handling of special Norwegian characters.

NormationVie

Vietnamese

(description not yet available)

NormationTam

Tamil

Handles transliteration for Tamil. Input typed in English (Latin) characters is automatically transliterated into the equivalent Tamil characters.
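Some of the rules listed in the table above can be sketched in a few lines of Java. The sketch below covers the behaviour described for NormationLat, NormationGer, NormationEpo and NormationFil only. It is not the code of those classes: the class name NormationRulesSketch, the method names, the use of java.text.Normalizer and the direction in which the Philippine character pairs are folded (v to b, f to p, e to i, u to o) are all assumptions made for illustration. Non-ASCII characters are written as Unicode escapes so that the source compiles regardless of file encoding.

```java
import java.text.Normalizer;
import java.util.Locale;

final class NormationRulesSketch {
    /** Lower-cases the input; every rule below starts with this step. */
    private static String lower(String s) {
        return s.toLowerCase(Locale.ROOT);
    }

    /** Removes combining marks after canonical decomposition (NFD). */
    private static String stripMarks(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFD).replaceAll("\\p{M}+", "");
    }

    /** Latin-based languages: n with tilde becomes plain n, and so on. */
    static String latin(String s) {
        return stripMarks(lower(s));
    }

    /** German: umlauts become ae, oe, ue and sharp s becomes ss. */
    static String german(String s) {
        return lower(s)
                .replace("\u00e4", "ae")
                .replace("\u00f6", "oe")
                .replace("\u00fc", "ue")
                .replace("\u00df", "ss");
    }

    /** Esperanto: diacritics are dropped and the letter x is ignored. */
    static String esperanto(String s) {
        return stripMarks(lower(s)).replace("x", "");
    }

    /** Philippine languages: each pair is folded onto one member (assumed direction). */
    static String filipino(String s) {
        return lower(s)
                .replace('v', 'b')
                .replace('f', 'p')
                .replace('e', 'i')
                .replace('u', 'o');
    }

    public static void main(String[] args) {
        System.out.println(latin("se\u00f1ora"));        // senora
        System.out.println(german("Gr\u00f6\u00dfe"));   // groesse
        System.out.println(esperanto("cxapelo"));        // capelo
        System.out.println(filipino("Filipino"));        // pilipino
    }
}
```

In this sketch both the search term and the dictionary entry would be passed through the same method, and two words match when their normated forms are equal. For example, esperanto("\u0109apelo") and esperanto("cxapelo") both yield "capelo".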

A general note: Normation classes usually also provide basic general normation, such as ignoring upper/lower case and ignoring punctuation characters.
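The basic general normation mentioned in this note can be pictured with the following minimal Java sketch. It assumes that "ignoring" means lower-casing the text and deleting ASCII punctuation before comparison; the class name BasicNormationSketch and its methods are hypothetical and do not reproduce the actual implementation.

```java
import java.util.Locale;

final class BasicNormationSketch {
    /** Lower-cases the text and removes ASCII punctuation characters. */
    static String normate(String s) {
        return s.toLowerCase(Locale.ROOT).replaceAll("\\p{Punct}", "");
    }

    /** Two strings match when their normated forms are identical. */
    static boolean matches(String searchTerm, String dictionaryEntry) {
        return normate(searchTerm).equals(normate(dictionaryEntry));
    }

    public static void main(String[] args) {
        System.out.println(normate("Don't Panic!"));   // dont panic
        System.out.println(matches("usa", "U.S.A."));  // true
    }
}
```

A language-specific class would apply its own rules in addition to a step like this one.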