author | rolandsteiner@chromium.org <rolandsteiner@chromium.org@0039d316-1c4b-4281-b951-d872f2087c98> | 2010-10-29 06:05:56 +0000 |
---|---|---|
committer | rolandsteiner@chromium.org <rolandsteiner@chromium.org@0039d316-1c4b-4281-b951-d872f2087c98> | 2010-10-29 06:05:56 +0000 |
commit | dff8c1108d0b414ea73ae0e0d0d904e09856cdd6 (patch) | |
tree | adae2223b4034afb2f017346f2cec11bd32bb4e3 /third_party/hyphen/README.compound | |
parent | 906eed0d5bb752a29adbfa86054df1bebd507a80 (diff) | |
Add Hunspell 'Hyphen' hyphenation library.
BUG=60895
TEST=none
Review URL: http://codereview.chromium.org/4143003
git-svn-id: svn://svn.chromium.org/chrome/trunk/src@64368 0039d316-1c4b-4281-b951-d872f2087c98
Diffstat (limited to 'third_party/hyphen/README.compound')
-rw-r--r-- | third_party/hyphen/README.compound | 57 |
1 files changed, 57 insertions, 0 deletions
diff --git a/third_party/hyphen/README.compound b/third_party/hyphen/README.compound
new file mode 100644
index 0000000..9f43f13
--- /dev/null
+++ b/third_party/hyphen/README.compound
@@ -0,0 +1,57 @@
+Compound word hyphenation
+
+The Hyphen library supports better compound word hyphenation and the
+special compound word hyphenation rules of Germanic languages and other
+languages with an arbitrary number of compound words. The new options,
+COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN, help to set the right
+style for the hyphenation of compound words.
+
+Algorithm
+
+The algorithm is an extension of the original pattern-based hyphenation
+algorithm. It uses two hyphenation pattern sets, defined in the same
+pattern file and separated by the NEXTLEVEL keyword. The first pattern
+set is for hyphenation only at compound word boundaries; the second one
+is for hyphenation within words or word parts.
+
+Recursive compound level hyphenation
+
+The algorithm is recursive: every word part of a successful
+first (compound) level hyphenation will be rehyphenated
+by the same (first) pattern set.
+
+Finally, when first level hyphenation is not possible, Hyphen uses
+the second level hyphenation for the word or the word parts.
+
+Word endings and word parts
+
+Patterns for word endings (patterns with ellipses) match the
+word parts, too.
+
+Options
+
+COMPOUNDLEFTHYPHENMIN: min. hyph. dist. from the left compound word boundary
+COMPOUNDRIGHTHYPHENMIN: min. hyph. dist. from the right comp. word boundary
+NEXTLEVEL: marks the start of the second level hyphenation patterns
+
+Default hyphenmin values
+
+The default values of COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN are 0,
+and they are used as 0 during hyphenation, too. (By contrast, "0" values of
+LEFTHYPHENMIN and RIGHTHYPHENMIN mean the default "2" during hyphenation.)
+
+Examples
+
+See tests/compound* test files.
+
+Preparation of hyphenation patterns
+
+There is no special pattern generator tool for compound hyphenation
+patterns yet. It is possible to use PATGEN to generate both
+pattern sets, concatenate them manually and set the requested HYPHENMIN values.
+(But don't forget the preprocessing steps with substrings.pl before
+concatenation.) One disadvantage of this method is that PATGEN
+doesn't know the recursive compound hyphenation of Hyphen.
+
+László Németh
+<nemeth (at) openoffice.org>
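
The two-level scheme described in the README can be illustrated with a small Python sketch. It is not the Hyphen implementation and not part of this commit. The function names, the toy pattern sets, and the `=` / `-` output markers are all hypothetical. It assumes Liang-style patterns (digits between letters, odd values allow a break, `.` marks a word edge), and it assumes one reading of the options: the compound minimums limit second-level breaks measured from a compound boundary, while the ordinary minimums apply at the outer edges of the whole word.

```python
import re


def parse_pattern(pat):
    """Split a Liang-style pattern such as 'o1t' into ('ot', [0, 1, 0])."""
    letters = re.sub(r"\d", "", pat)
    weights = [0] * (len(letters) + 1)
    pos = 0
    for ch in pat:
        if ch.isdigit():
            weights[pos] = int(ch)  # weight of the gap before letters[pos]
        else:
            pos += 1
    return letters, weights


def break_points(word, patterns, left_min, right_min):
    """Return indices i where a break may be placed before word[i]."""
    padded = "." + word + "."  # '.' lets edge patterns match this word or part
    scores = [0] * (len(padded) + 1)
    for letters, weights in patterns:
        start = padded.find(letters)
        while start != -1:
            for k, w in enumerate(weights):
                scores[start + k] = max(scores[start + k], w)
            start = padded.find(letters, start + 1)
    # The gap before word[i] is the gap before padded[i + 1]; odd score = break.
    return [i for i in range(1, len(word))
            if scores[i + 1] % 2 == 1 and left_min <= i <= len(word) - right_min]


def hyphenate(word, level1, level2, left_min=2, right_min=2,
              comp_left=0, comp_right=0):
    """Mark compound boundaries with '=' and second-level breaks with '-'."""

    def recurse(part, at_start, at_end):
        # First level: compound boundaries, retried on every resulting part.
        cuts = break_points(part, level1, 1, 1)
        if cuts:
            bounds = [0] + cuts + [len(part)]
            return "=".join(
                recurse(part[a:b], at_start and a == 0, at_end and b == len(part))
                for a, b in zip(bounds, bounds[1:]))
        # No compound boundary found: fall back to the second pattern set.
        lo = left_min if at_start else comp_left    # assumed interpretation
        hi = right_min if at_end else comp_right    # assumed interpretation
        cuts = break_points(part, level2, lo, hi)
        bounds = [0] + cuts + [len(part)]
        return "-".join(part[a:b] for a, b in zip(bounds, bounds[1:]))

    return recurse(word, True, True)


# Toy pattern sets, invented for this example only.
LEVEL1 = [parse_pattern(p) for p in ("e1s", "1bike.")]
LEVEL2 = [parse_pattern(p) for p in ("o1t",)]

print(hyphenate("motorbikeshop", LEVEL1, LEVEL2))                # mo-tor=bike=shop
print(hyphenate("motorbikeshop", LEVEL1, LEVEL2, comp_right=4))  # motor=bike=shop
```

In the first call, the word-ending pattern `1bike.` does not match the full word, but it matches once `motorbike` has been split off and rehyphenated, which is the recursive step the README describes. In the second call, the hypothetical `comp_right=4` suppresses the break in `motor` because it lies only three letters from the compound boundary on its right.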