author: rolandsteiner@chromium.org <rolandsteiner@chromium.org@0039d316-1c4b-4281-b951-d872f2087c98> 2010-10-29 06:05:56 +0000
committer: rolandsteiner@chromium.org <rolandsteiner@chromium.org@0039d316-1c4b-4281-b951-d872f2087c98> 2010-10-29 06:05:56 +0000
commit: dff8c1108d0b414ea73ae0e0d0d904e09856cdd6
tree: adae2223b4034afb2f017346f2cec11bd32bb4e3 /third_party/hyphen/README.compound
parent: 906eed0d5bb752a29adbfa86054df1bebd507a80
Add Hunspell 'Hyphen' hyphenation library.
BUG=60895
TEST=none
Review URL: http://codereview.chromium.org/4143003
git-svn-id: svn://svn.chromium.org/chrome/trunk/src@64368 0039d316-1c4b-4281-b951-d872f2087c98
Diffstat (limited to 'third_party/hyphen/README.compound')
-rw-r--r-- third_party/hyphen/README.compound 57
1 files changed, 57 insertions, 0 deletions
diff --git a/third_party/hyphen/README.compound b/third_party/hyphen/README.compound
new file mode 100644
index 0000000..9f43f13
--- /dev/null
+++ b/third_party/hyphen/README.compound
@@ -0,0 +1,57 @@
+Compound word hyphenation
+
+The Hyphen library supports improved compound word hyphenation, including
+the special compound hyphenation rules of Germanic languages and of other
+languages whose compounds can have an arbitrary number of parts. The new
+options, COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN, help to set the
+right style for the hyphenation of compound words.
+
+Algorithm
+
+The algorithm is an extension of the original pattern-based hyphenation
+algorithm. It uses two hyphenation pattern sets, defined in the same
+pattern file and separated by the NEXTLEVEL keyword. The first pattern
+set is for hyphenation only at compound word boundaries; the second one
+is for hyphenation within words or word parts.
+
+Recursive compound level hyphenation
+
+The algorithm is recursive: every word part produced by a successful
+first (compound) level hyphenation is rehyphenated
+with the same (first) pattern set.
+
+Finally, when first level hyphenation is not possible, Hyphen uses
+the second level hyphenation for the word or the word parts.
+
+Word endings and word parts
+
+Patterns for word endings (patterns with ellipses) match the
+word parts, too.
+
+Options
+
+COMPOUNDLEFTHYPHENMIN: minimum hyphenation distance from the left compound word boundary
+COMPOUNDRIGHTHYPHENMIN: minimum hyphenation distance from the right compound word boundary
+NEXTLEVEL: marks the start of the second level hyphenation patterns
+
+Default hyphenmin values
+
+The default values of COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN are 0,
+and they are applied as 0 during hyphenation, too. (By contrast, "0" values of
+LEFTHYPHENMIN and RIGHTHYPHENMIN mean the default "2" during hyphenation.)
+
+Examples
+
+See tests/compound* test files.
+
+Preparation of hyphenation patterns
+
+There is no special pattern generator tool for compound hyphenation
+patterns yet. It is possible to use PATGEN to generate both
+pattern sets, concatenate them manually, and set the requested HYPHENMIN values.
+(But don't forget the preprocessing step with substrings.pl before
+concatenation.) One disadvantage of this method is that PATGEN
+doesn't know about the recursive compound hyphenation of Hyphen.
+
+László Németh
+<nemeth (at) openoffice.org>
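
The recursive two-level scheme described in the README can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the library's C implementation: the two pattern sets are replaced by hypothetical toy functions (`toy_compound_level`, `toy_word_level`), the function and parameter names are invented for this sketch, and the way the COMPOUND*HYPHENMIN values are applied (as minimum distances between a second level break and the neighbouring compound boundary) is an assumption drawn from the option descriptions above.

```python
from typing import Callable, List

# Hypothetical stand-in for a pattern set: maps a word to candidate break offsets.
Breaker = Callable[[str], List[int]]

VOWELS = set("aeiou")


def toy_compound_level(word: str) -> List[int]:
    """Toy first level set: a boundary before the component "tiger"."""
    i = word.find("tiger")
    return [i] if i > 0 else []


def toy_word_level(word: str) -> List[int]:
    """Toy second level set: break after a vowel in vowel-consonant-vowel."""
    return [i + 1 for i in range(len(word) - 2)
            if word[i] in VOWELS
            and word[i + 1] not in VOWELS
            and word[i + 2] in VOWELS]


def hyphenate(word: str, compound_level: Breaker, word_level: Breaker,
              cleft: int = 0, cright: int = 0,
              left: int = 2, right: int = 2) -> List[int]:
    """Return sorted break offsets for `word` (assumed semantics, see lead-in)."""
    n = len(word)
    boundaries = sorted({i for i in compound_level(word) if 0 < i < n})
    if not boundaries:
        # First level found nothing: fall back to the second level set.
        return [i for i in sorted(set(word_level(word)))
                if 0 < i < n and left <= i <= n - right]
    breaks: List[int] = []
    edges = [0] + boundaries + [n]
    for start, end in zip(edges, edges[1:]):
        # A part edge that is a compound boundary uses the compound minimum;
        # an edge inherited from the enclosing word keeps that word's minimum.
        lmin = left if start == 0 else cleft
        rmin = right if end == n else cright
        # Recursion: each part is first retried with the first level set.
        inner = hyphenate(word[start:end], compound_level, word_level,
                          cleft, cright, lmin, rmin)
        breaks.extend(start + i for i in inner)
        if end < n:
            breaks.append(end)  # the compound boundary itself
    return breaks


def show(word: str, breaks: List[int]) -> str:
    """Render break offsets as a hyphenated string."""
    pieces, prev = [], 0
    for b in breaks:
        pieces.append(word[prev:b])
        prev = b
    pieces.append(word[prev:])
    return "-".join(pieces)


word = "papertiger"
print(show(word, hyphenate(word, toy_compound_level, toy_word_level)))
# pa-per-ti-ger
print(show(word, hyphenate(word, toy_compound_level, toy_word_level, cleft=3)))
# pa-per-tiger
```

With the toy rules, raising `cleft` to 3 suppresses the break two letters after the compound boundary, which is the kind of style control the COMPOUNDLEFTHYPHENMIN option is described as providing. Real pattern files would replace both toy functions.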