Conversion options
These flags control the linguistic rules applied during conversion.
Preset
--preset selects a preconfigured combination of defaults:
Individual flags below override the preset's defaults.
Segmentation strategy
--segmentation controls how word boundaries are found:
lattice(default): finds the globally optimal segmentation by evaluating all dictionary matches at every position with dynamic programming. Best for accuracy.eager: greedy left-to-right longest-match. Faster but may mis-segment compound words.
Numeral handling
--numerals controls how hanja numerals are rendered:
Initial sound law
The initial sound law (頭音法則) is enabled by default for ko-kr and
disabled for ko-kp. It affects character-by-character fallback readings for
characters not found in any dictionary; dictionary entries already encode their
correct readings.
Override with explicit flags:
Homophone disambiguation
When the same hanja appears multiple times in a window, Gukhanmun can mark
repeated occurrences so readers can tell them apart. --disambiguation
sets the scope of that window:
First-occurrence clearing
--first-occurrence removes annotations from characters whose presentation
was already forced earlier in the window:
Error recovery
--recovery controls behaviour when an unrecoverable parse error occurs
(currently relevant for HTML input only):
strict(default) — abort with an errorlenient— skip the problematic fragment and continue