tlgu is a utility that converts an input file in Thesaurus Linguae Graecae (TLG) or Packard Humanities Institute (PHI) representation (beta code text and citation information) into Unicode (UTF-8).
A companion GNU/Linux Hellenic Polytonic HOWTO is also available on the tlgu site.
inserts a form feed and citation information (levels a, b, c, d) on every "book" citation change. By default the program will output line feeds only (see also -p).
observes paging instructions. By default the program will output line feeds only.
primarily Roman text. Some TLG texts, notably doccan1.txt and doccan2.txt, are mainly Roman texts lacking explicit language change codes. Setting this option forces a change to Roman text after each citation block is encountered.
highest-level reference citation is included before each text line (v-level).
reference citation is included before each text line (w-level).
reference citation is included before each text line (x-level).
reference citation is included before each text line (y-level).
lowest-level reference citation is included before each text line (z-level).
-Z <custom_citation_format_string>
an arbitrary combination of citation information is included before each text line (see also the -e option). For example, "%A/%B/%x/%y/%z\t" will output the contents of the A, B citation description levels, followed by the x, y, z citation reference levels, followed by a TAB character.
-e <custom_blank_citation_string>
if there is no citation information for a citation level defined with the -Z option above, a single right-hand slash is substituted by default; you may define any string with this option, e.g. "-" or "[NONE]" are valid inputs.
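The interplay between the -Z format string and the -e blank string can be illustrated with a small Python sketch. This is not tlgu's own code: the function name expand_citation, the levels dictionary, and the sample values are hypothetical, and the sketch assumes that each placeholder is a percent sign followed by a single level letter.

```python
import re


def expand_citation(fmt, levels, blank):
    """Expand %<letter> placeholders in fmt (illustrative only, not tlgu code).

    levels maps a level letter (e.g. "A", "x") to its citation text;
    a missing or empty level is replaced by the blank string.
    """
    def substitute(match):
        value = levels.get(match.group(1), "")
        return value if value else blank

    # Assumption: a placeholder is "%" followed by exactly one letter.
    return re.sub(r"%([A-Za-z])", substitute, fmt)


# Made-up sample values; the y level is deliberately left empty.
levels = {"A": "Author", "B": "Work", "x": "1", "y": "", "z": "25"}
prefix = expand_citation("%A/%B/%x/%y/%z\t", levels, blank="-")
print(repr(prefix))
```

With "[NONE]" passed as the blank string instead of "-", the empty y level would appear as [NONE] in the prefix.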
inserts blank space (a tab) before each and every line.
citation debug information is output.
special code debug information is output.
block processing information is output (verbose).
each work (book) is output as a separate file in the form output_file-xxx.txt
What's New in This Release:
· Compact citation format (x or y citations are inserted as they change with -X and -Y options).
· Hyphen, space, and line end removal with the -N option.
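The idea behind the compact citation format, where a citation is inserted only when it changes, can be sketched in Python as follows. The function name compact_citations and the sample data are hypothetical, and the actual layout tlgu produces with -X and -Y may differ; the sketch shows only the "print on change" logic.

```python
def compact_citations(lines):
    """Yield (shown_citation, text) pairs (illustrative only, not tlgu code).

    The citation is shown only on the first line where it differs from
    the previous line's citation; otherwise an empty string is yielded.
    """
    previous = None
    for citation, text in lines:
        shown = citation if citation != previous else ""
        previous = citation
        yield shown, text


# Made-up sample: two lines under citation "1", then one under "2".
sample = [("1", "first line"), ("1", "second line"), ("2", "third line")]
result = list(compact_citations(sample))
for shown, text in result:
    print(f"{shown}\t{text}")
```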