tlgu

Version 1.6, GPL (GNU General Public License)
A utility for converting an input file in TLG or PHI representation into Unicode (UTF-8)

tlgu is a utility for converting an input file in Thesaurus Linguae Graecae (TLG) or Packard Humanities Institute (PHI) representation (beta code text and citation information) into Unicode (UTF-8).

A companion GNU/Linux Hellenic Polytonic HOWTO is also available on the tlgu site.
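
As a rough illustration of driving the converter from a script, the Python sketch below assembles a tlgu command line. It assumes the invocation form `tlgu [options] input_file [output_file]` and a `tlgu` binary on the PATH; the helper name `build_tlgu_command` and the output file name are hypothetical. The command is only built and printed here, not executed.

```python
import shlex


def build_tlgu_command(input_file, output_file=None, options=()):
    """Assemble an argument list, assuming: tlgu [options] input_file [output_file]."""
    command = ["tlgu", *options, input_file]
    if output_file is not None:
        command.append(output_file)
    return command


# doccan1.txt is named in the option list below; the output name is made up.
command = build_tlgu_command("doccan1.txt", "doccan1-utf8.txt", options=["-r"])
print(shlex.join(command))
```

To actually run the conversion, the resulting list could be passed to `subprocess.run(command, check=True)`.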

Options:

-b
inserts a form feed and citation information (levels a, b, c, d) on every "book" citation change. By default the program will output line feeds only (see also -p).
-p
observes paging instructions. By default the program will output line feeds only.
-r
primarily Roman text. Some TLG texts, notably doccan1.txt and doccan2.txt, are mainly Roman texts lacking explicit language change codes. Setting this option will force a change to Roman text after each citation block is encountered.
-v
highest-level reference citation is included before each text line (v-level)
-w
reference citation is included before each text line (w-level)
-x
reference citation is included before each text line (x-level)
-y
reference citation is included before each text line (y-level)
-z
lowest-level reference citation is included before each text line (z-level).
-Z < custom_citation_format_string >
an arbitrary combination of citation information is included before each text line (see also the -e option); e.g. "%A/%B/%x/%y/%z\t" will output the contents of the A, B citation description levels, followed by the x, y, z citation reference levels, followed by a TAB character.
-e < custom_blank_citation_string >
if there is no citation information for a citation level defined with the -Z option above, a single right-hand slash is substituted by default; you may define any other string with this option, e.g. "-" or "[NONE]".
-B
inserts blank space (a tab) before each and every line.
-C
citation debug information is output.
-S
special code debug information is output.
-V
block processing information is output (verbose).
-W
each work (book) is output as a separate file in the form output_file-xxx.txt
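
The interplay of the -Z and -e options can be pictured with a small Python sketch. This is not tlgu's own code, only an approximation of the behaviour described above: each `%` followed by a level letter is replaced by that level's value, and an empty or missing level is replaced by the blank string. The function name `expand_citation`, the sample citation values, and the assumption that a tab is written as `\t` in the format string are all illustrative.

```python
def expand_citation(format_string, levels, blank):
    """Expand %<letter> placeholders and a backslash-t escape (illustrative only)."""
    out = []
    i = 0
    while i < len(format_string):
        ch = format_string[i]
        nxt = format_string[i + 1] if i + 1 < len(format_string) else ""
        if ch == "%" and nxt:
            # Use the citation level's value, or the blank string if it is empty or missing.
            out.append(levels.get(nxt) or blank)
            i += 2
        elif ch == "\\" and nxt == "t":
            # Assumed convention: backslash followed by t stands for a TAB character.
            out.append("\t")
            i += 2
        else:
            out.append(ch)
            i += 1
    return "".join(out)


# Hypothetical citation values; the y level is deliberately left out.
levels = {"A": "Homerus", "B": "Ilias", "x": "1", "z": "15"}
print(repr(expand_citation(r"%A/%B/%x/%y/%z\t", levels, blank="[NONE]")))
# prints 'Homerus/Ilias/1/[NONE]/15\t'
```

Here the missing y level shows up as `[NONE]`, which is the kind of substitution the -e option is described as controlling.
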
Last updated on December 7th, 2011