tlgu 1.6

A utility for converting an input file in TLG or PHI representation into Unicode (UTF-8)
tlgu is a utility that converts an input file in Thesaurus Linguae Graecae (TLG) or Packard Humanities Institute (PHI) representation (beta code text and citation information) into Unicode (UTF-8).

A companion GNU/Linux Hellenic Polytonic HOWTO is also available on the tlgu site.

Options:

-b
inserts a form feed and citation information (levels a, b, c, d) on every "book" citation change. By default the program will output line feeds only (see also -p).
-p
observes paging instructions. By default the program will output line feeds only.
-r
primarily Roman text. Some TLG texts, notably doccan1.txt and doccan2.txt, are mainly Roman texts that lack explicit language change codes. Setting this option forces a change to Roman text after each citation block is encountered.
-v
highest-level reference citation is included before each text line (v-level)
-w
reference citation is included before each text line (w-level)
-x
reference citation is included before each text line (x-level)
-y
reference citation is included before each text line (y-level)
-z
lowest-level reference citation is included before each text line (z-level).
-Z < custom_citation_format_string >
an arbitrary combination of citation information is included before each text line (see also the -e option). For example, "%A/%B/%x/%y/%z\t" outputs the contents of the A and B citation description levels, followed by the x, y, z citation reference levels, followed by a TAB character.
-e < custom_blank_citation_string >
if there is no citation information for a citation level defined with the -Z option above, a single right-hand slash is substituted by default; you may define any other string with this option, e.g. "-" or "[NONE]".
-B
inserts blank space (a tab) before each and every line.
-C
citation debug information is output.
-S
special code debug information is output.
-V
block processing information is output (verbose).
-W
each work (book) is output as a separate file in the form output_file-xxx.txt
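
The way the -Z format string and the -e blank string interact, as described above, can be sketched in a few lines of Python. This is only an illustration of the documented behaviour, not tlgu's actual code: the function name expand_citation, the dictionary of citation levels, the sample values and the choice of "/" as the default blank string are all assumptions made for the example.

```python
import re


def expand_citation(fmt, levels, blank="/"):
    """Expand %<letter> placeholders in a -Z style format string.

    Hypothetical helper: `levels` maps a citation level letter
    (e.g. "A", "x") to its current value. Levels that are missing
    or empty are replaced by `blank`, mimicking the -e option.
    """
    # Turn the two-character sequence backslash + t into a real TAB first.
    fmt = fmt.replace("\\t", "\t")

    def substitute(match):
        value = levels.get(match.group(1), "")
        return value if value else blank

    return re.sub(r"%([A-Za-z])", substitute, fmt)


# Made-up citation values; the y level is deliberately left empty.
sample_levels = {"A": "Author", "B": "Work", "x": "1", "y": "", "z": "5"}
line_prefix = expand_citation("%A/%B/%x/%y/%z\\t", sample_levels, blank="-")
print(repr(line_prefix))
```

In this sketch the empty y level is filled with the string passed as blank, which is the role the -e option plays for tlgu itself.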

last updated on:
December 7th, 2011, 9:56 GMT
price:
FREE!
developed by:
Dimitri Marinakis
license type:
GPL (GNU General Public License) 
category:
ROOT \ Text Editing&Processing \ Others

What's New in This Release:
  • Compact citation format (x or y citations are inserted as they change with -X and -Y options).
  • Hyphen, space, and line end removal with the -N option.