Search Perform an advanced search query SOFTPEDIA
 
SOFTPEDIA
Updated one minute ago
HomeSubmit a program for being reviewedAdvertise on our websiteGet help on surfing our websitesSend us your feedbackGet information about our XML/RSS backend and how to use itBrowse the news archiveVisit our discussion forumVizitati forumul in limba romana



Program Finder

uni2ascii 4.9

Download uni2ascii 4.9
Add to download basket
Send update information
Rating: Good
Rated by: 17 user(s)

Recent news
- Softpedia Recommended Mac A...
- iPhone 3G Gets Even Babies ...
- LinkedIn Sells Private Cust...
- Internet Explorer 8 - The G...
- BlackBerry Bold out on July...
- Google's StreetView Pr...
- Sony Ericsson W595 to Repla...
- E-fencing on the Rise in th...
- 50,000:1 Contrast Ratio LCD...
- A Nokia Aeon Wannabe

Downloads: 2,447
Developer: Bill Poser | More programs by this producer
License: GPL v3
Price: FREE 
Last Updated: May 7th, 2008 22:51
Category: MAIN :: Text Editing&Processing :: Markup
 Read user reviews (0) Add a review Refer to friend Subscribe

uni2ascii description Download
uni2ascii and ascii2uni convert between UTF-8 Unicode and any of a variety of 7-bit ASCII equivalents including: hexadecimal and decimal HTML numeric character references, u-escapes, standard hexadecimal, and raw hexadecimal.

Such ASCII equivalents are useful when including Unicode text in program source, when entering text into Web programs that can handle the Unicode character set but are not 8-bit safe, and when debugging.

The Unicode escapes available are:

· HTML hexadecimal numeric character references (e.g. é)
· HTML decimal numeric character references (e.g. ȳ)
· u-escapes, as used in Python (e.g. u00E9)
· u-escapes within the BMP and U-escapes beyond the BMP, e.g. u00E9 but U00010024.
· U+-escapes (e.g. U+00E9)
· U-escapes (e.g. U00E9)
· u-escapes (e.g. u00E9)
· U-escapes within angle brackets (e.g. )
· x-escapes (e.g. x00E9)
· x-escapes with braces (e.g. x{00E9})
· Standard hexadecimal (e.g. 0x00E9)
· Raw hexadecimal (e.g. 00E9)

uni2ascii accepts a command line flag determining whether to generate upper-case A-F or lower-case a-f as hexadecimal digits since some some programs accept only one or the other. ascii2uni accepts either.

In the case of uni2ascii by default, only characters outside the ASCII range are converted. Even if ASCII characters are also converted, newlines are preserved unless their conversion is explicitly requested. Space characters are also preserved unless conversion is explicitly requested. In the case of the three non-ASCII space characters (Ethiopic word space, Ogham space, and ideographic space), if space characters are not converted, these are replaced with ASCII space (0x20) so as to keep the output within the 7-bit ASCII range.

This package contains four programs. The main program is uni2ascii. It is written in C and must be compiled. uni2html.py is the predecessor to uni2ascii. As it is written in Python, it does not need to be compiled and should run on just about any current computer. uni2ascii is otherwise superior in that:

· It generates a wider range of output formats.
· It is approximately 20 times faster.
· It handles input in the full 32 bit Unicode range. In contrast, uni2html handles only the

Basic Multilingual Plane (Plane 0) because at present Python represents Unicode encoded text internally using 16-bit integers. If you've got text in, say, Linear B or Ugaritic, you need uni2ascii.

It does a better job of reporting errors. If it encounters an error in its input, such as mal-formed UTF-8, it reports the location of the error both in terms of the character count from the beginning of the file (starting at 0) and in terms of the byte count from the beginning of the file (also starting at 0). (Character counts and byte counts are generally not the same since a UTF-8 encoded character occupies from one to four bytes.) The Python version reports only the character count. uni2ascii also provides information about the nature of the error.

The third program, ascii2uni, is the inverse of uni2ascii. It accepts text containing a variety of ASCII representations of Unicode characters and generates UTF-8 Unicode.

The fourth program, ascii2uni.py, reads 7-bit ASCII containing u-escaped Unicode, as used in Python and Tcl, and converts it to UTF-8 Unicode. It is the original program of which ascii2uni is a generalization.

What's New in This Release:

· This release fixes a bug that produced bad output or a segmentation fault if a line ended in the prefix to an escape.
· In quoted-printable format, if a line ends in an equal-sign, both the equal sign and the immediately following newline are now skipped by ascii2uni, in accordance with RFC 2045.


uni2ascii Download
uni2ascii Screenshots
Find Related Downloads



MOST POPULAR PROGRAMS IN THIS CATEGORY ( Markup )  

XMLmind XML Editor 3.7.1 (Personal Edition)
XMLmind XML Editor is a validating XML editor featuring a word processor-like view.
screenshot
update
Bluefish 1.1.5
Bluefish is a GTK-based Web development editor.
screenshot
pick | update
pdfTeX 1.40.7
pdfTeX is an extended version of TeX that can create PDF directly from TeX source files.
screenshot
update
JabRef 2.2
JabRef is a graphical Java application for managing bibtex (. bib) databases.
screenshot
update
uni2ascii 4.9
uni2ascii and ascii2uni convert between UTF-8 Unicode and any of a variety of 7-bit ASCII.
screenshot
update
Parsifal XML Parser 1.0.0
Parsifal XML Parser is a validating XML 1.0 parser written in ANSI C.
screenshot
update
Xerces2 2.9.1
Xerces2 is the next generation Apache Xerces-J XML parser.
screenshot
update

go to top  



Welcome!
Hello, Guest

Login if you have a Softpedia.com account.

Otherwise, register for one.

Main categories:
  • Adaptive Technologies
  • Adobe AIR Apps
  • Artistic Software
  • Communications
  • Database
  • Desktop Environment
  • Documentation
  • Education
  • Games
  • Home Automation
  • Information Management
  • Internet
  • Multimedia
  • Office
  • Printing
  • Programming
  • Religion
  • Science
  • Science and Engineering
  • Security
  • System
  • Text Editing&Processing
  • Utilities

  • Week's best:
  • Softpedia Linux RSS ...
  • Ubuntu 8.04.1 (Hardy...
  • Pidgin 2.4.3
  • Adobe Flash Player f...
  • The Gimp 2.4.6 / 2.5...
  • openSUSE Linux 11.0
  • Linux Kernel 2.6.25....
  • Super Grub Disk 0.97...
  • Skype 2.0.0.68
  • OpenOffice.org 2.4.1...
  • Mozilla Firefox 3.0 ...
  • Transmission 1.22
  • DeVeDe 3.9
  • Wine 1.1.0
  • wine-doors 0.1.2
  • Shoreline Firewall 4...
  • Linux Mint 5.0 (Elys...
  • Google Gadgets 0.9.3
  • Fedora 9
  • Opera 9.51
  • Latest Distributions
  • Caixa Mágica 12 Live...
  • Greenie Linux 3.0.4H
  • SYS 0.23 RC4
  • GoblinX Micro Editio...
  • Clonezilla-SysRescCD...
  • SchilliX 0.6.7
  • PING 2.01.13
  • Edubuntu 8.04.1 (Har...
  • Gobuntu 8.04.1 (Hard...
  • Ubuntu JeOS 8.04.1 (...
  • Xubuntu 8.04.1 (Hard...
  • Ubuntu Studio 8.04.1...
  • MythBuntu 8.04.1
  • Kubuntu 8.04.1 (Hard...
  • Kubuntu 8.04.1 (Hard...
  • Ubuntu 8.04.1 (Hardy...
  • Pardus Linux 2008
  • Poseidon Linux 3.0
  • GParted LiveCD 0.3.7...
  • Elive 1.7.8 Developm...
  • Latest Ubuntu Packages
  • KubXtra 8.04-7
  • GScrot 0.38
  • Rapache 0.4
  • Furius ISO Mount 0.9...
  • GNU Interactive Tool...
  • Advene 0.32
  • TrueCrypt 6.0
  • Listaller 0.1.16 Pre...
  • Newspost 2.1.1
  • Ortro 1.3.2a
  • ManDVD 2.5-3
  • QLandkarte 0.7.3
  • LOVE 0.3.2
  • DjVuSmooth 0.1.2
  • Amaya 10.0 / 10.1 Pr...
  • TriX 0.94
  • Alien GUI 0.99.1
  • XWii 2.5
  • Domain Technologie C...
  • XML Copy Editor 1.1....
  • © 2001 - 2008 Softpedia. All rights reserved.
    Softpedia™ and Softpedia™ logo are registered trademarks of SoftNews NET SRL.
    Copyright Information | Privacy Policy | Terms of Use | Contact Softpedia | Update your software | Archive