Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.3 / 3....
  • LibreOffice 3.6.6 / 4.0.3
  • MPlayer 1.1.1
  • systemd 204
  • Arch Linux 2013.05.01
  • Blender 2.67a
  • KDE Software Compilatio...
  • CrunchBang Linux Stable...
  • Elementary OS 0.1 / 0.2...
  • SystemRescueCd 3.6.0
  • Home > Linux > Programming > Perl Modules

    Lingua::RU::Detect 1.1

    Download button

    No screenshots available
    Downloads: 347  View global page NEW!  Tell us about an update
    User Rating:
    Rated by:
    Good (3.3/5)
    13 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Andrew Shitov | More programs
    Perl Artistic License / FREE
    June 11th, 2008, 15:15 GMT
    ROOT / Programming / Perl Modules

     Read user reviews (0)  Refer to a friend  Subscribe

    Lingua::RU::Detect description

    Lingua::RU::Detect is a Perl module that offers heuristics for guessing encoding sequence.

    Lingua::RU::Detect is a Perl module that offers heuristics for guessing encoding sequence.

    SYNOPSIS

    use Lingua::RU::Detect "detect_enc";
    say Dumper(detect_enc("бНОПНЯ"));
    say Dumper(detect_enc("бОДТЕК"));

    ABSTRACT

    Lingua::RU::Detect make a guess of how the original text was reconverted with a sequence of different encodings.

    This module is a heart of http://decodr.ru/ website which provides a tool for automatic recovering Russian texts which were damaged by multiple transcodings. Two and three item chains are now available to detect, and the speed is much higher than that of programmes based on a dictionary.

    The result of calling detect_enc subroutine is a list of encoding pairs. To get original UTF-8 string you need to make all these transcodings in the order specified in the array returned. For example:

    $VAR1 = [
    [
    'UTF-8',
    'ISO-8859-5'
    ],
    [
    'KOI8-R',
    'UTF-8'
    ]
    ];

    If no reencoding is needed, result is an empty array.

    Product's homepage

    Requirements:

    · Perl

      


    TAGS:

    encoding sequence | language detector | Perl module | russian | encoding | sequence

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM