LINUX CATEGORIES:



NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>

7-DAY TOP DOWNLOAD

#
Program
Mandriva Linux
2008.1 / 2009 RC1

256,557
Fedora 9 / 10 Alpha
216,729
KNOPPIX Live DVD
5.3.1

210,180
Kororaa AIGLXgl Live
CD 0.3

180,544
Beryl 0.2.1
178,872
BackTrack 3.0
172,153
MPlayer 1.0 RC2
158,367
aircrack 2.41
158,233
VLC 0.9.0
113,406
Cedega 6.0
108,171

WEEK'S BEST

  • Softpedia Linux RS...
  • Ubuntu 8.04.1
  • Pidgin 2.5.1
  • Adobe Flash Player...
  • The Gimp 2.4.7 / 2...
  • openSUSE Linux 11....
  • Linux Kernel 2.6.2...
  • Super Grub Disk 0....
  • Skype 2.0.068
  • OpenOffice.org 2.4...
  • Mozilla Firefox 3....
  • Transmission 1.33
  • DeVeDe 3.11b
  • Wine 1.1.4
  • wine-doors 0.1.2
  • Shoreline Firewall...
  • Linux Mint 5.0
  • Google Gadgets 0.1...
  • Fedora 9 / 10 Alpha
  • Opera 9.52
  • Home / Linux / Text Editing&Processing / Markup

    Jericho HTML Parser 2.6



    No screenshots available
    Downloads: 0  Add to download basket  Tell us about an update
    User Rating:
    Rated by:
    Fair (2.5/5)
    17 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Martin Jericho | More programs
    LGPL / FREE
    June 25th, 2008, 08:40 GMT
    ROOT / Text Editing&Processing / Markup

     Read user reviews (0)  Add a review  Refer to a friend  Subscribe

     

    Jericho HTML Parser description

     

    Jerich HTML Parser is a simple but powerful java library allowing analysis and manipulation of parts of an HTML document.

    Jerich HTML Parser is a simple but powerful java library allowing analysis and manipulation of parts of an HTML document, including some common server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It also provides high-level HTML form manipulation functions.

    Jericho HTML Parser project is an open source library released under the GNU Lesser General Public License (LGPL). You are therefore free to use it in commercial applications subject to the terms detailed in the licence document.

    Here are some key features of "Jericho HTML Parser":

    · No parse tree of the entire document is ever generated. The document source text is searched only for the markup relevant to the current operation. This allows the library to analyse and modify documents containing incorrect or badly formatted HTML or any other server or client side code, script, macro or markup. Most other parsers can't handle content that they are not explicitly programmed to accept.
    · The beginning and end positions in the source text of all parsed segments are accessible, allowing modification of only selected segments of the document without having to reconstruct the entire document from a parse tree. This feature, in combination with the one above, makes the toolkit extremely powerful in its simplicity.
    · Provides a simple but comprehensive interface for the analysis and manipulation of HTML form controls, including the extraction and population of initial values, and conversion to read-only or data display modes. Analysis of the form controls also allows data received from the form to be stored and presented in an appropriate manner.
    · ASP, JSP, PSP, PHP and Mason server tags can be registered for recognition by the parser, and are recognised as accurately as is possible without incorporating actual parsers for these languages into the library. The library then allows any of these segments to be ignored when parsing the rest of the document so that they do not interfere with the HTML syntax. (see Segment.ignoreWhenParsing())
    · Custom tag types can be easily defined and registered for recognition by the parser.

    What's New in This Release:

    · This version includes important bugfixes and the following enhancements.
    · Non-server tags are no longer recognized inside server tags.
    · Microsoft downlevel-revealed conditional comments are recognized.
    · All unnecessary white space may be removed from a source document.
    · Various other enhancements were made to existing features.

      


    TAGS:

    HTML parser | java library | HTML manipulator | Jericho | HTML | parser

    Related downloads IT News Popular downloads New additions   Latest reviews  
    Java Wikipedia API 3.0.8
    Java Wikipedia API is a library that converts Wikipedia syntax to HTML.
    locale4j 1.1.4
    locale4j is a Java library created to work with localization data.
    MicroLog 0.5.0.2
    MicroLog is a scaleable logging library for use with Java ME (aka J2ME), compatible with Log4j.
    HTML::TableParser 0.38
    HTML::TableParser is Perl module to extract data from an HTML table.
    CodeSounding 1.4
    CodeSounding is a Java sonification library: the sound produced running a .class depends on its source code before compilation.


    HTML code for linking to this page:


    Go to top



    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   ENTER NEWS SITE   |   ENGLISH BOARD   |   ROMANIAN FORUM