Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.6 / 3....
  • Linux Kernel 3.0.82 LTS...
  • KDE Software Compilatio...
  • PulseAudio 4.0
  • Wireshark 1.10.0
  • NetworkManager 0.9.8.2
  • LibreOffice 3.6.6 / 4.0...
  • SystemRescueCd 3.7.0
  • Linux Kernel 3.10 RC6
  • Ubuntu Tweak 0.8.5
  • Home > Linux > Science

    CorpusSearch 2.002.71

    Download button

    No screenshots available
    Downloads: 1,046  View global page NEW!  Tell us about an update
    User Rating:
    Rated by:
    Fair (2.1/5)
    12 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Beth Randall | More programs
    GPL / FREE
    February 18th, 2010, 18:50 GMT [view history]
    ROOT / Science

     Read user reviews (0)  Refer to a friend  Subscribe

    CorpusSearch description

    A tool that finds syntactic structures in a corpus

    CorpusSearch is a tool that finds syntactic structures in a corpus of annotated sentence trees. It can be used as a research tool on a corpus, or as a development tool for building the corpus.

    CorpusSearch 2 is a Java program that supports research in corpus linguistics. It is useful both for the construction of syntactically annotated (parsed) corpora and for searching them.

    Both the input and output files of CorpusSearch are ordinary text files, with syntactic annotations in the Penn-Treebank format.

    INSTALLATION:

    1. Download CS.jar
    2. Put the file in a convenient place.
    3. Open a terminal
    4. Assuming that you have put CS.jar into the folder FOO, the following line will start CorpusSearch in any flavor of Unix that has Java installed (including Mac OS X):

    % java -classpath /FOO/CS.jar csearch/CorpusSearch

    Don't type the '%'. That stands for the terminal prompt. Note that we are assuming Unix path syntax and that FOO is a top-level directory. The classpath must give the full path, using appropriate syntax.


    Product's homepage

    Here are some key features of "CorpusSearch":

    · Tree search configurations in CS are defined in a Boolean query language over tree predicates.
    · The output of a CS search is itself searchable.
    · CS runs on any Java-supported platform.
    · The CS query language contains many features to make searching easier and more intuitive for linguistic research.
    · CS has extensive user configuration options.

    Requirements:

    · Java 2 Standard Edition Runtime Environment

    What's New in This Release: [ read full changelog ]

    · Added extend_span to revision software.
    · More cleaning up of "collapse".

      


    TAGS:

    java application | research tool | corpus research | corpus | research | syntactic

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM