Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>

WEEK'S BEST

  • BackTrack 5 R1
  • Wine 1.2.3 / 1.4 RC3
  • Mozilla Firefox 10...
  • Ubuntu 11.04
  • Angry Birds 1.1.2.1
  • Ubuntu 10.04.3 LTS
  • Linux Kernel 3.2.6
  • Ubuntu Manual 10.10
  • Adobe Flash Player...
  • Pidgin 2.10.1
  • Home > Linux > Utilities

    Sherlock Holmes 4.0

    Download button

    No screenshots available
    Downloads: 1,942  View global page NEW!  Tell us about an update
    User Rating:
    Rated by:
    Excellent (5.0/5)
    1 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Martin Mares | More programs
    GPL / FREE
    April 13th, 2009, 19:45 GMT
    ROOT / Utilities

     Read user reviews (1)  Refer to a friend  Subscribe

    Sherlock Holmes description

    A universal search engine.

    Sherlock Holmes is a universal search engine, a system for gathering and indexing of textual data (text files, web pages, etc), both locally and over the network.


    Product's homepage

    Here are some key features of "Sherlock Holmes":

    · Gathers files via HTTP or from local files.
    · Parses text files, HTML, PDF, and several other formats using external parsers (such as MS Word and PostScript).
    · The whole system is modular, so adding your own data sources or parsers is just matter of plugging in right module (well, usually also writing it).
    · Works well in mixed charset environment.
    · Considers multiple occurences of the same file (even with minor changes) a single document with multiple URL's.
    · Everything is highly configurable. You can write filtering rules in a special language which allows to tweak configuration variables depending on the document being processed.
    · Searching of words, phrases, and boolean expressions. Searching in filenames and link texts.
    · Proximity search and proximity weighting of regular searches.
    · Recognition of languages, easy integration of stemmers and synonymic dictionaries.
    · Spelling checker based on word frequencies observed in the indexed data, hinting the user that his query might be misspelled.
    · Search results include context in each document.
    · Scales well to tens of millions of documents on normal PC hardware.
    · User interface (the front-end) is completely separated from the rest of the system, making it easy to modify and also to embed the search engine in existing applications.
    · Downloaded files and indices are compressed to save space.

      


    TAGS:

    search engine | gathering textual data | indexing textual data | Sherlock | Holmes | search



    HTML code for linking to this page:


    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM