Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.6 / 3....
  • Linux Kernel 3.0.82 LTS...
  • KDE Software Compilatio...
  • PulseAudio 4.0
  • Wireshark 1.10.0
  • NetworkManager 0.9.8.2
  • LibreOffice 3.6.6 / 4.0...
  • SystemRescueCd 3.7.0
  • Linux Kernel 3.10 RC6
  • Ubuntu Tweak 0.8.5
  • Home > Linux > Internet > Plone Extensions

    collective.portlet.similarcontent 1.5

    Download button

    No screenshots available
    Downloads: 89  View global page NEW!  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Matt Hamilton | More programs
    ZPL / FREE
    December 13th, 2011, 12:31 GMT [view history]
    ROOT / Internet / Plone Extensions

     Read user reviews (0)  Refer to a friend  Subscribe

    collective.portlet.similarcontent description

    A Plone portlet that uses the catalog internals to find 'similar' content to the page you are looking at

    collective.portlet.similarcontent is a Plone portlet that uses some deep dark data structures within the ZCatalog and ZCTextindex, so it could be brittle in the future if those structures are changed. Then again, they have been the same for the past 8 years or so.

    This portlet also runs in linear time relative to the number for documents you have in your site, so it could well slow things down. That said I've tried to make it pretty efficient.

    How it Works

    In a nutshell, this portlet compares the text content of an object with all other objects on the site to find other objects with a similar content. The steps are as follows:

    - Find the path of this document
    - Look up the record_id (docid) of this path in the catalog
    - Look in the SearchableText index to find all word ids (wids) in this document
    - Work out the top 20 most 'important' words in this document [*]
    - For each of the top 20 words, find all documents containing any of those words
    - Use a vector space model to measure similarity of each candidate document to our top 20 words
    - Return the top 10 most similar documents.

    [*] We work out the top 20 words using a TF*IDF algorithm (the same used in ZCTextIndex.OkapiIndex) to find the words that appear proportionately high in this document compared to all documents in general.


    Product's homepage

    Requirements:

    · Python
    · Plone CMS

    What's New in This Release: [ read full changelog ]

    · Fixed portlet edit permission used [vangheem]

      


    TAGS:

    similar content | Plone portlet | Plone | similar | content

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM