PyStemmer 1.3.0

Snowball stemming algorithms, for information retrieval

LICENSE TYPE:
MIT/X Consortium License 
DEVELOPED BY:
Richard Boulton
HOMEPAGE:
snowball.tartarus.org
CATEGORY:
ROOT \ Text Editing&Processing \ Indexing
PyStemmer provides access to efficient algorithms for calculating the "stemmed" form of a word: the word with most of its common morphological endings removed, which ideally represents a shared linguistic base form. Stemming is most useful when building search engines and other information retrieval software. For example, a search with stemming enabled should be able to find a document containing "cycling" given the query "cycles".
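
The search scenario described above can be sketched in a few lines of Python. This is only an illustration of the idea: `toy_stem`, `build_index` and `search` are hypothetical helpers written for this example, and the suffix stripping is a deliberately crude stand-in, not one of the Snowball algorithms that PyStemmer wraps.

```python
def toy_stem(word):
    """Crude suffix stripper for illustration only; NOT a Snowball algorithm."""
    word = word.lower()
    for suffix in ("ing", "es", "s"):
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_index(documents):
    """Map each stemmed token to the set of document ids that contain it."""
    index = {}
    for doc_id, text in documents.items():
        for token in text.split():
            index.setdefault(toy_stem(token), set()).add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every stemmed query term."""
    results = None
    for term in query.split():
        hits = index.get(toy_stem(term), set())
        results = hits if results is None else results & hits
    return results or set()


documents = {1: "cycling in the mountains", 2: "cooking at home"}
index = build_index(documents)
matches = search(index, "cycles")
print(matches)
```

Because both the indexed text and the query pass through the same stemmer, "cycles" and "cycling" reduce to the same index key, so the query finds document 1 even though the exact word never appears in it.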

PyStemmer provides algorithms for several (mainly European) languages by wrapping the libstemmer library from the Snowball project in a Python module.

It also provides access to the classic Porter stemming algorithm for English. Although this has been superseded by an improved algorithm, the original may be of interest to information retrieval researchers who wish to reproduce the results of earlier experiments.
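
A minimal usage sketch follows, assuming the package is installed and exposes its `Stemmer` module with the `algorithms()`, `Stemmer(name)` and `stemWord()` calls described in the PyStemmer documentation. The `stem_pair` helper and the import guard are additions made for this example, so that the snippet still runs where the package is missing.

```python
try:
    import Stemmer  # module installed by the PyStemmer package
except ImportError:
    Stemmer = None  # assumption: fall back gracefully when it is absent


def stem_pair(algorithm, first, second):
    """Stem two words with the named algorithm; None if PyStemmer is missing."""
    if Stemmer is None:
        return None
    stemmer = Stemmer.Stemmer(algorithm)
    return stemmer.stemWord(first), stemmer.stemWord(second)


if Stemmer is None:
    print("PyStemmer is not installed; nothing to demonstrate.")
else:
    # List the algorithm names accepted by Stemmer.Stemmer().
    print(Stemmer.algorithms())
    # "english" selects the improved algorithm, "porter" the classic one.
    for algorithm in ("english", "porter"):
        print(algorithm, stem_pair(algorithm, "cycling", "cycles"))
```

Comparing the output of the two algorithm names on the same word list is a simple way to see where the classic Porter stemmer and its successor differ.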

Last updated on November 8th, 2009
