DKPro Similarity 2.1.0

A cross-platform, open source Java framework for computing text similarity
DKPro Similarity is a free, open source Java framework for computing text similarity at three levels: between two terms, between two lists of strings that represent entire documents, and between two texts held in a UIMA JCas representation.

DKPro Similarity's main goal is to provide a comprehensive repository of text similarity measures that are implemented using standardized interfaces. It is designed as an add-on for the DKPro Core software.

The framework comprises a range of measures, from simple ones based on n-grams and common subsequences to more complex ones such as high-dimensional vector comparisons.
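
To make the idea of a simple n-gram measure concrete, here is a minimal sketch in plain Java of a word n-gram Jaccard similarity between two token lists. It does not use DKPro Similarity and is not the framework's implementation; the class name `NGramJaccardSketch`, its method names, and the convention of returning 1.0 when neither input yields any n-grams are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration class; not part of DKPro Similarity.
public class NGramJaccardSketch {

    // Collect every contiguous run of n tokens as a set of n-grams.
    public static Set<List<String>> ngrams(List<String> tokens, int n) {
        Set<List<String>> result = new HashSet<>();
        for (int i = 0; i + n <= tokens.size(); i++) {
            result.add(new ArrayList<>(tokens.subList(i, i + n)));
        }
        return result;
    }

    // Jaccard similarity: |intersection| / |union| of the two n-gram sets.
    public static double similarity(List<String> a, List<String> b, int n) {
        Set<List<String>> x = ngrams(a, n);
        Set<List<String>> y = ngrams(b, n);
        if (x.isEmpty() && y.isEmpty()) {
            return 1.0; // assumed convention: two empty n-gram sets count as identical
        }
        Set<List<String>> intersection = new HashSet<>(x);
        intersection.retainAll(y);
        Set<List<String>> union = new HashSet<>(x);
        union.addAll(y);
        return (double) intersection.size() / union.size();
    }

    public static void main(String[] args) {
        // Two documents given as lists of (for example, lemmatized) tokens.
        List<String> doc1 = Arrays.asList("the", "quick", "brown", "fox");
        List<String> doc2 = Arrays.asList("the", "quick", "brown", "dog");
        // The documents share 2 of 4 distinct bigrams.
        System.out.println(similarity(doc1, doc2, 2)); // prints 0.5
    }
}
```

Taking lists of strings as input mirrors the "two lists of strings that represent entire documents" case described above; a term-level or JCas-based measure would need a different input type.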

Last updated on: October 10th, 2013, 14:56 GMT
Developed by: Richard Eckart
License type: The Apache License 2.0


Screenshot: DKPro Similarity - An example of computing similarity between two given texts that are already lemmatized
What's New in This Release:
  • This version contains major bug fixes over the somewhat unstable 2.0.0 release.