TRMiner

  234 downloads
1.1 MIT/X Consortium License    
  UNRATED
Token based Regular Expression Miner

description

download

specifications

TRMiner is a Python utility that aims at scientific data curators. It allows to rapidly prune large collections of scientific publications to sentences relevant for a given mining goal.

This is achieved in two steps. First, texts are tranlated into sequences of tokens for relevant words. Second, regular expression patterns are searched in the token sequences. Matches are translated back into natural language sentences and provided as HTML5 based output, that allows manual curators to sort and rate matches for further reading and information extraction.
read more   
Last updated on March 6th, 2012

0 User reviews so far.

SUBMIT