Google N-Gram-Patterns 1.0

Google N-Gram-Patterns seeks to build a co-occurrence network based on n-gram data provided by Google Inc.

  Add it to your Download Basket!

 Add it to your Watch List!

0/5

Rate it!
send us
an update
LICENSE TYPE:
GPL (GNU General Public License) 
USER RATING:
2.9/5 14
DEVELOPED BY:
Anurag Jain, Bin Lan, Darshan Paranjap...
HOMEPAGE:
n-gram-patterns.sourceforge.net
CATEGORY:
ROOT \ Internet \ HTTP (WWW)
Google N-Gram-Patterns seeks to build a co-occurrence network based on n-gram data provided by Google Inc. This project presents an easy and fast way to analyze Google n-gram data, which is contributed by Google Inc.

Google n-gram data consists of a huge amount of word information based on real life searching queries entered by internet users. The huge amount of data makes it so hard to analyze the whole data set. In this project, we present a possible parallel solution to build and access co-occurrence network using Google n-gram data.

Moreover, we use the co-occurrence network to find relationship (path) between words in this large corpus. We also build a common library based on C/MPI for all the similar co-occurrence network analysis programs. This method was tested on both Blade system and Altix system from MSI at University of Minnesota Twin City campus.

Last updated on January 9th, 2008

#Google n-gram #n-gram databas #extract database information #Google #n-gram #database #information

Add your review!

SUBMIT