w2m

  196 downloads
0.2 GPL (GNU General Public License)    
  UNRATED
WWW spider producing an adjacency matrix

description

download

specifications

w2m is essentially a Python module that provides the W2M class. The W2M class is a www spider which explores a subset of the www, extracts the adjacency matrix of the oriented graph, and saves it as a GNU-Octave .mat file.

If numpy and matplotlib are available, it computes the spectrum of the adjacency matrix and plots the result. The list of vertices and an image of the matrix are also produced. The file w2m.py can be used as a module or as a script handling command line arguments. W2M uses threading.

W2M does not depend on modules outside the standard library, except numpy and matplotlib, which are optional. This makes more easy the possible 2to3 transition.

FILE LIST
  w2m.py          Python module (source)
  examples/       examples on small Wikipedias (Kabyle and Mahori)
  setup.py        setup file for distutils http://docs.python.org/distutils/
  PKG-INFO        package information file produced by setup.py (distutils)

TYPICAL USAGE AS A SCRIPT
  chmod +x w2m.py
  ./w2m.py --help
  # and if you use the Bash shell:
  /w2m.py --start someurl > output.lst 2>&1 & tail -f output.lst
read more   
Last updated on January 6th, 2012

0 User reviews so far.

SUBMIT