w2m 0.2

WWW spider producing an adjacency matrix
w2m is essentially a Python module that provides the W2M class. The W2M class is a www spider which explores a subset of the www, extracts the adjacency matrix of the oriented graph, and saves it as a GNU-Octave .mat file.

If numpy and matplotlib are available, it computes the spectrum of the adjacency matrix and plots the result. The list of vertices and an image of the matrix are also produced. The file w2m.py can be used as a module or as a script handling command line arguments. W2M uses threading.

W2M does not depend on modules outside the standard library, except numpy and matplotlib, which are optional. This makes more easy the possible 2to3 transition.

FILE LIST
  w2m.py          Python module (source)
  examples/       examples on small Wikipedias (Kabyle and Mahori)
  setup.py        setup file for distutils http://docs.python.org/distutils/
  PKG-INFO        package information file produced by setup.py (distutils)

TYPICAL USAGE AS A SCRIPT
  chmod +x w2m.py
  ./w2m.py --help
  # and if you use the Bash shell:
  /w2m.py --start someurl > output.lst 2>&1 & tail -f output.lst

last updated on:
January 6th, 2012, 4:48 GMT
price:
FREE!
developed by:
Djalil CHAFAI
homepage:
djalil.chafai.net
license type:
GPL (GNU General Public License) 
category:
ROOT \ Internet \ HTTP (WWW)

FREE!

In a hurry? Add it to your Download Basket!

user rating

UNRATED
0.0/5
 

0/5

Add your review!

SUBMIT