PyGalleryCrawler 0.1.1

PyGalleryCrawler project is a Web crawler for online image galleries.
PyGalleryCrawler project is a Web crawler for online image galleries.

Installation:

tar -xzf pygallerycrawler.tar.gz

cd pygallerycrawler

Extra python modules

psyco @ http://psyco.sourceforge.net

- performance

Python Imaging Library aka PIL @ http://www.pythonware.com/products/pil/

- thumbnails generation
- size verification

feedparser @ http://feedparser.org

- feed parser

Use:

chmod a+x pygallerycrawler.py

./pygallerycrawler.py the_url_you_want_crawl

Personal configuration:

If you make change in config.py, your changes will be overwrite at the next update. So, you can create a personal configuration and use it with the --config (or -c) switch.

cp config.py ~/pgc_config.py

vi ~/pgc_config.py

./pygallerycrawler.py -c ~/pgc_config.py the_url_you_want_crawl

Limitations:

· No check if some pictures are the same after download. Some gallery have a presentation link wich is one of the pictures. So the images will be double.

Requirements:

· Python

What's New in This Release:

· A check for the image size of both pictures and thumbnails was added.
· Regexp support was improved.
· An internal algorithm was cleaned.
· A simple feed which can be tried if there is direct link to the gallery was added.

last updated on:
April 18th, 2007, 21:35 GMT
price:
FREE!
developed by:
Cedric L HOMME
homepage:
gna.org
license type:
GPL (GNU General Public License) 
category:
ROOT \ Multimedia \ Graphics

FREE!

In a hurry? Add it to your Download Basket!

user rating

UNRATED
0.0/5
 

0/5

1 Screenshot
PyGalleryCrawler

Add your review!

SUBMIT