Wikipedia Dump Reader display the text-only wikipedia compressed dumps.. #Wikipedia dumps #Dump reader #Wikipedia reader #Wikipedia #Dump #Reader
After reading on some blog from planetkde about an offline wikipedia reader, I decided to make available some python-Qt program I wrote some months ago as i realized it might be useful for more people than myself.
Wikipedia Dump Reader display the text-only wikipedia compressed dumps, currently available at http://download.wikimedia.org/backup-index.html, generally named something like pages-articles.xml.bz2.
It's fairly useable now for basic wikipedia reading, altough lots of feature might still not work, since i originally intended this reader for my own usage only.
Features includes a Qt viewer with basic text markup, following links, ability to read directly on the .bz2 compressed file (altough some index creations step is needed on first run), tab-like list of articles with load-in-the-background by default, very light source-code.
The current code required PyQt4, altough some old unmaintened PyQt 3 code is included.
Tested on Fedora Core 4 and Kubuntu with PyQt4.1 (Python 2.4, Qt 4.2).
1. on the commandline, run:
python dumpReader.py
or just click on it from your favorite file manager
2. Browse and select the archive (some file probably named *.xml.bz2)
3. If it's the first time, an index is created, which can take some time. Currently, the program need write permission on the same directory.
4. The main windows contains the article title area (top), main text area (left) and article history (right). You can go to an article by typing its name then click the "Go" button, or by clicking a link from the main text area. By default, clicking a link load the article in the background. The search-box area allows to keyword search among the articles' title. You can also go to a random article by clicking "Go" with an empty entry.
System requirements
What's new in Wikipedia Dump Reader 0.2.10:
- Use a new indexing scheme for the entrylist - articles load faster now
- Upgrade path for old indexing scheme
- Utf8 fixes for non-ascii pathnames
- experimental RPM package - feedback welcome at the project website : https://launchpad.net/wikipediadumpreader
Wikipedia Dump Reader 0.2.10
add to watchlist add to download basket send us an update REPORT- runs on:
- Linux
- filename:
- wikipediaDumpReader-0.2.10.tar.gz
- main category:
- Education
- developer:
- visit homepage
4k Video Downloader 1.5.3.0080 Plus / 4.30.0.5655
Bitdefender Antivirus Free 27.0.35.146
calibre 7.9.0
Zoom Client 6.0.3.37634
ShareX 16.0.1
IrfanView 4.67
Microsoft Teams 24060.3102.2733.5911 Home / 1.7.00.7956 Work
7-Zip 23.01 / 24.04 Beta
Context Menu Manager 3.3.3.1
Windows Sandbox Launcher 1.0.0
- 7-Zip
- Context Menu Manager
- Windows Sandbox Launcher
- 4k Video Downloader
- Bitdefender Antivirus Free
- calibre
- Zoom Client
- ShareX
- IrfanView
- Microsoft Teams