Pavuk 0.9.33 review


Pavuk is a UNIX program used to mirror the contents of WWW documents or files.

License: GPL (GNU General Public License)
Developer: Stevo Ondrejicka

Pavuk is a UNIX program used to mirror the contents of WWW documents or files. It transfers documents from HTTP, FTP, Gopher and, optionally, HTTPS (HTTP over SSL) servers. Pavuk has an optional GUI based on the GTK2 widget set.

Here are some key features of "Pavuk":
recursive downloading based on links inside HTML documents
supports CSS and HTML4.0
local tree of documents is similar to the original (located on the remote server)
transformation of Gopher and FTP directories into HTML document
HTML links translation from remote to local or local to remote
supports proxy servers (HTTP, FTP, SSL, HTTP gateway for FTP, HTTP gateway for Gopher, SOCKS 4/5)
supports authentication against HTTP servers and HTTP proxy servers
can provide detailed timing information about transfers
has many options to define the set of documents to transfer:
    limit on server
    limit on domain
    limit on prefix
    limit on suffix
    limit on document tree level
    limit on maximal and minimal size of file
    limit on type of document (as yet only for documents transferred via HTTP or HTTPS)
    matching patterns on URLs and document names
    and many others
restarts a transfer (only when the server supports it) after a program break, dropped link, timeout or some other error
stalled connections time out after a given period
can be run in different modes:
    normal - simple recursion
    sync - pavuk looks for newer versions of already downloaded documents/files
    singlepage - downloads a single document with all inline objects (pictures, backgrounds, sounds, ...)
    resumereget - looks for documents whose transfer was broken and tries to download the missing parts
    singlereget - retries the transfer of a file until it is successfully downloaded
    linkupdate - scans the local tree of documents and tries to update links inside HTML documents when some linked documents are already downloaded, but it is not reflected in
    dontstore - used to fetch files to a cache/proxy server
    reminder - used to inform the user about changes on remote HTTP servers
can be run in a terminal or under the X Window System
X Window interface based on the GTK2 toolkit
drag and drop of URLs with GTK 2.0
fetching URLs from the clipboard
has Native Language Support based on GNU gettext
asynchronous buffered DNS name resolving when running under X Window
so-called dirty FTP proxy support (using a CONNECT request to an HTTP proxy)
can be used as a full-featured FTP mirroring tool (preserves modification time, permissions, symbolic links, ...)
optional transfer speed limits (maximum and minimum)
highly customizable algorithm for mapping URLs to local filenames
automatically loads a copy from the Netscape browser cache if enabled
can remove advertisement banners from HTML pages
HTTP/1.1 support
FTP over SSL
supports POST requests and, in the GTK UI, also has a dialog for interactive filling of HTML forms
supports many formats of FTP directory listings (Unix BSD/SYSV, EPFL, Novell, VMS, MS DOS/Windows)
optional multithreading support
multiple HTTP proxies used in round-robin fashion
supports JavaScript via regular expression patterns
supports NTLM authorization
has JavaScript bindings to allow scripting of particular tasks
allows user to define custom FTP login procedures
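
To make the document-selection limits in the feature list more concrete (server, domain, prefix, suffix and tree level), here is a minimal Python sketch of how a mirroring tool might decide whether to follow a URL. It is an illustration only and not Pavuk's actual code or option syntax; the names `TransferLimits` and `url_allowed` are hypothetical.

```python
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class TransferLimits:
    """Hypothetical container for the kinds of limits listed above."""
    allowed_servers: set = field(default_factory=set)  # exact host names; empty = any
    allowed_domains: tuple = ()    # host suffixes such as ".example.org"; empty = any
    allowed_prefixes: tuple = ()   # URL path prefixes; empty = any
    denied_suffixes: tuple = ()    # file name suffixes to skip
    max_level: int = 0             # maximum link depth; 0 = unlimited


def url_allowed(url: str, level: int, limits: TransferLimits) -> bool:
    """Return True if the URL passes every configured limit."""
    parts = urlparse(url)
    host = (parts.hostname or "").lower()
    path = parts.path or "/"

    # Limit on server: the host must be one of the listed servers.
    if limits.allowed_servers and host not in limits.allowed_servers:
        return False
    # Limit on domain: the host must end with one of the listed domains.
    if limits.allowed_domains and not host.endswith(limits.allowed_domains):
        return False
    # Limit on prefix: the path must start with one of the listed prefixes.
    if limits.allowed_prefixes and not path.startswith(limits.allowed_prefixes):
        return False
    # Limit on suffix: skip paths ending with a denied suffix.
    if limits.denied_suffixes and path.lower().endswith(limits.denied_suffixes):
        return False
    # Limit on document tree level: stop following links beyond max_level.
    if limits.max_level and level > limits.max_level:
        return False
    return True


# Example configuration (all values are made up for illustration).
limits = TransferLimits(
    allowed_domains=(".example.org",),
    allowed_prefixes=("/docs/",),
    denied_suffixes=(".zip",),
    max_level=2,
)
```

A recursive downloader would call a check like `url_allowed(link, level, limits)` for every link found in an HTML document before queuing it, which is what keeps the mirror from wandering off the intended site.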
