Pavuk

Pavuk is a Webgrabber with an optional GTK GUI.
Download

Pavuk Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL
  • Price:
  • FREE
  • Publisher Name:
  • Stevo Ondrejicka
  • Publisher web site:

Pavuk Tags


Pavuk Description

Pavuk is a Webgrabber with an optional GTK GUI. Pavuk is UNIX program used to mirror contents of WWW documents or files. It transfers documents from HTTP, FTP, Gopher and optionaly from HTTPS (HTTP over SSL) servers. The project has an optional GUI based on GTK2 widget set.Here are some key features of "Pavuk":· recursive downloading based on links inside HTML documents· supports CSS and HTML4.0· local tree of documents is similiar to original (located on remote server)· transformation of Gopher and FTP directories into HTML document· HTML links translation from remote to local or local to remote· supports proxy servers (HTTP, FTP, SSL, HTTP gateway for FTP, HTTP gateway for Gopher,SOCKS 4/5)· supports authentification against HTTP server and proxy HTTP server· Can provide detailed timing information about transfers· has many options to define the set of documents for transfer :limit on serverlimit on domainlimit on prefixlimit on suffixlimit on document tree levellimit on maximal and minimal size of filelimit on type of document (as yet only for document transfered via HTTP or HTTPS)matching patterns on URLs and document namesand many other · does restart of transfer (only when server support it) after program break, link down, timeout or some other error· stalled connection should timeout after given period· can be run in differend modes:normal - simlpe recursionsync - pavuk looks for newer versions of already downloaded documnts/filessinglepage - download of single document with all inline objects (pictures, backgrounds, sounds, ...)resumereget - looks for documents which transfer were broken and try to download missing partssinglereget - retries to transfer file until is not succesfuly downloadedlinkupdate - scans local tree of documents and try to update links inside HTML document when some linked documents are allready downloaded, but it is not reflected indontstore - used to fetch files to cache/proxy serverreminder - used to inform user about changes on remote HTTP servers. · can be run on terminal or inside Xwindows window· Xwindows interface based on GTK2 toolkit· DnD of URLs with GTK2.0· fetching URLs from clipboard· have Native Language Support based on GNU gettext· asynchronous buffered DNS name resolving when runing in X-windows· so called dirty FTP proxy support (using CONNECT request to HTTP proxy)· can be used as full featured FTP mirroring tool (preserves modification time,permisions, symbolic links, ...)· optional transfer speed limitation max./min.· very customizable URL - local filename mapping algoritm· automaticaly loads copy from Netscape browser cahce if enabled· can remove advertisement banners from HTML pages· HTTP/1.1 support· FTP over SSL· supports POST requests an in GTK UI have also dialog for intercative HTML forms filling· supports many formats of FTP directory listings (Unix BSD/SYSV, EPFL, Novel, VMS, MS DOS/Windows)· optional multithreading support· multiple round-robin used HTTP proxies· supports javascript via regular expression patterns· supports NTLM authorization· has JavaScript bindings to allow scripting of particular tasks· allows user to define custom FTP login proceduresWhat's New in This Release:· bug fixes


Pavuk Related Software