Linux SoftwareInternetHTTP (WWW)Simple Page Archive 1.3

Simple Page Archive 1.3


Simple Page Archive is a mirror and archiving tool to copy Web pages you are interested in
Developer:   Alexander Meisel
      more software by author →
Price:  0.00
License:   GPL (GNU General Public License)
File size:   5K
Language:   
OS:   
Rating:   0 /5 (0 votes)
Your vote:  
enlarge screenshot


Simple Page Archive is a mirror and archiving tool to copy Web pages you are interested in. The CGI script downloads all images and CSS files to preserve the mirrored Web page.

It works with the ZEUS (www.zeus.com) and Apache (www.apache.org) web servers. SPA is an simple CGI script which allows you to mirror a single web page. It stores all images and CSSs locally, so you are able to browse through the archive without the need of the original, images being availiable.

The script is dead simple to install!
1. First you need to download "Beatiful Soup" (BS) from http://www.crummy.com/software/BeautifulSoup/ which is a quite simple but very good HTML Parser (not like the one in the Python distro .. which is acutally broken). Please "install" the BS module in your site-packages directory of python.
2. Copy the "index.py" file to directory of your "web archive".
3. Edit the script and change wroot variable in Configuration section at the beginning of the script to the document root directory of your web archive (NOT the physical path on the disk!)
3.1 If you are behind a firewall and you need proxy support, add your proxy server in the Configuration section as well.
4. Make sure you have CGI support enabled in your web server.
5. Make sure index.py is being called as the default DirectoryIndex.
6. Make sure the permissions of the index.py file and the directory are set
correctly. The CGI process must be able to write to your archive directory.
7. Open a browser and try to mirror a page ;-)

What's New in This Release:
  • Added filter support
  • Output now sorted by date
    tags your web  make sure  you are  the script  web archive  configuration section  the cgi  cgi script  the index  web page  all images  you need  images and  

    Download Simple Page Archive 1.3


     http://www.meisel.cc/software/spa/SimplePageArchive-1.3.tar.bz2


    Authors software

    Uber Project Document Management System 1.0 (by Alexander Meisel)
    Project document and management tracking system written in PHP using PostgreSQL to store user, project and document related data and

    Simple Page Archive 1.3 (by Alexander Meisel)
    Simple Page Archive is a mirror and archiving tool to copy Web pages you are interested in


    Similar software

    Simple Page Archive 1.3 (by Alexander Meisel)
    Simple Page Archive is a mirror and archiving tool to copy Web pages you are interested in

    Template::Tutorial 2.15 (by Andy Wardley)
    Template::Tutorial are template toolkit tutorials.

    This section includes tutorials on using the Template Toolkit

    Archive sort 0.1 (by Jason Dunsmore)
    Archive sort is a bash script that sorts directories into manageable 4.4GB directories for the purpose of archiving onto DVDs.

    It

    Zorbstats 0.24 (by P. Fabrice)
    Zorbstats is a simple Web statistics generator like BigBrotherWebstats but using PHP and MySQL .

    Follow next steps to install the

    ln_local 1.1.1 (by Philippe Brochard)
    ln_local is a simple shell script for managing the installation of software packages (typically in /usr/local)

    Thumbnail AutoIndex 2.0 (by Tomasz Sterna)
    Thumbnail AutoIndex is a thumbnail index generation script designed to be a companion to mod_autoindex for Apache

    Python Web Objects 1.3 (by James Turner)
    Python Web Objects is a dynamic page generation system that allows the developer to embed Python code inside HTML

    Archive::Builder 1.06 (by Adam Kennedy)

    Twibright Twig 1.0 (by Karel Kulhavy)
    Twibright Twig is an image gallery program

    Archive::Tyd 0.02 (by C. J. Kirsle)
    Archive::Tyd is a Perl extension for simple file archiving.

    SYNOPSIS

    use Archive::Tyd;

    my $tyd = new Archive::Tyd (passw


    Other software in this category

    SquirrelMail 1.5.1 (by The SquirrelMail Project Team)
    SquirrelMail is a standards-based Webmail package written in PHP4

    Tiki CMS/Groupware 1.9.7 (by Luis Argerich)

    Downloader for X 2.5.7 (by Chuchelo)
    Downloader for X is a tool for downloading files from the Internet via both HTT

    Links 2.1pre26 (by Martin Pergel)
    Links is graphics and text mode WWW browser, similar to Lynx

    Mozilla Firefox 1.5.0.8 (by Mozilla Project)

  •     search


    Featured Software

    jEdit 4.3 pre8
    jEdit is an Open Source text editor written in Java

    Opera 9.02
    Surf the Internet in a safer, faster, and easier way with Opera browser

    GNU Aspell 0.60.4
    GNU Aspell is a Free and Open Source spell checker designed to eventually replace Ispell


    Subscribe in Rojo
    Google Reader
    Add to My Yahoo!

    Add to My AOL
    Subscribe with Bloglines
    Subscribe in NewsGator Online
    Add 'nixbit linux software' to Newsburst from CNET News.com
    del.icio.us nixbit linux software


    Top tags