mod_proxy_html 2.5.1 review
Downloadmod_proxy_html is an output filter to rewrite HTML links in a proxy situation, to ensure that links work for users outside the proxy
|
|
mod_proxy_html is an output filter to rewrite HTML links in a proxy situation, to ensure that links work for users outside the proxy. mod_proxy_html serves the same purpose as Apache's ProxyPassReverse directive does for HTTP headers, and is an essential component of a reverse proxy.
For example, if a company has an application server at appserver.example.com that is only visible from within the company's internal network, and a public webserver www.example.com, they may wish to provide a gateway to the application server at http://www.example.com/appserver/.
When the application server links to itself, those links need to be rewritten to work through the gateway. mod_proxy_html serves to rewrite < a href="http://appserver.example.com/foo/bar.html" >foobar< /a > to < a href="http://www.example.com/appserver/foo/bar.html" >foobar< /a > making it accessible from outside.
Capabilities
The original capabilities of mod_proxy_html are:
Parses HTML and XHTML markup, rewriting links according to rules defined by the server admin.
Optionally converts HTML to XHTML or vice versa.
Optionally makes some minor corrections to broken HTML.
Important changes from 1.x to 2.x include:
Support for rewriting URLs in scripts, stylesheets and scripting events
Support for regular expression match-and-replace
Improved charset detection, including XML BOM, XML declaration, and HTML META where the information is not available in HTTP.
Support for converting meta http-equiv HTML elements to real HTTP headers.
The default FPI (doctype) generated has changed to none, on the grounds that it's better to omit it than declare a bogus doctype. You should now configure it explicitly if your backend generates sane HTML or XHTML.
A verbose logging option is provided to help with testing your configuration and diagnosing exactly what it is doing.
With these new features, mod_proxy_html might find applications outside a proxy context. But reverse-proxying remains its primary purpose, while mod_publisher offers a far wider range of capabilities for other applications.
Requirements:
Apache 2.x
libxml2
mod_proxy_html 2.5.1 keywords