This document describes how to efficiently serve an arbitrary number
of virtual hosts with Apache 1.3. Some familiarity with
mod_rewrite is
useful.
The techniques described here are of interest if your
httpd.conf contains hundreds of
<VirtualHost> sections that are substantially the
same, for example:
NameVirtualHost 111.22.33.44 <VirtualHost 111.22.33.44> ServerName www.customer-1.com DocumentRoot /www/hosts/www.customer-1.com/docs ScriptAlias /cgi-bin/ /www/hosts/www.customer-1.com/cgi-bin </VirtualHost> <VirtualHost 111.22.33.44> ServerName www.customer-2.com DocumentRoot /www/hosts/www.customer-2.com/docs ScriptAlias /cgi-bin/ /www/hosts/www.customer-2.com/cgi-bin </VirtualHost> # blah blah blah <VirtualHost 111.22.33.44> ServerName www.customer-N.com DocumentRoot /www/hosts/www.customer-N.com/docs ScriptAlias /cgi-bin/ /www/hosts/www.customer-N.com/cgi-bin </VirtualHost>
The basic idea is to replace all of the static
<VirtualHost> configuration with a mechanism that
works it out dynamically. This has a number of advantages:
The main disadvantage is that you cannot have a different log file
for each server; however if you have very many virtual hosts then
doing this is dubious anyway because it eats file descriptors. It's
better to log to a pipe or a fifo and arrange for the process at the
other end to distribute the logs (and perhaps accumulate statistics,
etc.). A LogFormat directive that includes
%{SERVER_NAME}e for the virtual host makes it easy to do this.
All of the dynamic virtual hosts will either be configured as part
of the main server configuration, or within a
<VirtualHost> section. For a simple (very uniform)
setup, <VirtualHost> sections aren't needed at all.
A couple of things need to be `faked' to make the dynamic virtual
host look like a normal one. The most important is the server name
(configured with ServerName and available to CGIs via the
SERVER_NAME environment variable). The way it is
determined is controlled by the UseCanonicalName
directive: with UseCanonicalName off the server name
comes from the contents of the Host: header in the
request. If there is no Host: header then the value
configured with ServerName is used instead.
The other one is the document root (configured with
DocumentRoot and available to CGIs via the
DOCUMENT_ROOT environment variable). This is used by the
core module when mapping URIs to filenames, but in the context of
dynamic virtual hosting its value only matters if any CGIs or SSI
documents make use of the DOCUMENT_ROOT environment
variable. This is an Apache extension to the CGI specification and as
such shouldn't really be relied upon, especially because this
technique breaks it: there isn't currently a way of setting
DOCUMENT_ROOT dynamically.
The meat of the mechanism works via Apache's URI-to-filename
translation API phase. This is used by a number of modules:
mod_rewrite,
mod_alias,
mod_userdir,
and the core module.
In the default configuration these modules are called in that order
and given a chance to say that they know what the filename is. Most of
these modules do it in a fairly simple fashion (e.g. the core module
concatenates the document root and the URI) except for
mod_rewrite, which provides enough functionality to do
all sorts of sick and twisted things (like dynamic virtual hosting).
Note that because of the order in which the modules are called, using
a mod_rewrite configuration that matches any URI means
that the other modules (particularly mod_alias) will
cease to function. The examples below show how to deal with this.
The dynamic virtual hosting idea is very simple: use the server name as well as the URI to determine the corresponding filename.
This extract from httpd.conf implements the virtual
host arrangement outlined in the Motivation
section above, but in a generic fashion.
The first half shows some other configuration options that are
needed to make the mod_rewrite part work as expected; the
second half uses mod_rewrite to do the actual work. Some
care is taken to do a per-dynamic-virtual-host equivalent of
ScriptAlias.
# dynamic ServerName
UseCanonicalName Off
# splittable logs
LogFormat "%{SERVER_NAME}e %h %l %u %t \"%r\" %s %b" vcommon
CustomLog logs/access_log vcommon
<Directory /www/hosts>
# ExecCGI is needed here because we can't force
# CGI execution in the way that ScriptAlias does
Options FollowSymLinks ExecCGI
</Directory>
# now for the hard bit
RewriteEngine On
# a ServerName derived from a Host: header may be any case at all
RewriteMap lowercase int:tolower
## deal with normal documents first:
# allow Alias /icons/ to work - repeat for other aliases
RewriteCond %{REQUEST_URI} !^/icons/
# allow CGIs to work
RewriteCond %{REQUEST_URI} !^/cgi-bin/
# do the magic
RewriteRule ^/(.*)$ /www/hosts/${lowercase:%{SERVER_NAME}}/docs/$1
## and now deal with CGIs - we have to force a MIME type
RewriteCond %{REQUEST_URI} ^/cgi-bin/
RewriteRule ^/(.*)$ /www/hosts/${lowercase:%{SERVER_NAME}}/cgi-bin/$1 [T=application/x-httpd-cgi]
# that's it!
This is an adjustment of the above system tailored for an ISP's
homepages server. Using slightly more complicated rewriting rules we
can select substrings of the server name to use in the filename so
that e.g. the documents for www.user.isp.com are found in
/home/user/. It uses a single cgi-bin
directory instead of one per virtual host.
RewriteEngine on
RewriteMap lowercase int:tolower
# allow CGIs to work
RewriteCond %{REQUEST_URI} !^/cgi-bin/
# check the hostname is right so that the RewriteRule works
RewriteCond ${lowercase:%{HTTP_HOST}} ^www\.[a-z-]+\.isp\.com$
# concatenate the virtual host name onto the start of the URI
# the [C] means do the next rewrite on the result of this one
RewriteRule ^(.+) ${lowercase:%{HTTP_HOST}}$1 [C]
# now create the real file name
RewriteRule ^www\.([a-z-]+)\.isp\.com/(.*) /home/$1/$2
# define the global CGI directory
ScriptAlias /cgi-bin/ /www/std-cgi/
This arrangement uses a separate configuration file to specify the translation from virtual host to document root. This provides more flexibility but requires more configuration.
The vhost.map file contains something like this:
www.customer-1.com /www/customers/1 www.customer-2.com /www/customers/2 # ... www.customer-N.com /www/customers/N
The http.conf contains this:
RewriteEngine on
RewriteMap lowercase int:tolower
# define the map file
RewriteMap vhost txt:/www/conf/vhost.map
# deal with aliases as above
RewriteCond %{REQUEST_URI} !^/icons/
RewriteCond %{REQUEST_URI} !^/cgi-bin/
RewriteCond ${lowercase:%{SERVER_NAME}} ^(.+)$
# this does the file-based remap
RewriteCond ${vhost:%1} ^(/.*)$
RewriteRule ^/(.*)$ %1/docs/$1
RewriteCond %{REQUEST_URI} ^/cgi-bin/
RewriteCond ${lowercase:%{SERVER_NAME}} ^(.+)$
RewriteCond ${vhost:%1} ^(/.*)$
RewriteRule ^/(.*)$ %1/cgi-bin/$1
With more complicated setups, you can use Apache's normal
<VirtualHost> directives to control the scope of
the various rewrite configurations. For example, you could have one IP
address for homepages customers and another for commercial customers
with the following setup. This can of course be combined with
convential <VirtualHost> configuration
sections.
UseCanonicalName Off
LogFormat "%{SERVER_NAME}e %h %l %u %t \"%r\" %s %b" vcommon
CustomLog logs/access_log vcommon
<Directory /www/commercial>
Options FollowSymLinks ExecCGI
AllowOverride All
</Directory>
<Directory /www/homepages>
Options FollowSymLinks
AllowOverride None
</Directory>
<VirtualHost 111.22.33.44>
ServerName www.commercial.isp.com
RewriteEngine On
RewriteMap lowercase int:tolower
RewriteCond %{REQUEST_URI} !^/icons/
RewriteCond %{REQUEST_URI} !^/cgi-bin/
RewriteRule ^/(.*)$ /www/commercial/${lowercase:%{SERVER_NAME}}/docs/$1
RewriteCond %{REQUEST_URI} ^/cgi-bin/
RewriteRule ^/(.*)$ /www/commercial/${lowercase:%{SERVER_NAME}}/cgi-bin/$1 [T=application/x-httpd-cgi]
</VirtualHost>
<VirtualHost 111.22.33.45>
ServerName www.homepages.isp.com
RewriteEngine on
RewriteMap lowercase int:tolower
RewriteCond %{REQUEST_URI} !^/cgi-bin/
RewriteCond ${lowercase:%{HTTP_HOST}} ^www\.[a-z-]+\.isp\.com$
RewriteRule ^(.+) ${lowercase:%{HTTP_HOST}}$1 [C]
RewriteRule ^www\.([a-z-]+)\.isp\.com/(.*) /www/homepages/$1/$2
ScriptAlias /cgi-bin/ /www/std-cgi/
</VirtualHost>