NetHere Knowledge Base (KB)

Default Webalizer Configuration

  

Webalizer is the program that NetHere uses to process stats for web sites hosted on our platform.  Webalizer uses a default configuration file to process web site log files for each site.  This default configuration file determines the output/format that Webalizer produces.

 

Below is a portion of the default configuration file that NetHere establishes for web sites that have had web stats enabled.  If web stats have not been enabled for your site, please send a request to webhosting@nethere.com with the name of your site, your name, and your email address, and we will provision stats for your site.

 

Default Configuration File:

 

LogType         web

HostName        %%DOMAIN_NAME%%

PageType        htm*
PageType        cgi
PageType        pl
PageType        php
PageType        asp*
PageType        shtm*

Quiet           yes

IndexAlias      index.htm
IndexAlias      index.html
IndexAlias      index.shtml
IndexAlias      index.cgi
IndexAlias      home.htm
IndexAlias      home.html
IndexAlias      home.shtml
IndexAlias      audio.shtml

HideReferrer    %%DOMAIN_NAME%%/
HideReferrer    Direct Request

 

 

Although you cannot access the configuration file for your web site directly, we can adjust it to alter the output of Webalizer however you would like, based on the various keyword/value pairs.  The 'keyword' is one of the currently available configuration keywords, and the 'value' is what is assigned to that particular option.  Below is a list of the most frequently adjusted keywords in the configuration file.  For a more extensive list, please see the readme file at www.webalizer.com.

 

--------------------------------------------------------------------------

 

General Configuration Keywords

------------------------------

 

VisitTimeout   Set the 'visit timeout' value.  Visits are determined by looking at the time difference between the current and last request made by a specific site.  If the difference in time is greater than the visit timeout value, the request is considered a new visit.  The value is given in seconds.  The default value of 30 minutes (1800 seconds) should be fine for most sites.
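
As a rough illustration, a request to lengthen the visit timeout to one hour would come down to a single line like the one below.  The value 3600 is only an example; the one assumption here, taken from the Webalizer readme, is that the value is expressed in seconds.

```conf
# Example only: treat requests from the same site as one visit
# unless more than one hour (3600 seconds) passes between them.
VisitTimeout    3600
```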

 

PageType      Allows you to define the 'page' type extension.  Normally, people consider HTML and CGI scripts as 'pages'.  This option allows you to specify what extensions you consider a page.  Default is 'htm*' and 'cgi' for web logs, and 'txt' for ftp logs. 

     

IndexAlias    This allows additional 'index.html' aliases to be defined.  Normally, the Webalizer scans for and strips the string "index." from URLs before processing them.  This turns a URL such as /somedir/index.html into just /somedir/, which is really the same URL.  This keyword allows _additional_ names to be treated in the same fashion for sites that use different default names, such as "home.html".  The string is scanned for anywhere in the URL, so care should be used if and when you define additional aliases.  For example, if you were to use an alias such as 'home', the URL /somedir/homepages/brad/home.html would be turned into just /somedir/, which probably isn't the intended result.  Instead, you should specify 'home.htm', which would correctly turn the URL into /somedir/homepages/brad/ as intended.  It should also be noted that specified aliases are scanned for in EVERY log record, so a large number of aliases will noticeably degrade performance, as each record has to be scanned for every alias defined.  You don't have to specify 'index.' as it is always the default.
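
The alias pitfall described above can be sketched as follows.  The name 'default.htm' is a hypothetical extra alias chosen for illustration; it is not part of the NetHere default configuration.

```conf
# Too broad (left commented out): 'home' also matches the directory
# name in /somedir/homepages/brad/home.html, so the URL would be
# cut down to /somedir/.
#IndexAlias     home

# More specific: only the file name matches, so the same URL
# becomes /somedir/homepages/brad/.
IndexAlias      home.htm

# Hypothetical additional default page name.
IndexAlias      default.htm
```

Because every alias is checked against every log record, it is usually best to request only the default page names your site actually uses.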

             

MangleAgents   The MangleAgents keyword specifies the level of user agent name mangling, if any.  There are 6 levels that may be specified, each producing a different level of detail displayed.  Level 5 displays only the browser name (MSIE or Mozilla) and the major version number.  Level 4 adds the minor version (single decimal place).  Level 3 adds the minor version to two decimal places.  Level 2 will also add any sub-level designation (such as Mozilla/3.01Gold or MSIE 3.0b).  Level 1 will also attempt to add the system type.  The default level 0 will leave the user agent field unmodified and produces the greatest amount of detail.
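
For instance, a site that only cares about browser name and major/minor version might ask for level 4.  This is a minimal sketch; the level chosen is just an example.

```conf
# Example only: report browser name, major version and a single
# decimal place of minor version.  Level 0 (the default) leaves
# the user agent field unmodified.
MangleAgents    4
```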

             

SearchEngine   This keyword allows specification of search engines and their query strings.  Search strings are obtained from the referrer field in the record, and in order to work properly, the Webalizer needs to know what query strings different search engines use.  The SearchEngine keyword allows you to specify the search engine and the query string from which to parse the search string.  The line is formatted as:  "SearchEngine engine-string query-string" where 'engine-string' is a substring used to match the search engine, such as "yahoo.com" or "altavista".  The 'query-string' is the unique query string that is added to the URL for the search engine, such as "search=" or "MT=", with the actual search strings appended to the end.
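
A few lines in that format might look like the sketch below.  The engine and query-string pairs are illustrative assumptions modeled on the sample configuration distributed with Webalizer; the query string a given search engine actually uses can change, so it should be checked against real referrer entries in your logs.

```conf
# Format: SearchEngine  engine-string  query-string
# (pairs below are examples, not verified against current engines)
SearchEngine    yahoo.com       p=
SearchEngine    altavista.com   q=
SearchEngine    google.com      q=
```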

 

Top Table Keywords

------------------

 

TopAgents     This allows you to specify how many "Top" user agents are displayed in the "Top User Agents" table.  The default is 15.  If you do not want to display user agent statistics, specify a value of zero (0).  The display of user agents will only work if your web server includes this information in its log file (i.e., a combined log format file).

             

AllAgents     Will cause a separate HTML page to be generated for all normally visible User Agents.  A link will be added to the bottom of the "Top User Agents" table if enabled.  Value can be either 'yes' or 'no', with 'no' being the default.

 

TopCountries   This allows you to specify how many "Top" countries are displayed in the "Top Countries" table.  The default is 30.  If you want to disable the countries table, specify a value of zero (0).

             

TopReferrers   This allows you to specify how many "Top" referrers are displayed in the "Top Referrers" table.  The default is 30.  If you want to disable the referrers table, specify a value of zero (0).  The display of referrer information will only work if your web server includes this information in its log file (i.e., a combined log format file).

             

AllReferrers   Will cause a separate HTML page to be generated for all normally visible Referrers.  A link will be added to the "Top Referrers" table if enabled.  Value can be either 'yes' or 'no', with 'no' being the default.

 

TopSites      This allows you to specify how many "Top" sites are displayed in the "Top Sites" table.  The default is 30.  If you want to disable the sites table, specify a value of zero (0).

             

TopKSites    Identical to TopSites, except for the 'by KByte' table.  Default is 10. 

 

TopURLs       This allows you to specify how many "Top" URL's are displayed in the "Top URL's" table.  The default is 30.

             

TopKURLs      Identical to TopURLs, except for the 'by KByte' table.  Default is 10. 

 

AllURLs       Will cause a separate HTML page to be generated for all normally visible URLs.  A link will be added to the bottom of the "Top URLs" table if enabled.  Value can be either 'yes' or 'no', with 'no' being the default.

 

TopEntry      Allows you to specify how many "Top Entry Pages" are displayed in the table.  The default is 10. 

 

TopExit       Allows you to specify how many "Top Exit Pages" are displayed in the table.  The default is 10. 

 

TopSearch     Allows you to specify how many "Top Search Strings" are displayed in the table.  The default is 20. 

 

AllSearchStr   Will cause a separate HTML page to be generated for all normally visible Search Strings.  A link will be added to the bottom of the "Top Search Strings" table if enabled.  Value can be either 'yes' or 'no', with 'no' being the default.
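
Put together, a request to adjust the "Top" tables might translate into lines such as these.  The numbers are arbitrary examples, not recommendations.

```conf
# Example only: show more sites and URLs than the defaults.
TopSites        50
TopURLs         50

# A value of zero disables a table entirely.
TopAgents       0
TopCountries    0

# Generate separate full-listing pages, linked from the tables.
AllURLs         yes
AllReferrers    yes
AllSearchStr    yes
```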

 

 

Hide Object Keywords

--------------------

 

These keywords allow you to hide user agents, referrers, sites, URL's and usernames from the various "Top" tables.  You can specify as many of these as you want without limit. 

 

HideReferrer  This allows you to hide specified referrers from the "Top Referrers" table.  Normally, you would only specify your own web server to be hidden, as it is usually the top generator of references to your own pages.  Of course, this keyword is useless if 1) your log file does not include referrer information or 2) you disable the top referrers table.

          

HideSite      This allows you to hide specified sites from the "Top Sites" table.  Normally, you would only specify your own web server or other local machines to be hidden, as they are usually the highest hitters of your web site, especially if you have their browser's home page pointing to it.

 

HideAllSites   This allows hiding all individual sites from the display, which can be useful when a lot of groupings are being used (since grouped records cannot be hidden).  It is particularly useful in conjunction with the GroupDomain feature, but can be useful in other situations as well.  Value can be either 'yes' or 'no', with 'no' being the default.

             

HideURL       This allows you to hide URL's from the "Top URL's" table.  Normally, this is used to hide items such as graphic files, audio files or other 'non-html' files that are transferred to the visiting user.
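
As a sketch, hiding a site's own pages and its image files from the tables could look like the lines below.  The name example.com is a placeholder, and the leading and trailing asterisk wildcards are an assumption based on the pattern syntax described in the Webalizer readme.

```conf
# Hide self-referrals (example.com is a placeholder domain).
HideReferrer    example.com/

# Hide hits from the site owner's own machines.
HideSite        *.example.com

# Keep image files out of the "Top URL's" table.
HideURL         *.gif
HideURL         *.jpg
```

Hidden items are still counted in the overall totals; they are only left out of the "Top" tables.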

             

Ignore/Include Object Keywords

----------------------

 

These keywords allow you to completely ignore log records when generating statistics, or to force their inclusion regardless of ignore criteria.  Records can be ignored or included based on site, URL, user agent, referrer and username. 

 

IgnoreSite    This allows specified sites to be completely ignored from the generated statistics.

 

IgnoreURL     This allows specified URL's to be completely ignored from the generated statistics.  One use for this keyword would be to ignore all hits to a 'temporary' directory where development work is being done, but is not accessible to the outside world.

 

IgnoreReferrer   This allows records to be ignored based on the referrer field.

 

IgnoreAgent   This allows specified User Agent records to be completely ignored from the statistics.  (May be useful if you really don't want to see all those hits from MSIE :)

 

IgnoreUser    This allows specified username records to be completely ignored from the statistics.  Usernames can only be used if you use http authentication on your server.

 

IncludeSite   Force the record to be processed based on hostname. 

 

IncludeURL    Force the record to be processed based on URL. 

 

IncludeReferrer Force the record to be processed based on referrer. 

 

IncludeAgent   Force the record to be processed based on user agent.
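
To illustrate how the two groups work together, the sketch below ignores a development directory but forces one subdirectory of it back into the statistics.  All paths and host names are hypothetical, and the trailing asterisk wildcard is an assumption based on the Webalizer readme.

```conf
# Drop every record for the (hypothetical) development area.
IgnoreURL       /temp/*

# Force this one subdirectory to be counted anyway; include
# keywords apply regardless of the ignore criteria.
IncludeURL      /temp/public/*

# Leave the site owner's own workstation out of the statistics.
IgnoreSite      office.example.com
```

Unlike the Hide keywords, ignored records are removed from the statistics altogether, so totals will change as well.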

 

 

 

 

 

 



 
