This document contains reference information for Google Sitemap Generator administrators.
The installation command is for Linux only.
This is the command format:
sitemap-install/install.sh [option]...
The option variable can be one of the following:
* -h: Displays help about these command line options.
* -d directory: Specifies a non-default installation directory.
* -a Apache-binary-filename: Specifies the Apache binary file path. You can also specify this value interactively, in response to the installation script.
* -c Apache-config-filename: Specifies the Apache root configuration file.
* -g Apache-group: Specifies the user group name under which Apache is running.
* -t Apache-control-filename: Specifies the Apache control script.

All of these parameters are optional. Use a space to separate multiple options.
Example: sitemap-install/install.sh -t /usr/sbin/apache2ctl -g www-data
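
If the installation is driven from a script, the option list above can be assembled programmatically. The following is a minimal sketch, not part of Google Sitemap Generator: the function name and the keyword names (`apache_control`, `apache_group`, and so on) are hypothetical, and only the flags documented above are used.

```python
import shlex

# Flags documented for install.sh. The keyword names on the left are
# illustrative only and are not defined by Google Sitemap Generator.
FLAG_FOR = {
    "directory": "-d",
    "apache_binary": "-a",
    "apache_config": "-c",
    "apache_group": "-g",
    "apache_control": "-t",
}


def build_install_command(script="sitemap-install/install.sh", **options):
    """Return the argument list for install.sh using the documented flags."""
    argv = [script]
    for name, value in options.items():
        if name not in FLAG_FOR:
            raise ValueError(f"unknown option: {name}")
        argv += [FLAG_FOR[name], str(value)]
    return argv


command = build_install_command(
    apache_control="/usr/sbin/apache2ctl",
    apache_group="www-data",
)
# Prints the same command line as the example above.
print(shlex.join(command))
```

The resulting list can be passed to `subprocess.run` on a Linux host where the installation package has been unpacked.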
The SitemapService.exe command controls the Google Sitemap Generator service on Windows. To start or stop the service, use the Windows Services interface.
This is the command format:
SitemapService.exe option
The option variable can be one of the following:
* remote_admin {enable | disable}: Lets an administrator log on from a remote computer.
* reset_password: Enters interactive mode to set the administrative password.
* -v or version: Displays the current version.
* -h or help: Lists the command line options.

You can specify only one option at a time.
The sitemap-daemon command controls the Google Sitemap Generator daemon.
This is the command format:
sitemap-daemon option
The option variable can be one of the following:
* remote_admin {enable | disable}: Lets an administrator log on from a remote computer.
* service {start | stop | restart}: Starts, stops, or restarts the Google Sitemap Generator daemon.
* reset_password: Enters interactive mode to set the administrative password.
* -v or version: Displays the current version.
* -h or help: Lists the command line options.

You can specify only one option at a time.
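
Because sitemap-daemon accepts only one option per invocation, a wrapper script can validate the option before running the command. This is a minimal sketch under stated assumptions: the helper names are hypothetical, and it assumes that sitemap-daemon is on the PATH and that the caller has sufficient privileges.

```python
import subprocess

# Options documented for sitemap-daemon, mapped to their allowed arguments.
# An empty tuple means the option takes no argument.
DAEMON_OPTIONS = {
    "remote_admin": ("enable", "disable"),
    "service": ("start", "stop", "restart"),
    "reset_password": (),
    "-v": (),
    "version": (),
    "-h": (),
    "help": (),
}


def daemon_argv(option, argument=None, binary="sitemap-daemon"):
    """Build the argument list for exactly one sitemap-daemon option."""
    if option not in DAEMON_OPTIONS:
        raise ValueError(f"unknown option: {option}")
    allowed = DAEMON_OPTIONS[option]
    if allowed and argument not in allowed:
        raise ValueError(f"{option} expects one of {allowed}")
    if not allowed and argument is not None:
        raise ValueError(f"{option} takes no argument")
    if argument is None:
        return [binary, option]
    return [binary, option, argument]


def run_daemon(option, argument=None):
    """Run the command. Assumes sitemap-daemon is on the PATH."""
    return subprocess.run(daemon_argv(option, argument), check=True)
```

For example, `run_daemon("service", "restart")` would invoke `sitemap-daemon service restart`. Note that reset_password is interactive, so it needs a terminal attached.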
The Site Status page displays status information under the headers URL collectors and Sitemap creators. This section describes the meaning of the status values.
These are the status values for the URL collectors (webserver filter, file scanner, and log parser):
These are the status values for the Sitemap creators (Web, Mobile, Code Search, and Blog Search):
The following table lists and explains the site configuration settings.
| Option or Section | Description | Default Value |
|---|---|---|
| Host name | The name used in generated Sitemaps. | Google Sitemap Generator deduces the host name by monitoring site traffic. |
| Pathname for log files | The Apache or IIS web server log that the Google Sitemap Generator log parser monitors. The value can resolve to a file or a directory. If the value resolves to a directory, the log parser monitors all files in the directory, but it does not monitor files in any subdirectories.<br>Linux example: /var/log/apache2/access.log<br>Windows example: C:\WINDOWS\system32\LogFiles\W3SVC1 | System dependent |
| Resource limits | Specifies the resources that Google Sitemap Generator uses on this web server. You can override these site-level default values on a per-site basis.<br>Maximum age of URLs included in Sitemap file lets you exclude old URLs from Sitemap files, to prevent files from becoming bloated with URLs that are already known.<br>Maximum number of URLs in memory and Maximum number of URLs on disk help you limit the web server resources used by Google Sitemap Generator. As Google Sitemap Generator finds URLs, it enforces these limits and removes older URLs as needed. These values apply to unique URLs; each URL can appear only once in the cache and once on disk. The number of URLs in memory should be smaller than the number of URLs on disk, because the memory cache is periodically written to disk. | Maximum age of URLs included in Sitemap file: 365 days<br>Maximum number of URLs in memory: 100000<br>Maximum number of URLs on disk: 500000 |
| URL collectors | Specifies the way that URLs are collected. You can activate or deactivate the web server filter, file scanner, and log parser. The web server filter runs continuously when activated, but the file scanner and log parser run at intervals that you can specify. | Web server filter: Default value is set at installation.<br>File scanner and log parser: Disabled by default.<br>Default execution interval for the file scanner and log parser: 1440 minutes |
| URL query fields | Includes specified URL query fields in generated Sitemaps, overriding the default exclusion of all query fields. Read the privacy notice and test the generated Sitemaps to ensure that you do not inadvertently compromise user privacy by including inappropriate query fields in Sitemaps. | All query fields are excluded by default. |
| Sitemap types | Enables and disables the generation of URLs for Web, Mobile, Code Search, and Blog Search. | Enabled: Web<br>Disabled: All others |
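
Two of the settings above, URL query fields and the URL count limits, describe behavior that is easier to see in code. The following is an illustrative sketch only and does not reproduce the actual implementation of Google Sitemap Generator: the function and class names are hypothetical, and the eviction order (re-adding a URL marks it as most recent) is an assumption.

```python
from collections import OrderedDict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def filter_query_fields(url, included_fields=()):
    """Drop every query field except those explicitly included."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key in included_fields
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


class BoundedUrlSet:
    """Keep at most max_urls unique URLs, dropping the oldest first."""

    def __init__(self, max_urls):
        self.max_urls = max_urls
        self._urls = OrderedDict()

    def add(self, url):
        self._urls.pop(url, None)  # each URL is stored only once
        self._urls[url] = True     # assumption: re-adding refreshes its age
        while len(self._urls) > self.max_urls:
            self._urls.popitem(last=False)  # evict the oldest entry

    def __len__(self):
        return len(self._urls)

    def __contains__(self, url):
        return url in self._urls
```

With the default of excluding all query fields, `filter_query_fields("http://www.example.com/item?id=7&session=abc")` yields the URL without a query string. Passing `included_fields=("id",)` keeps only the `id` field, which is the kind of result to check for privacy-sensitive values before enabling a field.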
This section lists and explains the configuration settings that are specific to each type of Sitemap: Web, Mobile, Code Search, and Blog Search.
The following table describes the configuration settings for Web Sitemaps.
The following table describes the configuration settings for Mobile Sitemaps.
| Option or Section | Description | Default Values |
|---|---|---|
| Sitemap generation schedule | Specifies the start time for generating Sitemaps and the frequency of Sitemap generation. You can use the start date and time to defer the start time for Google Sitemap Generator, even after Google Sitemap Generator is running and the Mobile Sitemap type is enabled. | Start time and date: Installation time<br>Interval: 1 day |
| Sitemap file settings | Configures settings that affect the Sitemap files.<br>Sitemap file name specifies the first Sitemap file for this Sitemap type.<br>Maximum number of URLs specifies the maximum number of unique URLs per file. For example, if this value is set to 20000 and the number of URLs is 100000, Google Sitemap Generator creates five Sitemap files.<br>Maximum file size specifies the maximum size of each Sitemap file. | Sitemap file compression: Enabled<br>File name:<br>Maximum number of URLs: 20000<br>Maximum file size: 5120 KB |
| Sitemap URL filter | Defines URL patterns that determine how Google Sitemap Generator selects the URLs that are included in each Sitemap. Refer to the information for this setting under Web Sitemaps. | Refer to the information for this setting under Web Sitemaps. |
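
The file count in the Maximum number of URLs example is a ceiling division. The sketch below shows the arithmetic. The function name is hypothetical, and the calculation ignores the Maximum file size setting.

```python
def sitemap_file_count(total_urls, max_urls_per_file=20000):
    """Files needed when each file holds at most max_urls_per_file URLs."""
    if max_urls_per_file <= 0:
        raise ValueError("max_urls_per_file must be positive")
    # Ceiling division: a partially filled last file still counts as a file.
    return -(-total_urls // max_urls_per_file)


print(sitemap_file_count(100000))  # 5, matching the example in the table
print(sitemap_file_count(100001))  # 6, one extra URL needs one more file
```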
The following table describes the configuration settings for Code Search Sitemaps.
| Option or Section | Description | Default Values |
|---|---|---|
| Sitemap generation schedule | Specifies the start time for generating Sitemaps and the frequency of Sitemap generation. You can use the start date and time to defer the start time for Google Sitemap Generator, even after Google Sitemap Generator is running and the Code Search Sitemap type is enabled. | Start time and date: Installation time<br>Interval: 1 day |
| Sitemap file settings | Configures settings that affect the Sitemap files.<br>Sitemap file name specifies the first Sitemap file for this Sitemap type.<br>Maximum number of URLs specifies the maximum number of unique URLs per file. For example, if this value is set to 20000 and the number of URLs is 100000, Google Sitemap Generator creates five Sitemap files.<br>Maximum file size specifies the maximum size of each Sitemap file. | Sitemap file compression: Enabled<br>File name:<br>Maximum number of URLs: 20000<br>Maximum file size: 5120 KB |
| Sitemap URL filter | Defines URL patterns that determine how Google Sitemap Generator selects the URLs that are included in each Sitemap. Refer to the information for this setting under Web Sitemaps. Be careful to ensure that the file extensions specified here actually match the file extensions in use. If they do not match, the Sitemap could be empty. | Included URL patterns: All files with the extensions .vb, .c, .cxx, .cpp, .h, .cc, and .java.<br>Excluded URL patterns: none |
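
To check in advance whether the configured extensions match the files that a site actually serves, a list of known URLs can be tested against the default extension set. This is a rough sketch, not the matching logic of Google Sitemap Generator: the function name is hypothetical, and the case-sensitive comparison is an assumption.

```python
import posixpath
from urllib.parse import urlsplit

# Default included extensions for Code Search Sitemaps, from the table above.
DEFAULT_CODE_EXTENSIONS = {".vb", ".c", ".cxx", ".cpp", ".h", ".cc", ".java"}


def has_included_extension(url, extensions=DEFAULT_CODE_EXTENSIONS):
    """Return True if the URL path ends with one of the given extensions."""
    path = urlsplit(url).path  # ignores any query string or fragment
    # Assumption: extensions are compared exactly as written.
    return posixpath.splitext(path)[1] in extensions


urls = [
    "http://www.example.com/src/main.cpp",
    "http://www.example.com/src/util.py",
    "http://www.example.com/src/Widget.java?rev=2",
]
matched = [url for url in urls if has_included_extension(url)]
# An empty result here suggests that the generated Sitemap could be empty too.
print(matched)
```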
The following table describes the configuration settings for Blog Search Sitemaps. For Blog Search, Google Sitemap Generator does not generate and then submit Sitemap files, as it does for other content types. Instead, Google Sitemap Generator monitors the web server traffic for new content and periodically pings search engines when it finds new content.
| Option or Section | Description | Default Values |
|---|---|---|
| Ping schedule | Specifies the frequency with which Google Sitemap Generator pings search engines to notify them about new blog content. | Start time and date: Installation time<br>Interval: 1 day |
| Sitemap URL filter | Defines URL patterns that determine how Google Sitemap Generator selects the URLs that are sent to the search engines. You can use this filter to specify the blogs that you want to include and exclude. | Included URL patterns: * (all files)<br>Excluded URL patterns: none |
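
The schedule settings in the tables above combine a start time with an interval. One plausible reading is that runs occur at the start time and then at every whole interval after it. The sketch below computes the next run under that assumption. The function name is hypothetical, and the actual scheduling behavior of Google Sitemap Generator may differ.

```python
from datetime import datetime, timedelta


def next_run(start, interval, now):
    """Return the first scheduled time at or after `now`."""
    if interval <= timedelta(0):
        raise ValueError("interval must be positive")
    if now <= start:
        return start  # a start time in the future defers the first run
    elapsed = now - start
    periods = -(-elapsed // interval)  # ceiling division on timedeltas
    return start + periods * interval


start = datetime(2024, 1, 1, 3, 0)  # hypothetical installation time
print(next_run(start, timedelta(days=1), datetime(2024, 1, 2, 12, 0)))
```

With the default interval of 1 day, a check made at noon on the second day returns 03:00 on the third day.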