Interface | Description |
---|---|
CanonicalizationRule |
A rule to apply canonicalizing a url.
|
Class | Description |
---|---|
BaseRule |
Base of all rules applied canonicalizing a URL that are configurable
via the Heritrix settings system.
|
FixupQueryString |
Strip any trailing question mark.
|
LowercaseRule |
Lowercases the URL.
|
RegexRule |
General conversion rule.
|
RulesCanonicalizationPolicy |
URI Canonicalizatioon Policy
|
StripExtraSlashes |
Strip any extra slashes, '/', found in the path.
|
StripSessionCFIDs |
Strip cold fusion session ids.
|
StripSessionIDs |
Strip known session ids.
|
StripUserinfoRule |
Strip any 'userinfo' found on http/https URLs.
|
StripWWWNRule |
Strip any 'www[0-9]*' found on http/https URLs IF they have some
path/query component (content after third slash).
|
StripWWWRule |
Strip any 'www' found on http/https URLs, IF they have some
path/query component (content after third slash).
|
UriCanonicalizationPolicy |
URI Canonicalizatioon Policy
|
Copyright © 2003–2019 Internet Archive. All rights reserved.