public interface ExtractorParameters
Modifier and Type | Method and Description |
---|---|
boolean |
getExtract404s()
Whether to extract links from responses with a 404 'not found' response
code.
|
boolean |
getExtractIndependently()
Whether each extractor should make an independent decision as to whether
it can extract links from a URI's content (when value is true), or
whether a previous extractor's success (marking the URI as
hasBeenLinkExtracted) should cancel later extractors (when value is
false).
|
int |
getMaxOutlinks()
The maximum number of outlinks to discover from any URI's content.
|
int getMaxOutlinks()
boolean getExtractIndependently()
boolean getExtract404s()
Copyright © 2003–2019 Internet Archive. All rights reserved.