web_add_auto_filter
Sets criteria to include or exclude URLs when downloading (applies to all subsequent action functions).
int web_add_auto_filter([Action,]< List of Attributes >, LAST );
Argument | Description |
---|---|
Action | A value indicating whether to include or exclude URLs matching the filter criteria. The following values are supported. – Include: include only URLs which match the criteria – Exclude: exclude URLs that match criteria. (default) |
List of Attributes |
For each attribute in the list, use the format "AttributeName=value" . For details, see the Attributes section, below.
|
LAST | A marker indicating the end of the List of Attributes |
Return Values
This function returns LR_PASS (0) on success, and LR_FAIL (1) on failure.
Parameterization
All attributes from the list of attributes can be parameterized using standard parameterization.
General Information
web_add_auto_filter specifies a filter to be used when downloading content for ensuing action functions. The filter is disabled when web_remove_auto_filter is invoked. Depending on the Action attribute passed in this function, the filter will either include or exclude URLs containing the matching criteria. The default action is "Action=Exclude".
If the Port attribute is set, then only content whose source is that port is downloaded. If no port is specified, then all ports are considered a match.
Attributes
URL – the specific URL to filter
UrlPrefix – Filter all URLs that begin with this string. For example,
"UrlPrefix = http://www.cc"
filters http://www.ccn.com, http://www.cc.edu, http://cc–nanochem.de, and any other URL that begins with "http://www.cc"
Host – URLs which include Host in the hostname portion of the URL path are filtered
HostPrefix – URLs which include this prefix in the hostname portion of the URL path are filtered
HostSuffix – URLs which include this suffix in the host name portion of the URL path are filtered
Path – URLs with the specified path are filtered
PathPrefix – URLs with the specified path prefix are filtered
PathSuffix – URLs with the specified path suffix are filtered
Port – URLs from the specified port are filtered
Scheme – URLs with the specified communications scheme are filtered. Example schemes are http, https and ftp
Query – URLs with the specified query are filtered
QueryPrefix – URLs with the specified query prefix are filtered
QuerySuffix – URLs with the specified query suffix are filtered
FragmentId – URLs containing a matching Fragment ID are filtered
FragmentIdPrefix – URLs containing a matching Fragment ID prefix are filtered
FragmentIdSuffix – URLs containing a matching Fragment ID suffix are filtered
ID: An identifier for use by web_remove_auto_filter.