Method: PHPCrawler::addLinkSearchContentType()



Adds a rule to the list of rules that decide in what kind of documents the crawler
should search for links in (regarding their content-type)
Signature:

public addLinkSearchContentType($regex)

Parameters:

$regex string Regular-expression defining the rule

Returns:

bool  TRUE if the rule was successfully added

Description:

By default the crawler ONLY searches for links in documents of type "text/html".
Use this method to add one or more other content-types the crawler should check for links.

Example:$crawler->addLinkSearchContentType("#text/css# i");
$crawler->addLinkSearchContentType("#text/xml# i");

These rules let the crawler search for links in HTML-, CSS- ans XML-documents.

Please note: It is NOT recommended to let the crawler checkfor links in EVERY document-
type! This could slow down the crawling-process dramatically (e.g. if the crawler receives large
binary-files like images and tries to find links in them).