Edit the collection node
Basic node information
Node name:
Target page encoding (sourcelang): GB2312 (gb2312) | UTF8 (utf-8) | BIG5 (big5)
Area matching mode (macthtype): Regular expression (regex) | String (string)
Content import order (cosort): Same as the target site (asc) | Reverse of the target site (desc)
The following options need to be set only when anti-hotlinking mode is enabled. If the target site has no anti-hotlinking protection, leave the mode off; enabling it slows down collection.
Anti-hotlinking mode (isref): Off (no) | On (yes)
Resource download timeout (seconds):
Reference URL: (the URL of any one article page on the target site)
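As a rough illustration of what these settings imply, the sketch below assumes that anti-hotlinking mode works by sending the reference URL as the HTTP Referer header when a resource is downloaded. The form does not state this, so treat it as an assumption. The function name build_request and the sample URLs are hypothetical.

```python
import urllib.request


def build_request(url, reference_url=None):
    """Build a download request; add a Referer header only when a
    reference URL is given (assumed meaning of anti-hotlinking mode)."""
    headers = {}
    if reference_url:
        headers["Referer"] = reference_url
    return urllib.request.Request(url, headers=headers)


# Hypothetical URLs, for illustration only.
req = build_request(
    "http://www.aaa.com/uploads/pic.jpg",
    reference_url="http://www.aaa.com/html/test/1.html",
)

# The download itself would pass the timeout in seconds, for example:
# urllib.request.urlopen(req, timeout=10)
```

Sending the extra header on every resource request is one plausible reason the form warns that the mode slows down collection.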
List URL rules
Source type (sourcetype): Batch-generate list URLs (batch) | Manually specify list URLs (hand) | Get from RSS (rss)
Batch-generated URL settings:
Match URL:
(For example: http://www.dedecms.com/html/test/list_(*).html. If this pattern cannot cover every list URL, enter the remaining URLs in the manual URL field.)
(*) from / to: (enter page numbers or regularly incrementing numbers)
Increment per page:
Enable multi-column wildcard (#)
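The batch settings can be read as a simple expansion of the (*) wildcard. The following is a minimal sketch, assuming the "to" value is inclusive and the increment is added to the number on each step; the function name expand_list_urls is hypothetical.

```python
def expand_list_urls(pattern, start, end, step=1):
    """Replace the (*) wildcard with each number from start to end
    (inclusive, an assumption), advancing by step."""
    return [pattern.replace("(*)", str(n)) for n in range(start, end + 1, step)]


urls = expand_list_urls(
    "http://www.dedecms.com/html/test/list_(*).html", 1, 5, step=2
)
for url in urls:
    print(url)
```

With from 1, to 5, and an increment of 2, the sketch produces the list pages numbered 1, 3, and 5.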
Manual URLs:
URLs that the batch rule cannot match can be listed here.
Multi-column wildcard rules:
If the target site uses a single template for several columns, put "(#)" in the match URL to stand for the part of the URL that differs between columns, then define each value in the wildcard rules. Each rule can also specify the export column.

Sample format: [(#)=>labs/list_3; (*)=>1-25; typeid=>7]
Match URL: http://www.aaa.com/(#)_(*).html
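One way to read the sample rule is sketched below. It assumes that (#) is replaced by the given path fragment, that (*) runs over the inclusive range 1-25, and that typeid names the export column. These semantics are inferred from the sample, not confirmed by the form, and the function names are hypothetical.

```python
def parse_rule(rule):
    """Parse '[(#)=>labs/list_3; (*)=>1-25; typeid=>7]' into a dict."""
    fields = {}
    for part in rule.strip().strip("[]").split(";"):
        if "=>" in part:
            key, value = part.split("=>", 1)
            fields[key.strip()] = value.strip()
    return fields


def expand_rule(match_url, rule):
    """Return (export column id, list URLs) for one wildcard rule."""
    fields = parse_rule(rule)
    start, end = (int(x) for x in fields["(*)"].split("-"))
    base = match_url.replace("(#)", fields["(#)"])
    urls = [base.replace("(*)", str(n)) for n in range(start, end + 1)]
    return int(fields["typeid"]), urls


typeid, urls = expand_rule(
    "http://www.aaa.com/(#)_(*).html",
    "[(#)=>labs/list_3; (*)=>1-25; typeid=>7]",
)
```

Under these assumptions the rule yields 25 list URLs, from http://www.aaa.com/labs/list_3_1.html to http://www.aaa.com/labs/list_3_25.html, all exported to column 7.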
Article URL matching rules
Content URL matching mode (urlrule):
  area: specify the area that contains the article URLs (the URL, title, image, and other information can be collected)
  regx: specify a regular expression for the URLs (only the URL is collected)
Settings for the area that contains the article URLs:
HTML at the start of the area:
HTML at the end of the area:
If a link contains an image (listpic): Ignore it (0) | Collect it as a thumbnail (1)
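The area mode can be pictured as cutting the list page between the start and end HTML and then reading links from that slice. The sketch below is an assumption about how such matching might work, using plain string search for the area and a simple regular expression for the links; the function names and the sample page are hypothetical.

```python
import re


def extract_area(html, start_html, end_html):
    """Return the text between start_html and the next end_html,
    or an empty string when either marker is missing."""
    begin = html.find(start_html)
    if begin == -1:
        return ""
    begin += len(start_html)
    end = html.find(end_html, begin)
    return html[begin:end] if end != -1 else ""


def extract_links(area):
    """Return (url, link text) pairs for each <a> element in the area."""
    pattern = r'<a\s[^>]*href=["\']([^"\']+)["\'][^>]*>(.*?)</a>'
    return re.findall(pattern, area, flags=re.I | re.S)


# Hypothetical list page, for illustration only.
page = (
    '<div id="nav"><a href="/about.html">About</a></div>'
    '<ul class="list">'
    '<li><a href="/html/test/1.html">First post</a></li>'
    '<li><a href="/html/test/2.html">Second post</a></li>'
    '</ul><div id="foot"></div>'
)
links = extract_links(extract_area(page, '<ul class="list">', "</ul>"))
```

Restricting the search to the area keeps navigation links such as /about.html out of the result, which is the practical reason to prefer area mode when the list markup is stable.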
Filter the URLs found in the area again:
(uses regular expressions)
Must contain: (takes priority over the rule below)
Cannot contain:
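The two filter fields can be sketched as a pair of regular-expression checks. The priority note is ambiguous; this sketch assumes it means the "must contain" check runs first and the "cannot contain" check is applied only to URLs that pass it. The function name filter_urls and the sample URLs are hypothetical.

```python
import re


def filter_urls(urls, must_contain="", cannot_contain=""):
    """Keep URLs that match must_contain (if set) and do not match
    cannot_contain (if set). Both arguments are regular expressions."""
    kept = []
    for url in urls:
        # Checked first, following the assumed priority order.
        if must_contain and not re.search(must_contain, url):
            continue
        if cannot_contain and re.search(cannot_contain, url):
            continue
        kept.append(url)
    return kept


candidates = ["/html/test/1.html", "/html/test/index.html", "/about.html"]
result = filter_urls(candidates, must_contain=r"/html/test/", cannot_contain=r"index")
```

Leaving a field empty skips that check, so an empty filter returns the list unchanged.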