@tesths
Last active November 14, 2018 02:47
Web Scraper sitemap: WeChat official account (公众号) crawler
{
  "_id": "gongzhonghao",
  "startUrl": [
    "https:AAA"
  ],
  "selectors": [
    {
      "id": "total",
      "type": "SelectorElementScroll",
      "parentSelectors": [
        "_root"
      ],
      "selector": "div.weui_msg_card:nth-of-type(n+2)",
      "multiple": true,
      "delay": "2000"
    },
    {
      "id": "title",
      "type": "SelectorText",
      "parentSelectors": [
        "total"
      ],
      "selector": "h4.weui_media_title",
      "multiple": false,
      "regex": "",
      "delay": 0
    },
    {
      "id": "link",
      "type": "SelectorElementAttribute",
      "parentSelectors": [
        "total"
      ],
      "selector": "h4.weui_media_title",
      "multiple": false,
      "extractAttribute": "hrefs",
      "delay": 0
    }
  ]
}
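
The sitemap's selectors form a small tree through `parentSelectors`: `total` hangs off `_root` and scrolls the page to collect message cards, while `title` and `link` each run inside every card that `total` matches. A minimal Python sketch for checking that structure before importing the sitemap might look like the following. The helper names (`children_of`, `dangling_parents`, `outline`) are hypothetical and not part of Web Scraper, and the embedded JSON is an abbreviated copy of the sitemap that omits `startUrl`, `multiple`, `regex`, and `delay`.

```python
import json

# Abbreviated copy of the sitemap above; field values are kept as written,
# including the "hrefs" attribute name.
SITEMAP = json.loads("""
{
  "_id": "gongzhonghao",
  "selectors": [
    {"id": "total", "type": "SelectorElementScroll",
     "parentSelectors": ["_root"],
     "selector": "div.weui_msg_card:nth-of-type(n+2)"},
    {"id": "title", "type": "SelectorText",
     "parentSelectors": ["total"],
     "selector": "h4.weui_media_title"},
    {"id": "link", "type": "SelectorElementAttribute",
     "parentSelectors": ["total"],
     "selector": "h4.weui_media_title",
     "extractAttribute": "hrefs"}
  ]
}
""")


def children_of(sitemap, parent_id):
    """Return the ids of selectors that list parent_id as a parent."""
    return [s["id"] for s in sitemap["selectors"]
            if parent_id in s["parentSelectors"]]


def dangling_parents(sitemap):
    """Return parent ids that are neither "_root" nor a defined selector id."""
    known = {"_root"} | {s["id"] for s in sitemap["selectors"]}
    used = {p for s in sitemap["selectors"] for p in s["parentSelectors"]}
    return sorted(used - known)


def outline(sitemap, parent_id="_root", depth=0):
    """Return indented lines describing the selector tree.

    Assumes the parent links contain no cycles, as in this sitemap.
    """
    lines = []
    for s in sitemap["selectors"]:
        if parent_id in s["parentSelectors"]:
            lines.append("  " * depth + f'{s["id"]} ({s["type"]}): {s["selector"]}')
            lines.extend(outline(sitemap, s["id"], depth + 1))
    return lines


if __name__ == "__main__":
    print("\n".join(outline(SITEMAP)))
    print("dangling parents:", dangling_parents(SITEMAP))
```

Running the script prints the tree with `title` and `link` indented under `total`; a non-empty `dangling_parents` result would point to a selector whose parent id is misspelled or missing.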