最新回答 / 慕粉4289539
运行以后是这样的C:\Python27\python.exe D:/pycharm/xiexie/baike_spider/spider_main.pycraw 1 : Nonecraw failed Process finished with exit code 0
2016-10-27
最新回答 / 宇娃
# coding:utf8from bs4 import BeautifulSoupimport rehtml_doc = """<html><head><title>The Dormouse's story</title></head><body><p class="title"><b>The Dormouse's story</b></p><p class="story">...
2016-10-27
最新回答 / qq_江仔_0
def _get_new_urls(self, curr_url, soup): new_urls = set() links = soup.find_all('a', href=re.compile(r"/item/*"))正则表达式的内容改下就行,网页格式变了
2016-10-24