为了账号安全,请及时绑定邮箱和手机立即绑定

代码报错​AttributeError: 'HtmlOutputer' object has no attribute 'output_html'

代码报错​AttributeError: 'HtmlOutputer' object has no attribute 'output_html'

薛定谔的猫4118301 2017-03-29 17:50:10
# coding:utf-8 from baike_spider import url_manager, html_downloader, html_parser, \     html_outputer class SpiderMain(object):     def __init__(self):         self.urls = url_manager.UrlManager()         self.downloader = html_downloader.Htmldownloader()         self.parser = html_parser.HtmlParser()         self.outputer = html_outputer.HtmlOutputer()     def craw(self,root_url):         count = 1         self.urls.add_new_url(root_url)         while self.urls.has_new_url():             try:                 new_url = self.urls.get_new_url()                 print 'craw %d : %s' % (count, new_url)                 html_cont = self.downloader.download(new_url)                 new_urls, new_data = self.parser.parse(new_url, html_cont)                 self.urls.add_new_urls(new_urls)                 self.outputer.collect_data(new_data)                 if count == 1000:                     break                 count = count + 1             except:                 print"craw failed"         self.outputer.output_html() if __name__ == "__main__":     root_url = "http://baike.baidu.com/item/亚马逊/21766?sefr=cr"     obj_spider = SpiderMain()     obj_spider.craw(root_url)
查看完整描述

目前暂无任何回答

  • 0 回答
  • 0 关注
  • 1469 浏览
慕课专栏
更多

添加回答

举报

0/150
提交
取消
意见反馈 帮助中心 APP下载
官方微信