Problem with the output

Why is the crawled content written out in bytes format?

pan060757

2016-03-16

From: Python开发简单爬虫 (Developing a Simple Crawler in Python), section 7-5

4 answers

sS浩子_M
2018-07-27

class HtmlOutputer(object):
    def __init__(self):
        self.datas = []

    def collect_data(self, data):
        # Ignore pages that produced no data
        if data is None:
            return
        self.datas.append(data)

    def output_html(self):
        # Open in text mode with an explicit encoding, so the str values
        # are written as UTF-8 text and no manual .encode() is needed.
        # The with block closes the file even if a write fails.
        with open('output.html', 'w', encoding='utf-8') as fout:
            fout.write("<html>")
            fout.write("<head>")
            fout.write('<meta charset="UTF-8">')
            fout.write("</head>")
            fout.write("<body>")
            fout.write("<table>")

            # One table row per collected page
            for data in self.datas:
                fout.write("<tr>")
                fout.write("<td>%s</td>" % data['url'])
                fout.write("<td>%s</td>" % data['title'])
                fout.write("<td>%s</td>" % data['summary'])
                fout.write("</tr>")

            fout.write("</table>")
            fout.write("</body>")
            fout.write("</html>")
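
To see why the output can end up looking like b'...', here is a minimal sketch. It assumes the original code ran under Python 3 and called .encode('utf-8') on each value before formatting it with %s, which is common in Python 2 era crawler code; the question does not show the asker's code, so this is only a guess. The title string and the temporary file path are made up for illustration.

```python
import os
import tempfile

# Hypothetical sample value standing in for a crawled page title
title = "Python (计算机程序设计语言)"

# Python 3: formatting a bytes object with %s embeds its repr, b'...'
as_bytes = "<td>%s</td>" % title.encode("utf-8")

# Formatting the str directly keeps the readable text
as_text = "<td>%s</td>" % title

print(as_bytes)
print(as_text)

# Let open() handle the encoding instead of encoding each value by hand
path = os.path.join(tempfile.mkdtemp(), "output.html")
with open(path, "w", encoding="utf-8") as fout:
    fout.write(as_text)

# Read the file back to confirm the text survived unchanged
with open(path, encoding="utf-8") as fin:
    round_trip = fin.read()
```

If that matches your code, dropping the .encode('utf-8') calls and passing encoding='utf-8' to open(), as in the class above, should give readable text in output.html.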