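I'm practicing web scraping by fetching page information from Lianjia. The Lianjia page is encoded in UTF-8, but the Chinese text comes out garbled when I print it: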
import requests
from bs4 import BeautifulSoup
url = 'http://nj.lianjia.com/xiaoqu/'
html = requests.get(url)
soup = BeautifulSoup(html.text,'lxml')
title = soup.title.get_text()
print(title)
What I get is "å京å°åºäºææ¿(å京é¾å®¶ç½)". How can I get the Chinese to display correctly?
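A likely cause, sketched below under assumptions: when the server's Content-Type header does not declare a charset, requests falls back to ISO-8859-1 for text responses, so the UTF-8 bytes are decoded with the wrong codec. Whether this server really omits the charset is an assumption, not something confirmed here. `fix_mojibake` and `fetch_title` are hypothetical helper names used only for illustration.

```python
def fix_mojibake(garbled: str) -> str:
    """Undo UTF-8 bytes that were wrongly decoded as ISO-8859-1 (hypothetical helper)."""
    return garbled.encode("iso-8859-1").decode("utf-8")


def fetch_title(url: str) -> str:
    """Fetch a page and return its <title>, forcing UTF-8 decoding (hypothetical helper)."""
    # Imported here so the offline demo below runs without these packages.
    import requests
    from bs4 import BeautifulSoup

    resp = requests.get(url)
    # Assumption: the page really is UTF-8, as stated in the question.
    resp.encoding = "utf-8"
    soup = BeautifulSoup(resp.text, "lxml")
    return soup.title.get_text()


if __name__ == "__main__":
    # Offline demonstration of the mechanism: decode UTF-8 bytes with the
    # wrong codec, then repair the result.
    original = "南京小区二手房"
    garbled = original.encode("utf-8").decode("iso-8859-1")
    print(repr(garbled))
    print(fix_mojibake(garbled))
```

In the original script, the equivalent change would be adding `html.encoding = 'utf-8'` before reading `html.text`. Two alternatives that should also work: `html.encoding = html.apparent_encoding`, or passing the raw bytes `html.content` to BeautifulSoup so it detects the encoding itself.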