我正试图从jb hifi中提取特殊信息,这是我所做的:from BeautifulSoup import BeautifulSoupimport urllib2import reurl="http://www.jbhifionline.com.au/support.aspx?post=1&results=10&source=all&bnSearch=Go!&q=ipod&submit=Go"page=urllib2.urlopen(url)soup = BeautifulSoup(page.read())Item0=soup.findAll('td',{'class':'check_title'})[0] print (Item0.renderContents())输出为:Apple iPod Classic 160GB (Black)Â <span class="SKU">MC297ZP/A</span>我想要的是:Apple iPod Classic 160GB (Black)我尝试使用re删除其他信息 print(Item0.renderContents()).replace{^<span:,""} 但这没用所以我的问题是如何删除无用的信息并获得“ Apple ipod classic 160GB(黑色)”
添加回答
举报
0/150
提交
取消