为了账号安全,请及时绑定邮箱和手机立即绑定

使用python在html上提取<label><span>标签

使用python在html上提取<label><span>标签

呼唤远方 2021-07-03 10:07:38
我想提取网页,如: https://www.glassdoor.com/Overview/Working-at-Apple-EI_IE1138.11,16.htm,所以我想以以下格式返回结果。Website       Headquarters  Size             Revenue                Typewww.apple.com Cupertino, CA 10000+ employees $10+ billion (USD) per year     Company - Public (AAPL)然后我使用下面的代码beatifulsoup来得到这个。all_href = com_soup.find_all('span', {'class': re.compile('value')})all_href = list(set(all_href))它返回带有<span>. 此外,它没有在下面显示标签<label>[<span class="value"> Computer Hardware &amp; Software</span>, <span class="value"> Company - Public (AAPL) </span>, <span class="value">10000+ employees</span>, <span class="value"> $10+ billion (USD) per year</span>, <span class="value-title" title="4.0"></span>, <span class="value">Cupertino, CA</span>, <span class="value"> 1976</span>, <span class="value-title" title="5.0"></span>, <span class="value website"><a class="link" href="http://www.apple.com" rel="nofollow noreferrer" target="_blank">www.apple.com</a></span>]
查看完整描述

2 回答

  • 2 回答
  • 0 关注
  • 346 浏览
慕课专栏
更多

添加回答

举报

0/150
提交
取消
微信客服

购课补贴
联系客服咨询优惠详情

帮助反馈 APP下载

慕课网APP
您的移动学习伙伴

公众号

扫描二维码
关注慕课网微信公众号