
I want to exclude bold paragraphs when scraping a website


神不在的星期二 2021-08-11 17:25:09
I am using the following code to scrape a website:

import requests
from bs4 import BeautifulSoup

resp = requests.get('https://www.ecb.europa.eu/press/pressconf/2018/html/ecb.is180913.en.html')
soup = BeautifulSoup(resp.content, 'html5lib')
article = soup.find('article')
paragraphs = article.find_all('p')

The output looks like:

[<p>Based on our regular economic and monetary analyses, we decided to keep the <strong>key ECB interest rates</strong> unchanged. .... to levels that are below, but close to, 2% over the medium term.</p>,
<p><strong>Has QE been used well by the various euro area countries?</strong></p>,
<p>By and large, yes, it's been used well in the sense that the intended effects of the QE – mind, ... It reduced dispersion in growth rates everywhere. An employment situation which is by and large improving almost everywhere, some countries more than others. </p>,
<p>If your question is meant to say; shouldn't governments have taken advantage of the situation of such low rates to decrease budget deficits, to restore? ... is a good situation for doing that.</p>,
<p><strong>My second question is on reinvestment. ...Have you today explicitly asked the committees to come up with proposals on reinvestments?</strong></p>,
<p>About inflation: I said inflation is going to hover around the present level for the rest of the year and then I gave numbers for next year and 2020. ...will reach our objective over the medium term. </p>,]

I want to exclude the bold paragraphs, that is, the ones that start with <p><strong>. I tried to write code for this but could not get the desired output. I would appreciate any help.

2 answers

郎朗坤

Contributed 1921 experience points, received more than 9 likes

Use str() to convert each bs4 object to a string, then skip the ones of the form <p><strong>......</strong></p>:


....

paragraphs = article.find_all('p')

for p in paragraphs:
    # Skip paragraphs whose markup opens with a <strong> tag
    if '<p><strong>' not in str(p):
        print(str(p))  # print() is a function in Python 3
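The string test above depends on the exact serialized markup, so a paragraph such as <p> <strong>...</strong></p> (with leading whitespace) would slip through. A minimal sketch of a structural check follows, assuming a "bold paragraph" means one whose entire text sits inside <strong> tags. SAMPLE_HTML and the helper names is_bold_only and non_bold_paragraphs are hypothetical, and the built-in html.parser is used here only to keep the example free of the html5lib dependency.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the real page
SAMPLE_HTML = """
<article>
  <p>We decided to keep the <strong>key ECB interest rates</strong> unchanged.</p>
  <p><strong>Has QE been used well?</strong></p>
  <p>By and large, yes.</p>
  <p> <strong>My second question is on reinvestment.</strong> </p>
</article>
"""


def is_bold_only(p):
    """Return True when all of the paragraph's text sits inside <strong> tags."""
    bold_text = "".join(s.get_text() for s in p.find_all("strong"))
    full_text = p.get_text().strip()
    # An empty paragraph is not treated as a bold paragraph
    return bool(full_text) and full_text == bold_text.strip()


def non_bold_paragraphs(html):
    """Return the <p> tags inside <article> that are not bold-only."""
    soup = BeautifulSoup(html, "html.parser")
    article = soup.find("article")
    return [p for p in article.find_all("p") if not is_bold_only(p)]


kept = [p.get_text() for p in non_bold_paragraphs(SAMPLE_HTML)]
print(kept)
```

The first sample paragraph is kept because only part of its text is bold, which matches the behaviour the question asks for.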


Answered 2021-08-11

当年话下

Contributed 1890 experience points, received more than 9 likes

Try the extract() method, which removes a tag from the tree:

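article = soup.find('article')
paragraphs = article.find_all('p')
# article.strong is only the first <strong> tag, so visit all of them
for strong in article.find_all('strong'):
    # Remove the enclosing <p> when <strong> is its first child
    if strong.parent.name == 'p' and strong.parent.contents[0] is strong:
        strong.parent.extract()
paragraphs_without_bold = article.find_all('p')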
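To see what a single extract() call does, the sketch below runs it on a small hypothetical document (the html string is made up for illustration). It shows that article.strong is shorthand for the first <strong> tag only, and that extract() detaches that tag while leaving its parent <p> in place.

```python
from bs4 import BeautifulSoup

# Hypothetical three-paragraph document, not the real ECB page
html = (
    "<article>"
    "<p>Keep <strong>rates</strong> unchanged.</p>"
    "<p><strong>A question?</strong></p>"
    "<p>An answer.</p>"
    "</article>"
)
soup = BeautifulSoup(html, "html.parser")
article = soup.find("article")

# article.strong is equivalent to article.find("strong"): the first match only
first_strong = article.strong
# extract() detaches the tag from the tree and returns it
removed = first_strong.extract()

remaining_strong = len(article.find_all("strong"))
remaining_p = len(article.find_all("p"))
print(removed, remaining_strong, remaining_p)
```

After the call, the second paragraph still contains its <strong> tag and all three <p> tags remain, so removing every bold paragraph needs a loop over the paragraphs rather than one extract() call.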



Answered 2021-08-11