请问P段落怎么提取不出文字？

# coding:utf8from bs4 import BeautifulSoup import re html_doc = """<html><head><title>The Dormouse's story</title></head><body>The Dormouse's storyOnce upon a time there were three little sisters; and their names were<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;and they lived at the bottom of a well...."""soup = BeautifulSoup(html_doc,'html.parser',from_encoding='utf-8')print'获取所有连接'links = soup.find_all('a')for link in links: print link.name,link['href'],link.get_text()print'获取lacie的连接'link_node=soup.find('a',href='http://example.com/lacie')print link_node.name,link_node['href'],link_node.get_text()print'正则匹配'link_node=soup.find('a',href=re.compile(r"ill"))print link_node.name,link_node['href'],link_node.get_text()print'p段落文字'p_node=soup.find('a',class_="title")print p_node.name, p_node.get_text()会报错如下获取所有连接a http://example.com/elsie Elsiea http://example.com/lacie Laciea http://example.com/tillie Tillie获取lacie的连接a http://example.com/lacie Lacie正则匹配a http://example.com/tillie Tilliep段落文字Traceback (most recent call last): File "C:\Users\Administrator\workspace\2.7\66\test_bs4.py", line 34, in <module> print p_node.name, p_node.get_text()AttributeError: 'NoneType' object has no attribute 'name'

查看完整描述

2 回答

慕粉3913390

TA贡献13条经验获得超3个赞

提问应该有个语言模式，这样看起来很乱

反对回复 2017-04-04

热搜

最近搜索清空

请问P段落怎么提取不出文字？

请问P段落怎么提取不出文字？

2 回答

添加回答