The last line raises an error: print p_node.name, p_node.get_text()

AttributeError: 'NoneType' object has no attribute 'name'. How can I fix this?

Want丶y

2017-07-31

From: Developing a Simple Crawler in Python (Python开发简单爬虫), section 6-4


2 answers

静夜无缘
2017-07-31

Change p_node.get_text() to link_node.get_text().


静夜无缘

I misread the question; please consider my answer above deleted.

2017-07-31

慕雪1753686
2017-07-31

# coding:utf8
from __future__ import print_function  # print() then works on Python 2 and 3

import re

from bs4 import BeautifulSoup

# Sample markup from the Beautiful Soup documentation. The <p class="title">
# tag is what the last lookup relies on; without it, find() returns None.
html_doc = """
<html><head><title>The Dormouse's story</title></head>
<body>
<p class="title"><b>The Dormouse's story</b></p>

<p class="story">Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.</p>

<p class="story">...</p>
"""

# from_encoding only matters for byte-string input (the Python 2 case)
soup = BeautifulSoup(html_doc, 'html.parser', from_encoding='utf-8')

print('All links')
links = soup.find_all('a')
for link in links:
    print(link.name, link['href'], link.get_text())

print("Lacie's link")
link_node = soup.find('a', href='http://example.com/lacie')
print(link_node.name, link_node['href'], link_node.get_text())

print('Regex match')
link_node = soup.find('a', href=re.compile(r"ill"))
print(link_node.name, link_node['href'], link_node.get_text())

print('Text of the title paragraph')
p_node = soup.find('p', class_="title")
print(p_node.name, p_node.get_text())
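
The AttributeError in the question means that soup.find() returned None, which is what it does when no tag in the parsed markup matches. So the first thing to check is whether html_doc really contains a p tag with class "title". Below is a minimal sketch of guarding against the None case. The two HTML snippets and the helper describe_title are made up for illustration and are not part of the course code.

```python
from bs4 import BeautifulSoup

# Hypothetical markup for illustration: the first snippet has no
# <p class="title">, the second one does.
HTML_WITHOUT_TITLE = "<html><body><a href='http://example.com/lacie'>Lacie</a></body></html>"
HTML_WITH_TITLE = "<html><body><p class='title'><b>The Dormouse's story</b></p></body></html>"


def describe_title(html):
    """Return 'name text' for the first <p class="title">, or None if absent."""
    soup = BeautifulSoup(html, "html.parser")
    p_node = soup.find("p", class_="title")
    if p_node is None:
        # Nothing matched; touching p_node.name here would raise AttributeError.
        return None
    return "{} {}".format(p_node.name, p_node.get_text())


print(describe_title(HTML_WITHOUT_TITLE))
print(describe_title(HTML_WITH_TITLE))
```

With the check in place, a missing tag shows up as None instead of a crash, which makes it easier to see that the markup, not the print line, is the problem.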
