使用 Beautiful Soup 从 YouTube 播放列表中抓取曲目链接

我正在尝试从我的播放列表中抓取曲目的所有链接。这是我的代码from selenium import webdriver from time import sleepfrom bs4 import BeautifulSoupfrom urllib.request import urlopenimport replaylist = 'minimal_house'url = 'https://www.youtube.com/channel/UCt2GxiTBN_RiE-cbP0cmk5Q/playlists'html = urlopen(url)soup = BeautifulSoup(html , 'html.parser')tracks = soup.find(title = playlist).get('href')print(tracks)url = url + tracksprint(url)html = urlopen(url)soup = BeautifulSoup(html, 'html.parser')links = soup.find_all('a',attrs={'class':'yt-simple-endpoint style-scope ytd-playlist-panel-video-renderer'})print(links)我不能刮'a'; 也不由id; 也不按类名。

查看完整描述

1 回答

Helenr

TA贡献1780条经验获得超3个赞

这是对我有用的混乱代码：

from selenium import webdriver

from time import sleep

from bs4 import BeautifulSoup

from urllib.request import urlopen

import re

playlist = 'minimal_house'

url = 'https://www.youtube.com/channel/UCt2GxiTBN_RiE-cbP0cmk5Q/playlists'

html = urlopen(url)

soup = BeautifulSoup(html, 'html.parser')

tracks = soup.find('a', attrs={'title': playlist}).get('href')

print(tracks)

url = 'https://www.youtube.com' + str(tracks)

print(url)

html = urlopen(url)

soup = BeautifulSoup(html, 'html.parser')

links = soup.find_all('a')

links = set([link.get('href') for link in links if link.get('href').count('watch')])

print(links)

由于类名会根据设备请求更改，因此在这种情况下最好获取所有链接。并且您需要使用 selenium 向下滚动以获取所有列表。

反对回复 2022-12-14

热搜

最近搜索清空

使用 Beautiful Soup 从 YouTube 播放列表中抓取曲目链接

使用 Beautiful Soup 从 YouTube 播放列表中抓取曲目链接

1 回答

添加回答