带有简单 html dom 解析器的垃圾 Javascript 和 Css 代码

Html5

慕容森 2023-10-30 20:41:58

我正在使用简单的 html dom 解析器来解析 php 的链接。下面是我正在使用的 url 和 php 代码。网址：https://homeshopping.pk/products/-Imported-Stretchable-Tights-For-Women--Pack-Of-3-.htmlPHP 脚本：$html = file_get_html('https://homeshopping.pk/products/-Imported-Stretchable-Tights-For-Women--Pack-Of-3-.html');foreach($html->find('div#ProductDescription_Tab') as $description){ $comments = $description->find('.hsn_comments', 0); $comments->outertext = ''; print $description->outertext ;}问题是，运行脚本后，我得到了我想要的前端，但查看页面源代码会显示大量 javascript 和 css 垃圾代码。可以吗？我不能只获取 html 标签而不需要任何额外的 css 或 javascript 代码吗？下面是我的 php 脚本运行脚本后查看页面源的图像。https://i.stack.imgur.com/78X6z.jpg

查看完整描述

1 回答

牛魔王的故事

TA贡献1830条经验获得超3个赞

如果您使用的是最新版本的 simpleHTMLDom，则可以使用该remove()功能。这是基于您现有代码的示例代码

$html = file_get_html('https://homeshopping.pk/products/-Imported-Stretchable-Tights-For-Women--Pack-Of-3-.html');

foreach($html->find('div#ProductDescription_Tab') as $description)

{

$comments = $description->find('.hsn_comments', 0);

$comments->outertext = '';

//remove div with script

$description->find('div#flix-minisite',0)->remove();

$description->find('div#flix-inpage',0)->remove();

//will remove all <script> tags

foreach($description->find('script') as $s) $s->remove();

//wil remove all <style> tags

foreach($description->find('style') as $s) $s->remove();

echo $description->innertext ;

}

反对回复 2023-10-30

热搜

最近搜索清空

带有简单 html dom 解析器的垃圾 Javascript 和 Css 代码

带有简单 html dom 解析器的垃圾 Javascript 和 Css 代码

1 回答

添加回答