php解析字符串里所有URL地址的方法

当前位置：首页 > 范文|应用文 > IT技术专栏 > 网络编程

php解析字符串里所有URL地址的方法

来源：阅读：2129 次日期：2015-04-07 14:53:59

温馨提示：小编为您整理了“php解析字符串里所有URL地址的方法”,方便广大网友查阅！

具体如下：

<?php

// $html = the html on the page

// $current_url = the full url that the html came from

//(only needed for $repath)

// $repath = converts ../ and / and // urls to full valid urls

function pageLinks($html, $current_url = "", $repath = false){

preg_match_all("/\<a.+?href=(\"|')(?!javascript:|#)(.+?)(\"|')/i", $html, $matches);

$links = array();

if(isset($matches[2])){

$links = $matches[2];

}

if($repath && count($links) > 0 && strlen($current_url) > 0){

$pathi = pathinfo($current_url);

$dir = $pathi["dirname"];

$base = parse_url($current_url);

$split_path = explode("/", $dir);

$url = "";

foreach($links as $k => $link){

if(preg_match("/^\.\./", $link)){

$total = substr_count($link, "../");

for($i = 0; $i < $total; $i++){

array_pop($split_path);

}

$url = implode("/", $split_path) . "/" . str_replace("../", "", $link);

}elseif(preg_match("/^\/\//", $link)){

$url = $base["scheme"] . ":" . $link;

}elseif(preg_match("/^\/|^.\//", $link)){

$url = $base["scheme"] . "://" . $base["host"] . $link;

}elseif(preg_match("/^[a-zA-Z0-9]/", $link)){

if(preg_match("/^http/", $link)){

$url = $link;

}else{

$url = $dir . "/" . $link;

}

$links[$k] = $url;

}

return $links;

}

header("content-type: text/plain");

$url = "";

$html = file_get_contents($url);

// Gets links from the page:

print_r(pageLinks($html));

// Gets links from the page and formats them to a full valid url:

print_r(pageLinks($html, $url, true));

更多信息请查看IT技术专栏

上一篇：php对文件进行hash运算的方法

下一篇：php实现递归抓取网页类实例

手机网站地址：php解析字符串里所有URL地址的方法

由于各方面情况的不断调整与变化，提供的所有考试信息和咨询回复仅供参考，敬请考生以权威部门公布的正式信息和咨询为准！

最新信息

2024年玉溪市市政开发建设有限公司项目用工招聘公告

2025年玉溪市红塔区卫生健康系统招聘毕业生综合考核成绩表

2025年昭通市护士执业资格考试报名相关事宜公告

2025年昭通市卫生专业技术资格考试报名相关事宜公告

2024年云南民族文化宫讲解员面试通知

2024年玉溪市检验检测认证院招聘编外人员（第二批）岗位裁减公告

2024年红河州建水泽润环保科技有限责任公司招聘公告

2025年怒江考点全国护士执业资格考试报名公告

2025年怒江考点全国卫生专业技术资格考试报名公告

2024年昆明市官渡区阿拉街道社区卫生服务中心拟录用公告

公考类

云南公务员贵州公务员四川公务员 118金宝搏app 各省公务员国家公务员选调遴选

招聘类

118bet金博宝 118金宝搏三支一扶志愿者银行招聘 118bet金博宝下载

各类考试

学历升学会计考试职业资格医学考试工程考试教师资格