python re.match()用法相关示例

脚本专栏 2025/2/6 佚名

3 2 1

帝王谷资源网 Design By www.wdxyy.com

学习python爬虫时遇到了一个问题，书上有示例如下：

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*)are(.*"htmlcode">

matchObj=re.match(r'(.*)are(.*"htmlcode">

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*)are(.*"matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))
 print("matchObj.group(3):", matchObj.group(3))
else:
 print('No match!\n')




得到的结果是：

matchObj.group(): Cats are smarter than dogs

matchObj.group(1): Cats 

matchObj.group(2): 

matchObj.group(3):  smarter than dogs



可见第二个括号里的内容被默认为空了，然后删去那个？，可以看到结果变成：

matchObj.group(): Cats are smarter than dogs

matchObj.group(1): Cats 

matchObj.group(2):  smarter than dogs

matchObj.group(3): 



那么这是否就意味着？的默认值很可能是0次，那？这个符号到底有什么用呢
仔细想来这个说法并不是很严谨。尝试使用单独的."htmlcode">

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are(.*)"matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))




也能在组别2中正常提取到are之后的字符内容，但稍微改动一下将？放到第二个括号内，
就什么也提取不到，同时导致group(0)中匹配的字符到Cats are就截止了（也就是第二个括号匹配失败）。
令人感到奇怪的是，如果将上面的代码改成


import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are (.*)+',line)

if matchObj:
 print("matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))




也就是仅仅将？改为+，虽然能成功匹配整个line但group(2)中没有内容，
如果把+放到第二个括号中就会产生报错，匹配失败。
那么是否可以认为.*"htmlcode">

import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are (.*r).*',line)

if matchObj:
 print("matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))
 #print("matchObj.group(3):", matchObj.group(3))
else:
 print('No match!\n')




为了泛用性尝试了一下把r改成‘ '但是得到的结果是‘smarter than '。于是尝试把.换成表示任意字母的
[a-zA-Z]，成功提取出了单个smarter，代码如下：


import re

line='Cats are smarter than dogs'
matchObj=re.match(r'(.*) are ([a-zA-Z]* ).*',line)

if matchObj:
 print("matchObj.group():",matchObj.group())
 print("matchObj.group(1):", matchObj.group(1))
 print("matchObj.group(2):", matchObj.group(2))
 #print("matchObj.group(3):", matchObj.group(3))
else:
 print('No match!\n')

python,re.match(),python,re.match

标签：

python,re.match(),python,re.match

帝王谷资源网 Design By www.wdxyy.com

广告合作：本站广告合作请联系QQ：858582 申请时备注：广告合作（否则不回）
免责声明：本站文章均来自网站采集或用户投稿，网站不提供任何软件下载或自行开发的软件！如有用户或公司发现本站内容信息存在侵权行为，请邮件告知！ 858582#qq.com

帝王谷资源网 Design By www.wdxyy.com

评论“python re.match()用法相关示例”

暂无评论...

P70系列延期，华为新旗舰将在下月发布

3月20日消息，近期博主@数码闲聊站透露，原定三月份发布的华为新旗舰P70系列延期发布，预计4月份上市。

而博主@定焦数码爆料，华为的P70系列在定位上已经超过了Mate60，成为了重要的旗舰系列之一。它肩负着重返影像领域顶尖的使命。那么这次P70会带来哪些令人惊艳的创新呢？
根据目前爆料的消息来看，华为P70系列将推出三个版本，其中P70和P70 Pro采用了三角形的摄像头模组设计，而P70 Art则采用了与上一代P60 Art相似的不规则形状设计。这样的外观是否好看见仁见智，但辨识度绝对拉满。

更新日志

2025年02月06日

python re.match()用法相关示例

python,re.match(),python,re.match

Python爬虫实现selenium处理iframe作用域问题

python利用appium实现手机APP自动化的示例

评论“python re.match()用法相关示例”

P70系列延期，华为新旗舰将在下月发布

更新日志

友情链接