markdown 尖叫青蛙自定义提取的正则表达式Cheatsheet
Posted
tags:
篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了markdown 尖叫青蛙自定义提取的正则表达式Cheatsheet相关的知识,希望对你有一定的参考价值。
## Google Analytics ID
#### Returns Google Analytics UA ID
```
["'](UA-.*?)["']
```
## Google Search Console
#### Returns Google Search Console Verification String
```
<meta\s+content=(?:'|")([^'">]+)(?:'|")\s+name=(?:'|")google-site-verification(?:'|")>
```
## Bing Webmaster Tools
#### Returns Bing Webmaster Tools Validation String
```
<meta\s+content=(?:'|")([^'">]+)(?:'|")\s+name=(?:'|")msvalidate.01(?:'|")>
```
## JSON-LD
#### Returns Property Name and Content
```
("image": ".*?")
```
```
("logo": ".*?")
```
```
("description": ".*?")
```
#### Returns all content within JSON-LD script tags
```
<script type=\"application\/ld\+json\">(.*?)</script>
```
## Microdata
#### Returns all itemtypes in the document
```
itemtype='(.*?)'
```
#### Returns all itemprops in the document
```
itemprop='(.*?)'
```
## Open Graph Protocol
#### Returns Open Graph Title when Content follows Property
```
<meta [^>]*property=[\"']og:title[\"'] [^>]*content=[\"']([^'^\"]+?)[\"'][^>]*>
```
#### Returns Open Graph Title when Property follows Content
```
<meta [^>]*content=[\"']([^'^\"]+?)[\"'] [^>]*property=[\"']og:title[\"'][^>]*>
```
## Twitter Cards
#### Returns Twitter Card Name when Content follows Name
```
<meta [^>]*name=[\"']twitter:title[\"'] [^>]*content=[\"']([^'^\"]+?)[\"'][^>]*>
```
#### Returns Twitter Card Name when Name follows Content
```
<meta [^>]*content=[\"']([^'^\"]+?)[\"'] [^>]*name=[\"']twitter:title[\"'][^>]*>
```
## Images
#### Returns Image Source URL for List Mode
```
<img\s.*?src=(?:'|")([^'">]+)(?:'|")
```
##### Sources:
* https://www.screamingfrog.co.uk/web-scraping/
* https://stackoverflow.com/questions/17712062/regex-for-parsing-facebook-open-graph-meta-tag
* https://stackoverflow.com/questions/18664997/how-can-i-use-regular-expression-to-grab-an-img-tag
以上是关于markdown 尖叫青蛙自定义提取的正则表达式Cheatsheet的主要内容,如果未能解决你的问题,请参考以下文章
正则表达式部分提取php代码((数组定义))
干货 | Logstash自定义正则表达式ETL实战
Java正则表达式提取标签之间的文本
使用正则提取字符串中URL等信息
通过定义标题的正则表达式拆分 Markdown 文本文件
Jmeter 参数化方式提取上个接口的参数函数助手常用的函数(用户定义的变量CSV数据文件设置用户参数JSON提取器正则表达式提取器)