Golang正则表达式使用及简单示例

(?!re)

The regexp implementation provided by this package is guaranteed to run in time linear in the size of the input.

所以，本质上这不是一个「够不够完善」的问题，而是「技术实现决定了它支持程度就是这样」。更多讨论可以关注这里。

Go的正则表达式采用RE2语法，详细语法及支持情况可以参阅[这里]。

regex包的设计原则

regexp包的方法命名规则如下：

Find(All)?(String)?(Submatch)?(Index)?

包含All的方法捕获所有match, 返回值是一个slice. 同时一般会提供一个参数n作为最大匹配次数。
包含String的方法对string类型进行匹配，反之对[]byte进行匹配。
包含Submatch的方法返回所有子匹配，返回值是一个slice. 位置0是对应整个正则表达式匹配结果，位置n(n>0)是第n个子表达式(group) 匹配结果。
包含Index的方法返回匹配的位置。例如，返回loc []int, 则与之对应的匹配字符为src[loc[0]:loc[1]].

Unicode character class (one-letter name): \pN
Unicode character class: \p{Greek}

test1 := “123中文汉字abc321”

reg1 := regexp.MustCompile(`\p{L}+`)

match := reg1.FindString(test1)

log.Println(match)

// 输出：中文汉字abc

test1 := `123中文汉字abc321`

reg2 := regexp.MustCompile(`(?P<All>(?P<Number>\d+)(?P<Letter>\p{L}+)(?P<Number>\d+))`)

names := reg2.SubexpNames()

for k, v := range names {

log.Printf("%d: %s", k, v)

}

matches := reg2.FindStringSubmatch(test1)

for k, v := range matches {

log.Printf("%d: %s", k, v)

}

// 输出：

// 2016/04/25 16:26:18 0:

// 2016/04/25 16:26:18 1: All

// 2016/04/25 16:26:18 2: Number

// 2016/04/25 16:26:18 3: Letter

// 2016/04/25 16:26:18 4: Number

// 2016/04/25 16:26:18 0: 123中文汉字abc321

// 2016/04/25 16:26:18 1: 123中文汉字abc321

// 2016/04/25 16:26:18 2: 123

// 2016/04/25 16:26:18 3: 中文汉字abc

// 2016/04/25 16:26:18 4: 321

–EOF–

版权声明
转载请注明出处，本文原始链接