从Golang中的字符串中提取内部子字符串的最佳方式是什么?在Golang中从HTML中提取文本内容
输入:
"Hello <p> this is paragraph </p> this is junk <p> this is paragraph 2 </p> this is junk 2"
输出:
"this is paragraph \n
this is paragraph 2"
是否有围棋任何字符串包/库已经做这样的事情?
package main
import (
"fmt"
"strings"
)
func main() {
longString := "Hello world <p> this is paragraph </p> this is junk <p> this is paragraph 2 </p> this is junk 2"
newString := getInnerStrings("<p>", "</p>", longString)
fmt.Println(newString)
//output: this is paragraph \n
// this is paragraph 2
}
func getInnerStrings(start, end, str string) string {
//Brain Freeze
//Regex?
//Bytes Loop?
}
感谢
2014-01-08 user3173591
+1
[Here](http://golang.org/pkg/regexp)。阅读关于子匹配的部分;它应该帮助你。 –
+0
是的,我看到了,但我不确定这是否是正确的选择。尽管为未来引用添加书签。 –