从Golang中的字符串中提取内部子字符串的最佳方式是什么?在Golang中从HTML中提取文本内容

输入:

"Hello <p> this is paragraph </p> this is junk <p> this is paragraph 2 </p> this is junk 2" 

输出:

"this is paragraph \n 
this is paragraph 2" 

是否有围棋任何字符串包/库已经做这样的事情?

package main 

import (
    "fmt" 
    "strings" 
) 

func main() { 
    longString := "Hello world <p> this is paragraph </p> this is junk <p> this is paragraph 2 </p> this is junk 2" 

    newString := getInnerStrings("<p>", "</p>", longString) 

    fmt.Println(newString) 
    //output: this is paragraph \n 
    //  this is paragraph 2 

} 
func getInnerStrings(start, end, str string) string { 
    //Brain Freeze 
     //Regex? 
     //Bytes Loop? 
} 

感谢

2014-01-08 user3173591

+1

[Here](http://golang.org/pkg/regexp)。阅读关于子匹配的部分;它应该帮助你。 –

+0

是的,我看到了,但我不确定这是否是正确的选择。尽管为未来引用添加书签。 –