What

The regular expression engine reads the expression. There are 2 kinds of popular engines:

Most of Linux tools support at least BRE. sed supports only BRE, while awk supports ERE.

BRE

RE is case sensitive.

special character: .*[]^${}+?|() / use \. instead.

^ denotes the head of the line. If it not appeared at the beginning of the expression, it is a normal character
$ denotes the end of the line.
. denotes there must be a character. e.g. .at does not match at.
character class []. any character appears in [] is ok. use to exclusive some characters.

special character class

* denotes this character appears 0 or any time. use ‘bla.*bla’ to skip some words. it also works for character class.

? denotes the character can appear 0 or 1 time.
+ denotes the character must appear at least once.
{} denotes the interval, which means the character must appears given m times(e.g. a{1}), or at least m times and at most n times(e.g. a{1,2}). Note in gawk ‘—re-interval’ must be specified.
| denotes any expression matched is ok.
() denotes the words in the () is seemed as one character.

断言保证我们所匹配的字符串满足断言所描述的模式。（但是该断言描述的模式并不属于匹配的字符串）

以正向后行断言举例。(?<=(T|t)he\s)(fat|mat) 匹配The或the后面的fat或mat

如正则表达式 /at(.)?$/gm，表示：小写字母 a，后跟小写字母 t，匹配除了换行符以外任意字符零次或一次。而且因为 m 标记，现在正则表达式引擎匹配字符串中每一行的末尾。