Question 1

Why doesn't my regex work the same in JavaScript and Python?

Accepted Answer

Regular expression engines differ between languages. JavaScript historically lacked lookbehind support (added in ES2018), while Python has had it for years. Named capture group syntax differs — JavaScript uses (?...) while some older engines use (?P...). Unicode handling, flag names, and behaviour around newlines also vary. The s (dotAll) flag is re.DOTALL in Python. Always test in the target language's engine — this tool uses JavaScript's ECMAScript regex implementation.

Question 2

How do I match a literal dot or bracket?

Accepted Answer

Special characters like ., [, (, *, +, and ? have meaning in regex syntax. To match them literally, escape with a backslash: \. matches a period, \[ matches a square bracket. Alternatively, place the character inside a character class: [.] matches a literal dot (most special characters lose their meaning inside []). Inside a character class, only ], \, ^ (at start), and - (between characters) need escaping.

Question 3

What's the difference between .* and .*?

Accepted Answer

Both match "any character, zero or more times," but they differ in greediness. .* is greedy — it matches as much text as possible, then backtracks. .*? is lazy (or reluctant) — it matches as little as possible. Example: given hello world, the pattern .* matches the entire string (greedy), while .*? matches just hello (lazy). When extracting content between delimiters, lazy quantifiers are almost always what you want.

Question 4

Is regex the right tool for parsing HTML?

Accepted Answer

No. This is one of the most famous answers in programming — regular expressions cannot reliably parse HTML because HTML is a nested, context-sensitive language, and regex (in the formal computer science sense) can only match regular languages. A regex can't track matching opening and closing tags, handle self-closing elements, or deal with attributes containing special characters. For HTML parsing, use a proper DOM parser like the browser's built-in DOMParser, cheerio (Node.js), or BeautifulSoup (Python). Regex is fine for quick-and-dirty text extraction from known, simple structures — but never for general HTML processing.

Pattern	Meaning	Example
`.`	Any character (except newline by default)	`a.c → "abc", "a1c"`
`\d`	Digit [0-9]	`\d{3} → "123", "456"`
`\w`	Word character [a-zA-Z0-9_]	`\w+ → "hello", "var_1"`
`\s`	Whitespace (space, tab, newline)	`a\sb → "a b", "a\tb"`
`[abc]`	Any character in the set	`[aeiou] → vowels`
`[a-z]`	Character range	`[A-Z] → uppercase letters`
`[^abc]`	Any character NOT in the set	`[^0-9] → non-digits`
`^`	Start of string (or line with m flag)	`^Hello → starts with "Hello"`
`$`	End of string (or line with m flag)	`end$ → ends with "end"`
`*`	Zero or more (greedy)	`ab*c → "ac", "abc", "abbc"`
`+`	One or more (greedy)	`ab+c → "abc", "abbc"`
`?`	Zero or one (optional)	`colou?r → "color", "colour"`
`{n}`	Exactly n times	`\d{4} → "2026"`
`{n,m}`	Between n and m times	`\d{2,4} → "12", "123", "1234"`
`()`	Capture group	`(\d+)px → captures "12" from "12px"`
`(?:)`	Non-capturing group	`(?:ab)+ → groups without capturing`
`\|`	Alternation (OR)	`cat\|dog → "cat" or "dog"`
`\b`	Word boundary	`\bword\b → whole word match`

Regex tester

What are regular expressions?

Regex syntax quick reference

Common regex patterns

Frequently asked questions

Regex tester

What are regular expressions?

Regex syntax quick reference

Common regex patterns

Frequently asked questions

Related tools