Regex can be a powerful tool when dealing with dynamic data. That being said, it is difficult to craft a regular expression that does precisely what you want for every possible input. Unexpected edge cases or carefully crafted malicious payloads may bypass regex filtering, resulting in unexpected or unsafe behavior.
Regex Best Practices
Prioritize specialized packages or libraries that are designed to perform specific parsing or filtering functions, such as parsing HTML, validating email addresses, or sanitizing user input that may influence code functionality.
ex. Using Python's urllib.parse module over regex to filter out specific URL schemes
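For instance, a minimal sketch of scheme filtering with urllib.parse instead of a handwritten regex (the allowed-scheme set here is an illustrative assumption):

from urllib.parse import urlparse

# Illustrative allowlist; adjust to the schemes your application actually supports.
ALLOWED_SCHEMES = {"http", "https"}

def is_allowed_url(url: str) -> bool:
    # Returns True only when the URL parses cleanly and uses an allowed scheme.
    try:
        parsed = urlparse(url)
    except ValueError:
        return False
    return parsed.scheme.lower() in ALLOWED_SCHEMES and bool(parsed.netloc)

print(is_allowed_url("https://example.com/page"))  # True
print(is_allowed_url("javascript:alert(1)"))       # False
print(is_allowed_url("ftp://example.com/file"))    # False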
Use previously validated patterns for common use cases.
ex. OWASP's validation regex repository for URLs, IPs, usernames, and passwords.
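A sketch of how a vetted pattern might be applied; the IPv4 expression below is a commonly used one shown only for illustration, and the exact pattern should be copied from the OWASP repository (or another vetted source) rather than retyped:

import re

# Commonly used IPv4 pattern, shown for illustration; copy the exact expression
# from a vetted source such as OWASP's validation regex repository.
IPV4_PATTERN = re.compile(
    r"((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)\.){3}"
    r"(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)"
)

def is_ipv4(value: str) -> bool:
    # fullmatch anchors the pattern to the entire string.
    return IPV4_PATTERN.fullmatch(value) is not None

print(is_ipv4("192.168.0.1"))  # True
print(is_ipv4("256.1.1.1"))    # False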
Avoid using Evil Regex. These are regex patterns that get stuck in exponential backtracking on specially crafted inputs, causing excessive CPU usage and potential system downtime (a regular expression denial of service, or ReDoS).
ex. (a+)+, ([a-zA-Z]+)*, (.*a){x} for x > 10
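As a rough illustration of the danger, the sketch below times the (a+)+ pattern against inputs that can never match; the lengths are kept small on purpose, since matching time grows roughly exponentially with each extra character on CPython's backtracking engine:

import re
import time

# Evil pattern: the nested quantifiers let the engine split the run of 'a's
# between the inner and outer groups in exponentially many ways.
EVIL = re.compile(r"^(a+)+$")

for n in (20, 22, 24, 26):
    # A run of 'a' followed by '!' can never match, so every possible split
    # is tried before the engine gives up.
    payload = "a" * n + "!"
    start = time.perf_counter()
    EVIL.match(payload)
    print(f"n={n}: {time.perf_counter() - start:.3f}s")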
Use well-known and trusted tools for building, linting, validating, and testing regex.
ex. regex101, RegExr
Limit regex complexity. Overly complex regex can be difficult to create correctly and can lead to performance issues.
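One way to keep each expression small is to split the input first and validate the pieces with short, anchored patterns or plain string checks; a minimal sketch using a made-up "name:port" format:

import re

# Hypothetical "name:port" format, used purely for illustration.
NAME = re.compile(r"[a-z][a-z0-9-]{0,31}")

def is_valid_endpoint(value: str) -> bool:
    # Splitting first keeps each check small and easy to reason about,
    # compared with one monolithic pattern for the whole string.
    name, sep, port = value.partition(":")
    if not sep or not NAME.fullmatch(name):
        return False
    return port.isdigit() and 1 <= int(port) <= 65535

print(is_valid_endpoint("web-01:8080"))   # True
print(is_valid_endpoint("Web_01:99999"))  # False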
Use regex timeouts when available so that a runaway match cannot consume unbounded CPU time.
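Python's built-in re module has no timeout option, so one way to bound matching time is to run the match in a separate process and stop it at a deadline; a rough sketch (some third-party engines and other languages expose timeouts directly):

import multiprocessing
import re

def _match_worker(pattern, text, queue):
    # Runs in a child process so a runaway match can be terminated from outside.
    queue.put(re.fullmatch(pattern, text) is not None)

def match_with_timeout(pattern, text, seconds):
    # Returns True/False for the match, or None if the deadline was exceeded.
    queue = multiprocessing.Queue()
    proc = multiprocessing.Process(target=_match_worker, args=(pattern, text, queue))
    proc.start()
    proc.join(seconds)
    if proc.is_alive():
        proc.terminate()
        proc.join()
        return None
    return queue.get()

if __name__ == "__main__":
    print(match_with_timeout(r"(a+)+$", "a" * 40 + "!", 1.0))  # None (timed out)
    print(match_with_timeout(r"[a-z]+", "hello", 1.0))         # True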
Avoid using regex when the incoming data is unconstrained or from an unknown source.
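For untrusted input, a defensive sketch: cap the length up front and prefer simple string checks over regex where they suffice (the limit and character rules here are arbitrary placeholders):

MAX_INPUT_LENGTH = 256  # Arbitrary cap; tune it to the field being validated.

def safe_token_check(value: str) -> bool:
    # Reject oversized input outright so no pattern ever sees a huge payload.
    if len(value) > MAX_INPUT_LENGTH:
        return False
    # Plain string methods handle many cases without any regex at all.
    return value.isascii() and value.isalnum()

print(safe_token_check("abc123"))         # True
print(safe_token_check("a" * 10_000))     # False
print(safe_token_check("<script>alert"))  # False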