ripgrep

twin/ripgrep

mirror of https://github.com/BurntSushi/ripgrep.git synced 2025-08-01 12:41:58 -07:00

Commit Graph

Author	SHA1	Message	Date
Andrew Gallant	2a2b1506d4	Fix a performance bug where using -w could result in very bad performance. The specific issue is that -w causes the regex to be wrapped in Unicode word boundaries. Regrettably, Unicode word boundaries are the one thing our regex engine can't handle well in the presence of non-ASCII text. We work around its slowness by stripping word boundaries in some circumstances, and using the resulting expression as a way to produce match candidates that are then verified by the full original regex. This doesn't fix all cases, but it should fix all cases where -w is used.	2016-09-21 19:12:07 -04:00
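
The candidate-then-verify idea in the commit message can be sketched roughly as follows. This is a minimal illustration, not ripgrep's actual code: it uses Python's `re` module rather than ripgrep's Rust regex engine, the function name `search_word_lines` is hypothetical, and it assumes `-w` behaves like wrapping the pattern in `\b(?:...)\b`. Python's `re` does not have the slowdown described above, so the sketch shows only the shape of the workaround, not its performance benefit.

```python
import re
from typing import Iterable, Iterator


def search_word_lines(pattern: str, lines: Iterable[str]) -> Iterator[str]:
    """Yield lines that match `pattern` as a whole word (hypothetical helper).

    Two regexes are built from the same user pattern:
      * candidate: the pattern without word boundaries (cheap filter)
      * full:      the pattern wrapped in word boundaries (exact check)
    """
    # Assumption: this mirrors what -w conceptually builds.
    full = re.compile(r"\b(?:%s)\b" % pattern)
    # "Stripping" the boundaries gives back the bare pattern. Any line the
    # full regex matches must also match this one, so no match is lost.
    candidate = re.compile(r"(?:%s)" % pattern)

    for line in lines:
        # Step 1: skip lines that cannot possibly match.
        if candidate.search(line) is None:
            continue
        # Step 2: verify the surviving candidates with the full regex.
        if full.search(line) is not None:
            yield line


if __name__ == "__main__":
    sample = ["naïve café", "cafés are open", "no match here", "a café"]
    for hit in search_word_lines("café", sample):
        print(hit)
```

The filter is safe because removing the boundary assertions can only make the expression match more text, never less; the second step then discards the false positives (here, `cafés are open`).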

1 Commit