Description: https://help.libreoffice.org/Common/List_of_Regular_Expressions doesn't say anything about regexp lookaround patterns (see https://www.regular-expressions.info/lookaround.html).
Steps to Reproduce: I'd love to add this information but https://help.libreoffice.org/Main_Page says there's no way to participate.
Actual Results: Lookaround patterns do in fact work
Expected Results: =SEARCH("(?<=a)b", "cab") // 3
Reproducible: Always
User Profile Reset: No
Additional Info: User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/62.0.3202.89 Safari/537.36
help.libreoffice.org is not editable because it is a transformation of the local help system, which is written in XML. However, if you are willing to write about the issue, please write it up in a LibreOffice Writer document and attach it to this bug. I'll review it and convert it to XML for the help system, with your authorship. Please be factual and concise: your content will be translated into more than 60 languages.
Actually, LibreOffice's RegEx support comes from the ICU libraries, and our documentation is derived from theirs, including coverage of the ICU RegEx operators for lookahead, lookbehind, and their negatives (the lookarounds). The Help now links to the Wiki at https://wiki.documentfoundation.org/Documentation/HowTo/Writer/Regular_Expressions or at https://wiki.documentfoundation.org/Documentation/HowTo/Calc/Regular_Expressions, both of which correctly defer to the ICU content [1]. That content is a little thin but authoritative, and it should be the basis for additions to our Help and Documentation, with some practical examples included, preferably on our Wiki. The OOo-era content at AOO should probably go away.
=-ref-=
[1] http://userguide.icu-project.org/strings/regexp#TOC-Regular-Expression-Operators
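As a rough illustration of the four lookaround forms mentioned above (lookahead, lookbehind, and their negatives), here is a minimal sketch in Python. Python's `re` module is only a stand-in here: it is not the ICU engine that LibreOffice uses, and the two engines differ in some details, so results should be confirmed in LibreOffice itself. The helper `search_1based` is hypothetical; it simply mimics the 1-based position that the original report expects from `SEARCH`.

```python
import re


def search_1based(pattern, text):
    """Return the 1-based position of the first match, or None if no match.

    Hypothetical helper: imitates the 1-based result of the SEARCH example
    in the report, using Python's re engine instead of ICU.
    """
    match = re.search(pattern, text)
    return match.start() + 1 if match else None


# Positive lookbehind: a "b" that is preceded by "a".
print(search_1based(r"(?<=a)b", "cab"))  # 3

# Negative lookbehind: a "b" that is NOT preceded by "a" (none in "cab").
print(search_1based(r"(?<!a)b", "cab"))  # None

# Positive lookahead: an "a" that is followed by "b".
print(search_1based(r"a(?=b)", "cab"))   # 2

# Negative lookahead: an "a" that is NOT followed by "b" (none in "cab").
print(search_1based(r"a(?!b)", "cab"))   # None
```

In each case the lookaround only tests the neighbouring text; the characters it inspects are not part of the reported match.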
Also missing is \d as a synonym for [:digit:], \s for [:space:] and \w for [:word:]
(In reply to gmolleda from comment #3)
> Also missing is \d as a synonym for [:digit:], \s for [:space:] and \w for [:word:]
Those are fully described in the linked ICU libs documentation: https://unicode-org.github.io/icu/userguide/strings/regexp.html#regular-expression-metacharacters
But the unmaintained OOo-era Wiki content needs to be replaced.
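For readers unfamiliar with the shorthand classes under discussion, a small sketch of \d, \s and \w follows. It uses Python's `re` module purely for illustration and is not the ICU engine; in particular, Python's `re` does not support the POSIX-style [:digit:] bracket classes, so only the shorthand forms are shown. The sample string is made up for this example.

```python
import re

# Made-up sample text for illustration only.
sample = "tdf 113975 fix_1"

# \d matches a digit; \d+ collects runs of digits.
digits = re.findall(r"\d+", sample)

# \w matches a word character (letters, digits, underscore).
words = re.findall(r"\w+", sample)

# \s matches whitespace; here it is used as a field separator.
parts = re.split(r"\s+", sample)

print(digits)  # ['113975', '1']
print(words)   # ['tdf', '113975', 'fix_1']
print(parts)   # ['tdf', '113975', 'fix_1']
```

The exact character sets covered by each class are engine-specific, so the ICU documentation linked above remains the reference for LibreOffice behaviour.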
For reference, the List_of_Regular_Expressions URL in the first report is no longer valid; we can assume it is now https://help.libreoffice.org/latest/en-US/text/shared/01/02100001.html
(In reply to fpy from comment #5)
> for reference, the URL List_of_Regular_Expressions in first report is not valid anymore,
> we shall assume it's now https://help.libreoffice.org/latest/en-US/text/shared/01/02100001.html
Yep, that looks correct. Thanks!
Meanwhile, the user guide listing provided by the ICU libs project at https://unicode-org.github.io/icu/userguide/strings/regexp.html#regular-expression-metacharacters remains definitive.
Pierre F committed a patch related to this issue. It has been pushed to "master":
https://git.libreoffice.org/help/commit/ade7397aed9febc73918d93ffdf6477d3f4d4175
explicit \s and \d + put the note first for full ICU spec. tdf#113975
Thanks for the commit. Can this be closed as fixed?
Pierre F committed a patch related to this issue. It has been pushed to "libreoffice-24-8":
https://git.libreoffice.org/help/commit/92a38c452c2e71463a3331431fbc24b6e3d3c6c0
explicit \s and \d + put the note first for full ICU spec. tdf#113975