Word: Clean copy from lists, remove bullet points and leading numbers
Word Aufzählung entfernen Macro (v11.0.3)
With this RegEx you can filter the copied word of a list with numbers or bullet points. Word copies this as standard into the clipboard and you can't search & replace with this. You have to manually remove the leading bullet point or number followed by tab and the space at the end if you double click to highlight.
Or you use this macro.
RegEx: ^\s*(?:[IVXLCDMivxlcdm]+|[a-zA-Z]+|[0-9]+)[.)]?\s+(.?)\s$
It only runs if Word is activated in the front.
Workflow is the same (Cmd+C), but you get a clean word in the field search.
Thanks for your first post in the forum, @mgasperl.
A tip on displaying code: as you may well have noticed, the forum software, “Discourse”, rendered .*?\s* as .?)\s`. That's because Discourse interprets text inside a pair of asterisks as being italicized (it's standard in Markdown).
You can get around that when showing code by wrapping the code inside a pair of backticks…
like this:`^\s*(?:[IVXLCDMivxlcdm]+|[a-zA-Z]+|[0-9]+)[.)]?\s+(.*?)\s*$`
(and you can use pairs of three backticks two wrap blocks of code over multiple lines).
You might find it preferable to put the macro in a macro group that is active only when Word is the active application. For one thing, this makes it easier to set up the use of the same hotkey (e.g. ⌘-C) for multiple macros that do different things for different applications.
Another problem with nested bullet lists in the public.utf8-plain-text component of an MS Word clipboard, is that sub-lists are no more indented than their parent items.
If, for any reason, you need to preserve indentation, while removing bullet and numbering prefixes, for example copying an MS Word nested bullet list like:
to tab-indented plain text:
MS Word bullet outline
Alpha
Delta
Epsilon
Theta
Lambda
Mu
Nu
Iota
Kappa
Zeta
Beta
Gamma
Then you can try something like the macro below, which needs you to have, somewhere in an active KM Group, Ver 0.6 or above of the Filtering XML with XQuery Subroutine
XQuery over public.html component of MS Word clipboard:
for $p in //p
return if (contains($p/@style, "level")) then
(: LEVEL as integer :)
let $level :=
for $t in tokenize($p/@style, ";")
where starts-with($t, "mso-list")
return xs:integer(substring-after(tokenize($t, " ")[2], "level"))
(: One TAB for each level above 1:)
return concat(
string-join((for $i in 2 to $level return "\t"),""),
(: SECOND span only – first is bullet prefix :)
$p//span[matches(., "\p{L}")]/text()[2]
)
else
string($p)