Word: Clean copy from lists, remove bullet points and leading numbers (v11.0.3)

Word: Clean copy from lists, remove bullet points and leading numbers
Word Aufzählung entfernen Macro (v11.0.3)

With this RegEx you can filter the copied word of a list with numbers or bullet points. Word copies this as standard into the clipboard and you can't search & replace with this. You have to manually remove the leading bullet point or number followed by tab and the space at the end if you double click to highlight.
Or you use this macro.
RegEx: ^\s*(?:[IVXLCDMivxlcdm]+|[a-zA-Z]+|[0-9]+)[.)]?\s+(.?)\s$

It only runs if Word is activated in the front.
Workflow is the same (Cmd+C), but you get a clean word in the field search.

Word Aufzählung entfernen.kmmacros (5.1 KB)

Thanks for your first post in the forum, @mgasperl.

A tip on displaying code: as you may well have noticed, the forum software, “Discourse”, rendered .*?\s* as .?)\s`. That's because Discourse interprets text inside a pair of asterisks as being italicized (it's standard in Markdown).

You can get around that when showing code by wrapping the code inside a pair of backticks…

like this:`^\s*(?:[IVXLCDMivxlcdm]+|[a-zA-Z]+|[0-9]+)[.)]?\s+(.*?)\s*$`

(and you can use pairs of three backticks two wrap blocks of code over multiple lines).

You might find it preferable to put the macro in a macro group that is active only when Word is the active application. For one thing, this makes it easier to set up the use of the same hotkey (e.g. ⌘-C) for multiple macros that do different things for different applications.

I wonder if it would be enough to replace

^[^\t]+\t

with \t ?

( It seems that itemizing prefixes are always separated from the text, in the public.utf8-plain-text component of an MS Word clipboard, by a tab )

1 Like

Another problem with nested bullet lists in the public.utf8-plain-text component of an MS Word clipboard, is that sub-lists are no more indented than their parent items.

If, for any reason, you need to preserve indentation, while removing bullet and numbering prefixes, for example copying an MS Word nested bullet list like:

Screenshot 2025-04-02 at 1.29.13 pm

to tab-indented plain text:

MS Word bullet outline
Alpha
	Delta
	Epsilon
		Theta
			Lambda
			Mu
			Nu
		Iota
		Kappa
	Zeta
Beta
Gamma

Then you can try something like the macro below, which needs you to have, somewhere in an active KM Group, Ver 0.6 or above of the Filtering XML with XQuery Subroutine

Macro using that subroutine:

MS Word Bullet Nest copied as Indented Text.kmmacros (12 KB)


XQuery over public.html component of MS Word clipboard:

for $p in //p
return if (contains($p/@style, "level")) then
	(: LEVEL as integer :)
	let $level := 
		for $t in tokenize($p/@style, ";") 
		where starts-with($t, "mso-list")
		return xs:integer(substring-after(tokenize($t, " ")[2], "level"))

	(: One TAB for each level above 1:)
	return concat(
		string-join((for $i in 2 to $level return "\t"),""),

		(: SECOND span only – first is bullet prefix :)
		$p//span[matches(., "\p{L}")]/text()[2]
	)
else
	string($p)