techsupport

2469 readers

2 users here now

The Lemmy community will help you with your tech problems and questions about anything here. Do not be shy, we will try to help you.

If something works or if you find a solution to your problem let us know it will be greatly apreciated.

Rules: instance rules + stay on topic

Partnered communities:

You Should Know

Software gore

Recommendations

founded 1 year ago

MODERATORS

GatoB@lemmy.world

[Solved!] Trying (failing) to use MS Word wildcards to detect any two capitalized letters, and insert a space between them... (lemmy.world)

submitted 3 months ago* (last edited 3 months ago) by Sterile_Technique@lemmy.world to c/techsupport@lemmy.world

9 comments fedilink hide all child comments

Follow-up to this thread - this is way more specific, so hopefully worthy of its own thread. I think wildcards are the best option for my skill level (basically none), and have gotten a good chunk of what I wanted to accomplish done with those.

An issue I've run into and can't seem to google my way out is making TTS pronounce acronyms in a sensible way. For example "PACU" (post-anesthesia care unit) is usually vocalized as "pack-you" but my TTS software likes to say things like "pace-uh". Or "PO" (latin abbreviation for 'by mouth') is vocalized by just saying the letters, but TTS says "Poe". Stuff like that.

When the TTS comes across a capitol letter with a space on either side, it just pronounces the letter, so I'd still lose things like "pack-you" but at least hearing it spell out "pee ay see you" would make sense, vs "pace-uh" which is gibberish and confusing at high playback speeds.

Best I've come up with so far is <([A-Z]{2})> on the Find side, but that's only spotting the two character terms like PO, and ignoring the longer ones... I'd hoped it would see PACU and detect PA, AC, and CU as three distinct sets of two that could cobbled into "P A C U".

Nothing I've done on the Replace side comes close to working. It either does nothing at all, or it'll do something like turn "PO" into <([A- Z]{2})>. Not sure if preserving the original characters is something A-Z is actually capable of - seems not, but I'm kind of an idiot with stuff like this, so any tips would be appreciated!

Thank you!

you are viewing a single comment's thread
view the rest of the comments

[–] unmagical@lemmy.ml 1 points 3 months ago

The parenthetical groups in the search query define what is to be captured. They are numbered from left to right. In this case that is a capital letter assigned to group 1 and then an immediately following capital letter assigned to group 2. If we used a replace of only "\1\2" then we would get no change from the original input. If we want to switch them then we just need to swap the order in the replace "\2\1".