Flow: Azure TTS (SSML or text) -> one generated clip -> sent to both blend APIs with the same word target.
No result yet.