Any plans to offer speech to speech models which keep prosody, intonation, and t...

jeffharris · 2025-03-20T23:24:20 1742513060

we'll keep expanding these GPT-4o-based models with more controls. Is the main feature you're missing custom voices?

oidar · 2025-03-21T04:38:25 1742531905

No, not custom voices - but voices that can be influenced by a recording. As in, a male voice actor records a part, and the model transforms it to a female part - keeping all the prosody, intonation and timing in the original recording. This would allow one voice actor to do many roles.