I want to speak with the top engineers building real-time speech-to-speech voice models Open to all architectures - cascade/direct, simultaneous/chunked, mel/codec My DM’s are open 🖤
39,41K