[Cartesia] Upgrading the message cutoff for Cartesia Synthesizer to use timestamps #700

chongzluong · 2024-09-05T00:24:53Z

Overview

Discussed with Ajay over Slack, recapping below.

There's a distinction between Cartesia's version of continuations and Vocode's general expectation for continuations. Vocode seems to expect an N:N ratio of senders to receivers, but our continuations is an N:1 ratio of senders to receivers. Vocode's get_message_up_to also reflects this expected N:N approach.

The proposed solution to this is to start storing 2 new variables self.ctx_message and self.ctx_timestamps. The Cartesia TTS now requests timestamps, and those timestamps are used to indicate what message we've gotten up to. In the event that the timestamps aren't available for some reason, or in the event that timestamps are delayed beyond a 2 second gap, we fall back to using an estimated wpm and the self.ctx_message to get a best approximation.

Testing

I set up the telephony_app per the Vocode directions. I then adjusted the synthesizer and played around with it locally on my own Vocode deployment to check that it works as intended.

vocode/streaming/synthesizer/cartesia_synthesizer.py

ajar98

will approve / merge once linting is good! can run make lint from the root dir

cyrilS-dev · 2024-09-07T12:01:03Z

This PR is causing an error :

vocode.streaming.synthesizer.cartesia_synthesizer:chunk_generator:185 - Caught error while receiving audio chunks from CartesiaSynthesizer: Failed to generate audio:
Error generating audio:
error processing TTS request: Language must be specified for timestamps.

Upgrading the message cutoff for Cartesia Synthesizer to use timestamps

dad4d88

ajar98 reviewed Sep 5, 2024

View reviewed changes

vocode/streaming/synthesizer/cartesia_synthesizer.py Show resolved Hide resolved

ajar98 reviewed Sep 5, 2024

View reviewed changes

Linting

19b2070

chongzluong requested a review from ajar98 September 5, 2024 18:12

chongzluong and others added 2 commits September 5, 2024 11:23

Adding typing to

86a9feb

fix lint

57b52dc

ajar98 approved these changes Sep 6, 2024

View reviewed changes

ajar98 merged commit dc983a0 into vocodedev:main Sep 6, 2024
4 checks passed

cyrilS-dev mentioned this pull request Sep 7, 2024

Add language support to Cartesia synthesizer #703

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Cartesia] Upgrading the message cutoff for Cartesia Synthesizer to use timestamps #700

[Cartesia] Upgrading the message cutoff for Cartesia Synthesizer to use timestamps #700

chongzluong commented Sep 5, 2024

ajar98 left a comment •

edited

Loading

cyrilS-dev commented Sep 7, 2024

[Cartesia] Upgrading the message cutoff for Cartesia Synthesizer to use timestamps #700

[Cartesia] Upgrading the message cutoff for Cartesia Synthesizer to use timestamps #700

Conversation

chongzluong commented Sep 5, 2024

Overview

Testing

ajar98 left a comment • edited Loading

Choose a reason for hiding this comment

cyrilS-dev commented Sep 7, 2024

ajar98 left a comment •

edited

Loading