Spaces:

nineninesix
/

KaniTTS

Running on Zero

Token limit?

by jujutechnology - opened Oct 2

Oct 2

Hi. The model only outputs approximately 16 seconds of audio no matter how long the text is. Is this just a limitation of this Space or is the model not able to do longer form text?

Simonlob

NineNineSix org about 1 month ago

This model is pre-trained on audio up to 15 seconds, which is okay for streaming but not very good for generating long sentences. On longer sentences it may show instability.

D3vShoaib

23 days ago

is there a streaming implementation example somewhere ? if not maybe high-level instructions on how to about it, thank you for incredible work 👍👍👍👍

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment