Granite Docling not working using vllm
#20
by
SadiaSid
- opened
Hey @SadiaSid , this is a known issue with vLLM with word tied embeddings. We uploaded the untied under a branch in this repo "untied" this works right away with vLLM, we will update the readme with this note. I will close the issue but feel free to open if something else comes up!
asnassar
changed discussion status to
closed
For simplicity, you can serve it like this:
vllm serve ibm-granite/granite-docling-258M --revision untied
Does this impact performance?
i got a very bad result with vllm serving, it doesnt even allow me to convert it with docling package
