-
StarVector: Generating Scalable Vector Graphics Code from Images
Paper • 2312.11556 • Published • 36 -
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Paper • 2312.12423 • Published • 13 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper • 2312.11392 • Published • 20 -
stabilityai/stable-video-diffusion-img2vid-xt
Image-to-Video • Updated • 191k • 3.21k
Robson Cassio Ribas
rocari
·
AI & ML interests
None yet
Organizations
CV
Agents, Planning & Tools
LLMs
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper • 2310.17796 • Published • 18 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation • 11B • Updated • 27.1k • 643 -
openchat/openchat-3.5-1210
Text Generation • 7B • Updated • 556 • 278
Audio, Speech & Music
-
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 187k • 935 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 6.5M • • 5.24k -
jonatasgrosman/whisper-large-pt-cv11
Automatic Speech Recognition • Updated • 44 • 14 -
openai/whisper-large-v2
Automatic Speech Recognition • 2B • Updated • 46.3k • 1.78k
CodeGen
Image Generation
-
StarVector: Generating Scalable Vector Graphics Code from Images
Paper • 2312.11556 • Published • 36 -
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Paper • 2312.12423 • Published • 13 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper • 2312.11392 • Published • 20 -
stabilityai/stable-video-diffusion-img2vid-xt
Image-to-Video • Updated • 191k • 3.21k
LLMs
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper • 2310.17796 • Published • 18 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation • 11B • Updated • 27.1k • 643 -
openchat/openchat-3.5-1210
Text Generation • 7B • Updated • 556 • 278
CV
Audio, Speech & Music
-
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 187k • 935 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 6.5M • • 5.24k -
jonatasgrosman/whisper-large-pt-cv11
Automatic Speech Recognition • Updated • 44 • 14 -
openai/whisper-large-v2
Automatic Speech Recognition • 2B • Updated • 46.3k • 1.78k
Agents, Planning & Tools
CodeGen