Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Robson Cassio Ribas's picture

40

Robson Cassio Ribas

rocari

·

rocari

AI & ML interests

None yet

Organizations

rocari 's collections 6

Image Generation

StarVector: Generating Scalable Vector Graphics Code from Images

Paper • 2312.11556 • Published Dec 17, 2023 • 36
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Paper • 2312.12423 • Published Dec 19, 2023 • 13
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Paper • 2312.11392 • Published Dec 18, 2023 • 20
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 191k • 3.21k

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 29
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution

Paper • 2401.00935 • Published Jan 1, 2024 • 18

Agents, Planning & Tools

Nexusflow/NexusRaven-V2-13B

Text Generation • 13B • Updated May 1 • 233 • 469
ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 89

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper • 2310.17796 • Published Oct 26, 2023 • 18
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78
upstage/SOLAR-10.7B-Instruct-v1.0

Text Generation • 11B • Updated Sep 10, 2024 • 27.1k • 643
openchat/openchat-3.5-1210

Text Generation • 7B • Updated May 18, 2024 • 556 • 278

Audio, Speech & Music

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 187k • 935
openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 6.5M • • 5.24k
jonatasgrosman/whisper-large-pt-cv11

Automatic Speech Recognition • Updated Dec 22, 2022 • 44 • 14
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 46.3k • 1.78k

ise-uiuc/Magicoder-S-DS-6.7B

Text Generation • 7B • Updated Mar 6, 2024 • 1.18k • 205
deepseek-ai/deepseek-coder-33b-instruct

Text Generation • 33B • Updated Mar 7, 2024 • 60.8k • 556
Phind/Phind-CodeLlama-34B-v2

Text Generation • Updated Aug 28, 2023 • 2.35k • 833

Image Generation

StarVector: Generating Scalable Vector Graphics Code from Images

Paper • 2312.11556 • Published Dec 17, 2023 • 36
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Paper • 2312.12423 • Published Dec 19, 2023 • 13
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Paper • 2312.11392 • Published Dec 18, 2023 • 20
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 191k • 3.21k

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper • 2310.17796 • Published Oct 26, 2023 • 18
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78
upstage/SOLAR-10.7B-Instruct-v1.0

Text Generation • 11B • Updated Sep 10, 2024 • 27.1k • 643
openchat/openchat-3.5-1210

Text Generation • 7B • Updated May 18, 2024 • 556 • 278

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 29
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution

Paper • 2401.00935 • Published Jan 1, 2024 • 18

Audio, Speech & Music

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 187k • 935
openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 6.5M • • 5.24k
jonatasgrosman/whisper-large-pt-cv11

Automatic Speech Recognition • Updated Dec 22, 2022 • 44 • 14
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 46.3k • 1.78k

Agents, Planning & Tools

Nexusflow/NexusRaven-V2-13B

Text Generation • 13B • Updated May 1 • 233 • 469
ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 89

ise-uiuc/Magicoder-S-DS-6.7B

Text Generation • 7B • Updated Mar 6, 2024 • 1.18k • 205
deepseek-ai/deepseek-coder-33b-instruct

Text Generation • 33B • Updated Mar 7, 2024 • 60.8k • 556
Phind/Phind-CodeLlama-34B-v2

Text Generation • Updated Aug 28, 2023 • 2.35k • 833

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs