Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Aisha Halder's picture

6 20

Aisha Halder

Ahalder

·

AishoHalder

AI & ML interests

AI & ML,Networking,P2P

Organizations

None yet

Ahalder 's collections 16

openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5 • 101k • 1.27k

Snowflake/snowflake-arctic-embed-l-v2.0

Sentence Similarity • 0.6B • Updated Jul 28 • 797k • • 220

OpenGVLab/InternVL2-2B

Image-Text-to-Text • 2B • Updated Mar 25 • 1.15M • 76

Image generation

UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion

Paper • 2401.13388 • Published Jan 24, 2024 • 13
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models

Paper • 2401.13974 • Published Jan 25, 2024 • 14
Runtime error

420

Real ESRGAN

🏃

420
Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Mar 25 • 10 • 39

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

Paper • 1910.01108 • Published Oct 2, 2019 • 21
distilbert/distilbert-base-uncased-finetuned-sst-2-english

Text Classification • 67M • Updated Dec 19, 2023 • 3.41M • • 860
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Paper • 2401.14112 • Published Jan 25, 2024 • 20
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8, 2024 • 21

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 72

Video generattion

Runtime error

42

Vchitect 2.0

🐢

42

Generate videos from text prompts

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 15.2k • 1.53k

google/timesfm-2.0-500m-pytorch

Time Series Forecasting • 0.5B • Updated Apr 16 • 8.24k • 230

tensoropera/Fox-1-1.6B

Text Generation • 2B • Updated Nov 21, 2024 • 381 • 33

Image Processing

Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

Paper • 2401.13627 • Published Jan 24, 2024 • 78
Runtime error

420

Real ESRGAN

🏃

420
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 424 • 1.7k
NexaAI/OmniVLM-968M

0.5B • Updated Aug 20 • 2.99k • 528

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 50
Nfiniteai/product-masks-sample

Viewer • Updated Sep 5, 2024 • 2.71k • 536 • 14
HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 9.45k • 339
rulins/MassiveDS-140B

Viewer • Updated Jul 17, 2024 • 3.08M • 1.96k • 7

Speech and Audio

facebook/wav2vec2-base-960h

Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 2.17M • 384
ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 59
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 18
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 214 • 72

finegrain/finegrain-box-segmenter

Mask Generation • Updated Sep 11, 2024 • 8.29k • 124
Running on Zero

510

Finegrain Object Cutter

✂

510

Create HD cutouts from any image with just a prompt

mixedbread-ai/mxbai-colbert-large-v1

0.3B • Updated Mar 13 • 20k • 52
jinaai/jina-embeddings-v3

Feature Extraction • 0.6B • Updated Feb 24 • 4.98M • 1.11k
Runtime error

8

Paper Whisperer

📈

8

Paper Whisperer

Runtime error

80

Dailypapershackernews

📈

80
Prithvi WxC: Foundation Model for Weather and Climate

Paper • 2409.13598 • Published Sep 20, 2024 • 45
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles

Paper • 2410.05262 • Published Oct 7, 2024 • 11
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

Paper • 2410.15316 • Published Oct 20, 2024 • 12

openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5 • 101k • 1.27k

google/timesfm-2.0-500m-pytorch

Time Series Forecasting • 0.5B • Updated Apr 16 • 8.24k • 230

Snowflake/snowflake-arctic-embed-l-v2.0

Sentence Similarity • 0.6B • Updated Jul 28 • 797k • • 220

tensoropera/Fox-1-1.6B

Text Generation • 2B • Updated Nov 21, 2024 • 381 • 33

OpenGVLab/InternVL2-2B

Image-Text-to-Text • 2B • Updated Mar 25 • 1.15M • 76

Image Processing

Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

Paper • 2401.13627 • Published Jan 24, 2024 • 78
Runtime error

420

Real ESRGAN

🏃

420
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 424 • 1.7k
NexaAI/OmniVLM-968M

0.5B • Updated Aug 20 • 2.99k • 528

Image generation

UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion

Paper • 2401.13388 • Published Jan 24, 2024 • 13
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models

Paper • 2401.13974 • Published Jan 25, 2024 • 14
Runtime error

420

Real ESRGAN

🏃

420
Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Mar 25 • 10 • 39

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 50
Nfiniteai/product-masks-sample

Viewer • Updated Sep 5, 2024 • 2.71k • 536 • 14
HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 9.45k • 339
rulins/MassiveDS-140B

Viewer • Updated Jul 17, 2024 • 3.08M • 1.96k • 7

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

Paper • 1910.01108 • Published Oct 2, 2019 • 21
distilbert/distilbert-base-uncased-finetuned-sst-2-english

Text Classification • 67M • Updated Dec 19, 2023 • 3.41M • • 860
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Paper • 2401.14112 • Published Jan 25, 2024 • 20
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8, 2024 • 21

Speech and Audio

facebook/wav2vec2-base-960h

Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 2.17M • 384
ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 59
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 18
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 214 • 72

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 72

finegrain/finegrain-box-segmenter

Mask Generation • Updated Sep 11, 2024 • 8.29k • 124
Running on Zero

510

Finegrain Object Cutter

✂

510

Create HD cutouts from any image with just a prompt

Video generattion

Runtime error

42

Vchitect 2.0

🐢

42

Generate videos from text prompts

mixedbread-ai/mxbai-colbert-large-v1

0.3B • Updated Mar 13 • 20k • 52
jinaai/jina-embeddings-v3

Feature Extraction • 0.6B • Updated Feb 24 • 4.98M • 1.11k
Runtime error

8

Paper Whisperer

📈

8

Paper Whisperer

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 15.2k • 1.53k

Runtime error

80

Dailypapershackernews

📈

80
Prithvi WxC: Foundation Model for Weather and Climate

Paper • 2409.13598 • Published Sep 20, 2024 • 45
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles

Paper • 2410.05262 • Published Oct 7, 2024 • 11
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

Paper • 2410.15316 • Published Oct 20, 2024 • 12

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs