Joshua Chak

JoshuaChak

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

MediaTek-Research/Breeze-ASR-25

upvoted a paper 3 days ago

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs

Paper • 2601.01046 • Published 8 days ago • 11

upvoted a paper about 2 months ago

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Paper • 2511.11007 • Published Nov 14, 2025 • 15

upvoted an article about 2 months ago

We’re open-sourcing our text-to-image model and the process behind it

Article • Nov 12, 2025

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

upvoted 2 papers 5 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28, 2025 • 57

upvoted an article 6 months ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Article • Jul 9, 2025 • 759

upvoted a paper 6 months ago

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Paper • 2506.20639 • Published Jun 25, 2025 • 30

upvoted a paper 7 months ago

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published Jun 9, 2025 • 30

upvoted a paper 8 months ago

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27, 2025 • 38

upvoted a paper 10 months ago

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Paper • 2503.10625 • Published Mar 13, 2025 • 33

upvoted a collection 10 months ago

OLMo 2

Collection • Artifacts for the OLMo 2 release. • 35 items • Updated 18 days ago • 151

upvoted a paper 11 months ago

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published Feb 3, 2025 • 23

upvoted 5 papers about 1 year ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 158

upvoted 2 papers over 1 year ago

Autonomous Character-Scene Interaction Synthesis from Text Instruction

Paper • 2410.03187 • Published Oct 4, 2024 • 7

Presto! Distilling Steps and Layers for Accelerating Music Generation

Paper • 2410.05167 • Published Oct 7, 2024 • 18