ldwang's picture

ldwang

ldwang

·

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

liked a model about 6 hours ago

BlinkDL/rwkv7-g1

liked a model 1 day ago

moonshotai/Kimi-Linear-48B-A3B-Instruct

upvoted a collection 3 days ago

View all activity

Organizations

authored a paper 3 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 17

authored 2 papers 5 months ago

Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models

Paper • 2506.11116 • Published Jun 9 • 4

CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

Paper • 2506.07463 • Published Jun 9 • 10

authored 5 papers about 1 year ago

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Paper • 2410.18505 • Published Oct 24, 2024 • 11

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 19

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 95

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Paper • 2408.06567 • Published Aug 13, 2024 • 2

Aquila2 Technical Report

Paper • 2408.07410 • Published Aug 14, 2024 • 15