Bowen Pan

bpan

https://people.csail.mit.edu/bpan/

AI & ML interests

Efficient LLM, Mixture-of-Experts, Embodied AI, Dynamic Neural Network

Recent Activity

upvoted a paper about 1 month ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19 • 52

authored a paper about 1 month ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19 • 52

upvoted an article over 1 year ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 122

upvoted a collection over 1 year ago

Granite Code Models

Collection

A series of code models trained by IBM and released under the Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 22 days ago • 199

updated a model over 1 year ago

bpan/LangNav-Sim2k-Llama2

Text Generation • Updated Jun 13, 2024 • 9

upvoted 2 articles over 1 year ago

Article

Saving Memory Using Padding-Free Transformer Layers during Finetuning

Jun 11, 2024

• 20

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

Apr 9, 2024

• 30

published an article over 1 year ago

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

Apr 9, 2024

• 30

authored a paper about 2 years ago

LangNav: Language as a Perceptual Representation for Navigation

Paper • 2310.07889 • Published Oct 11, 2023 • 6

updated a model over 2 years ago

bpan/vit-base-patch16-224-in21k-finetuned-lora-food101

Updated Mar 12, 2023
