Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

jhuclsp

JHU-CLSP

Recent Activity

TaiMingLu authored a paper 12 days ago

Stronger Normalization-Free Transformers

orionweller new activity 13 days ago

jhu-clsp/mmBERT-decay-data:Update README: Fix TiQuAD's language name to Tigrinya

TaiMingLu authored a paper about 1 month ago

World-in-World: World Models in a Closed-Loop World

Papers

Genomic Next-Token Predictors are In-Context Learners

Controlled Generation for Private Synthetic Text

Collections 3

Spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

Models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17 • 11.3k • 57

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7 • 324k • 170

jhu-clsp/mmBERT-checkpoints

Updated Sep 9 • 3

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21 • 268 • 4

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18 • 220

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18 • 485 • 21

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18 • 185 • 3

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18 • 13

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18 • 18.7k • 8

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18 • 203 • 2

Datasets 38

jhu-clsp/mmBERT-decay-data

Updated 13 days ago • 14.1k • 3

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13 • 32.7k • 1

jhu-clsp/megawika-2

Updated Sep 3 • 9.01k • 2

jhu-clsp/ettin-pretraining-data

Updated Jul 18 • 21.1k • 8

jhu-clsp/ettin-decay-data

Updated Jul 18 • 2.84k • 1

jhu-clsp/astro-llms-benchmark-dataset

Viewer • Updated Jul 16 • 40 • 38

jhu-clsp/astro-llms-full-query-data

Viewer • Updated Jul 16 • 368 • 37

jhu-clsp/ettin-extension-data

Updated Jul 16 • 2.16k

jhu-clsp/ettin-data-order

Viewer • Updated Jul 16 • 3B • 8 • 1

jhu-clsp/rank1-R1-MSMARCO

Viewer • Updated Feb 26 • 635k • 45 • 2
