jj
tftf
AI & ML interests
None yet
Organizations
None yet
interesting
- Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
  Paper • 2404.18796 • Published • 71
- KAN: Kolmogorov-Arnold Networks
  Paper • 2404.19756 • Published • 115
- The Road Less Scheduled
  Paper • 2405.15682 • Published • 26
- Your Transformer is Secretly Linear
  Paper • 2405.12250 • Published • 157
important
Potential Training
- Better & Faster Large Language Models via Multi-token Prediction
  Paper • 2404.19737 • Published • 81
- Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3
  Paper • 2405.00664 • Published • 20
- LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
  Paper • 2405.00732 • Published • 122
useful
RAG
image