Commit
·
b4f09c6
1
Parent(s):
5cfe54a
Update README.md
README.md
CHANGED
|
@@ -7,4 +7,14 @@ sdk: static
|
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
-
|
| 10 |
+
|
| 11 |
+
**LLM360 is an initiative to open-source large language models (LLMs) that foster transparency, trust, and collaborative research.**
|
| 12 |
+
|
| 13 |
+
LLM360 releases complete training details including all of the following artifacts associated with each model:
|
| 14 |
+
|
| 15 |
+
* Frequent Intermediate Model Checkpoints: During training, model parameters and optimizer states are collected regularly. These artifacts can offer valuable insights for studying LLM training dynamics and how they scale with data.
|
| 16 |
+
* Training Data with Full Data Sequence: The entire training dataset is fully disclosed and made publicly available. In addition, the full sequence of data examples used at each training step is released.
|
| 17 |
+
* Source Code: All the code used, including data processing, training, evaluation, and analysis.
|
| 18 |
+
* Logs and Metrics: All the training logs and evaluations done during training and using the final model are publicly disclosed.
|
| 19 |
+
|
| 20 |
+
LLM360 is a collaboration between Petuum, MBZUAI, and Cerebras.
|