Update README.md
Browse files
README.md
CHANGED
|
@@ -103,6 +103,17 @@ Gradient is accelerating AI transformation across industries. Our AI Foundry inc
|
|
| 103 |
|
| 104 |
Drop an email to [contact@gradient.ai](mailto:contact@gradient.ai)
|
| 105 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 106 |
## References
|
| 107 |
|
| 108 |
[1] Peng, Bowen, et al. "Yarn: Efficient context window extension of large language models." arXiv preprint arXiv:2309.00071 (2023).
|
|
|
|
| 103 |
|
| 104 |
Drop an email to [contact@gradient.ai](mailto:contact@gradient.ai)
|
| 105 |
|
| 106 |
+
## Citation:
|
| 107 |
+
|
| 108 |
+
```bibtex
|
| 109 |
+
@misc{gradientlongcontextllama3,
|
| 110 |
+
title={Llama 3 Gradient: A series of long context models},
|
| 111 |
+
author={Leonid Pekelis and Michael Feil and Forrest Moret and Mark Huang and Tiffany Peng},
|
| 112 |
+
year={2024},
|
| 113 |
+
url = {https://gradient.ai/blog/scaling-rotational-embeddings-for-long-context-language-models}
|
| 114 |
+
}
|
| 115 |
+
```
|
| 116 |
+
|
| 117 |
## References
|
| 118 |
|
| 119 |
[1] Peng, Bowen, et al. "Yarn: Efficient context window extension of large language models." arXiv preprint arXiv:2309.00071 (2023).
|