The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models Paper • 2510.13996 • Published 14 days ago
Post 421 Something very cool is cooking at Lichess
wikimedia-community/scholarly-article-citations-in-wikipedia Viewer • Updated May 28 • 1.84M • 115 • 1
Post 2103 The folks at Foursquare released a dataset of 104.5 million places of interest (foursquare/fsq-os-places) and here's all of them on a plot
Post 2440 The Lichess database of games, puzzles, and engine evaluations is now on the Hub: Lichess. Billions of chess data points to download, query, and stream, and we're excited to see what you'll build with it!
- https://huggingface.co/collections/Lichess/positions-datasets-66f50837db5cd3287d60d489
- https://huggingface.co/collections/Lichess/games-datasets-66f508df78f4b43e1bb2d353