LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37 User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37
User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
Leaderboards Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 6.85k MTEB Leaderboard ๐ฅ 6.85k Embedding Leaderboard Running on CPU Upgrade 13.7k Open LLM Leaderboard ๐ 13.7k Track, rank and evaluate open LLMs and chatbots Running 4.7k LMArena Leaderboard ๐ 4.7k Display LMArena Leaderboard
Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 13.7k Open LLM Leaderboard ๐ 13.7k Track, rank and evaluate open LLMs and chatbots
LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37 User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper โข 2402.01622 โข Published Feb 2, 2024 โข 37
User-LLM: Efficient LLM Contextualization with User Embeddings Paper โข 2402.13598 โข Published Feb 21, 2024 โข 21
Leaderboards Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 6.85k MTEB Leaderboard ๐ฅ 6.85k Embedding Leaderboard Running on CPU Upgrade 13.7k Open LLM Leaderboard ๐ 13.7k Track, rank and evaluate open LLMs and chatbots Running 4.7k LMArena Leaderboard ๐ 4.7k Display LMArena Leaderboard
Running Featured 563 Image Arena Leaderboard ๐ 563 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade 13.7k Open LLM Leaderboard ๐ 13.7k Track, rank and evaluate open LLMs and chatbots