Spaces:

dirkraft
/

fuhgedaboudit

Paused

danemery-ai2 commited on Jul 18

Commit

ac9171f

unverified ·

1 Parent(s): 2972be9

Add column description tooltip icon to leaderboard legend (#11)

Files changed (2) hide show

content.py CHANGED Viewed

@@ -287,4 +287,38 @@ html:not(.dark) #legend-markdown .light-mode-icon,
 #legend-markdown .light-mode-icon, #legend-markdown .dark-mode-icon {
     display: none;
 }
 """

 #legend-markdown .light-mode-icon, #legend-markdown .dark-mode-icon {
     display: none;
 }
+/* Column description tooltip styles */
+#legend-markdown,
+#leaderboard-accordion {
+    overflow: visible !important;
+}
+.tooltip-icon {
+    display: inline-block;
+    margin-left: 6px;
+    cursor: help;
+    position: relative;
+}
+.tooltip-icon::after {
+    content: attr(data-tooltip);
+    position: absolute;
+    bottom: 125%;
+    background-color: #333;
+    color: #fff;
+    padding: 12px 16px;
+    border-radius: 4px;
+    font-size: 12px;
+    opacity: 0;
+    transition: opacity 0.2s;
+    white-space: pre-line;
+    width: 500px;
+    text-align: left;
+    pointer-events: none;
+}
+.tooltip-icon:hover::after {
+    opacity: 1;
+}
 """

ui_components.py CHANGED Viewed

@@ -175,6 +175,21 @@ legend_markdown = f"""
         <div style="display: flex; flex-wrap: wrap; align-items: center; gap: 16px; margin-top: 4px;">{tooling_html}</div>
     </div>
 </div>
 """

         <div style="display: flex; flex-wrap: wrap; align-items: center; gap: 16px; margin-top: 4px;">{tooling_html}</div>
     </div>
+    <div><b>Column Descriptions</b><span class="tooltip-icon" data-tooltip="• Pareto: Indicates if agent is on the Pareto frontier
+        • Openness: Level of accessibility to model and implementation
+        • Agent Tooling: Approach used by the agent
+        • Agent: Name of the AI agent
+        • Overall Score: Performance across all benchmarks
+        • Overall Cost: Cost per task in USD
+        • Literature Understanding Score: Performance on scientific literature tasks
+        • Literature Understanding Cost: Cost per literature understanding task in USD
+        • Data Analysis Score: Performance on data analysis tasks
+        • Code Execution Score: Performance on coding tasks
+        • Code Execution Cost: Cost per code execution task in USD
+        • Discovery Score: Performance on information discovery tasks
+        • Discovery Cost: Cost per discovery task in USD
+        • Categories Attempted: Number of benchmark categories the agent participated in
+        • Logs: Link to detailed evaluation logs">ⓘ</span></div>
 </div>
 """