Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ldwang 's Collections
MiscSpaces
MiscAgentic
MiscIndustry
MiscKernel
MiscR1
MiscModels
MiscDatasets
MiscTools

MiscSpaces

updated Sep 25
Upvote
1

  • Running
    587
    587

    Scaling test-time compute

    πŸ“ˆ

    Implement test-time compute scaling for math problems


  • Running
    1.11k
    1.11k

    FineWeb: decanting the web for the finest text data at scale

    🍷

    Generate high-quality text data for LLMs using FineWeb


  • Running
    3.35k
    3.35k

    The Ultra-Scale Playbook

    🌌

    The ultimate guide to training LLM on large GPU Clusters


  • Running
    188
    188

    FineVision: Open Data is All You Need

    πŸ“

    A new open-source dataset for training VLMs


  • Running
    19
    19

    Megatron Memory Estimator

    πŸ‘

    Estimate GPU memory usage for Megatron models


  • Running on Zero
    15
    15

    Smol2Operator Demo

    🐒

    Smol2Operator Demo: GUI Agent Model

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs