
Alfaxad Eyembe PRO

Alfaxad

https://alfaxad.com

AI & ML interests

AI, Robotics

Recent Activity

liked a Space about 18 hours ago

InstaDeepAI/ntv3

liked a dataset 3 days ago

openai/frontierscience

liked a dataset 7 days ago

OpenMol/ChemCoTBench


liked a Space about 18 hours ago

NTv3 — Foundation Models for Long-Range Genomics

🧬

Generate genomic sequences and analyze data using NTv3 models

liked a dataset 3 days ago

openai/frontierscience

Viewer • Updated 8 days ago • 160 • 4.62k • 116

liked 4 datasets 7 days ago

reacted to danielhanchen's post with 🤗 9 days ago

Post

5156

NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! 🔥

It has a 1M-token context window and best-in-class performance on SWE-Bench, reasoning, and chat. The MoE model runs locally with 24GB of RAM.

GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF
💚 Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3


liked a dataset 9 days ago

OpenMed/Medical-Reasoning-SFT-GPT-OSS-120B

Viewer • Updated 12 days ago • 200k • 2.74k • 199

liked a model 10 days ago

openai/circuit-sparsity

Text Generation • 0.4B • Updated 12 days ago • 1.91k • 186

liked a model 11 days ago

NousResearch/nomos-1

Text Generation • 31B • Updated 5 days ago • 1.15k • 128

updated a model 12 days ago

Nadhari/swa-csm-1b

Text-to-Speech • Updated 12 days ago • 223 • 3

reacted to IliaLarchenko's post with 🔥 13 days ago

Post

1035

🏆 BEHAVIOR Challenge 1st Place – Solution Summary

My team recently won 1st place in the BEHAVIOR Challenge at NeurIPS.
The competition focused on training a single policy to complete 50 long-horizon household tasks in simulation.

We built an end-to-end policy based on Pi0.5 with several custom modifications. Everything is open-sourced and should be useful for anyone exploring VLAs or adapting them to specific tasks.

Key Architecture Changes:
- Replaced language model with 50 trainable task embeddings (no text at all)
- Correlated noise for Flow Matching: ϵ ∼ N(0, 0.5I + 0.5Σ) using dataset action covariance
- Learnable mixed-layer attention: each action expert layer attends to a trainable mix of all VLM layers
- System 2 stage tracking: model predicts task stage, we smooth it with voting and feed it back as context
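
The correlated-noise item above can be illustrated with a minimal sketch. It is not the team's code: it assumes Σ is a small symmetric positive-definite covariance estimated from dataset actions, and the helper names (`mixed_covariance`, `cholesky`, `sample_correlated_noise`) and the `beta` parameter are hypothetical. It draws ϵ ∼ N(0, 0.5I + 0.5Σ) by factoring the mixed covariance and transforming standard normal draws.

```python
import math
import random


def mixed_covariance(sigma, beta=0.5):
    """Return (1 - beta) * I + beta * sigma for a square matrix sigma."""
    n = len(sigma)
    return [
        [(1.0 - beta) * (1.0 if i == j else 0.0) + beta * sigma[i][j] for j in range(n)]
        for i in range(n)
    ]


def cholesky(a):
    """Lower-triangular L with L @ L.T == a (a must be symmetric positive-definite)."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def sample_correlated_noise(sigma, rng, beta=0.5):
    """Draw one sample from N(0, (1 - beta) * I + beta * sigma)."""
    low = cholesky(mixed_covariance(sigma, beta))
    z = [rng.gauss(0.0, 1.0) for _ in range(len(sigma))]
    # Multiplying i.i.d. standard normals by L gives covariance L @ L.T.
    return [sum(low[i][k] * z[k] for k in range(i + 1)) for i in range(len(sigma))]


if __name__ == "__main__":
    # Hypothetical 2-D action covariance, for illustration only.
    toy_sigma = [[1.0, 0.8], [0.8, 1.0]]
    print(sample_correlated_noise(toy_sigma, random.Random(0)))
```

Mixing in the identity keeps the covariance well-conditioned even when the dataset covariance is close to singular, which is one plausible reason for the 0.5/0.5 split.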

Training:
- Multi-sample Flow Matching: 15 FM samples per VLM pass to reduce gradient variance
- Delta action space + per-timestamp normalization
- FAST auxiliary loss and stage prediction loss
- Trained on 224×224 RGB + proprioception only
- We use 4 fine-tuned checkpoints, all derived from a multi-task model trained on all 50 tasks
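
One way to read "delta action space + per-timestamp normalization" is sketched below. This is an assumption, not the released implementation: actions in a chunk are expressed as offsets from the current state, and each position in the chunk gets its own mean and standard deviation. All function names are hypothetical.

```python
import statistics


def to_delta(chunk, current):
    """Express each absolute action in a chunk as an offset from the current state."""
    return [[a - c for a, c in zip(action, current)] for action in chunk]


def fit_per_timestep_stats(delta_chunks):
    """Compute (mean, std) separately for every (timestep, dimension) slot."""
    horizon = len(delta_chunks[0])
    dim = len(delta_chunks[0][0])
    stats = []
    for t in range(horizon):
        row = []
        for d in range(dim):
            values = [chunk[t][d] for chunk in delta_chunks]
            row.append((statistics.fmean(values), statistics.pstdev(values)))
        stats.append(row)
    return stats


def normalize(delta_chunk, stats, eps=1e-6):
    """Standardize a delta chunk with the statistics of its own timestep slots."""
    return [
        [(x - mean) / (std + eps) for x, (mean, std) in zip(step, row)]
        for step, row in zip(delta_chunk, stats)
    ]


if __name__ == "__main__":
    # Two toy chunks (horizon 2, one action dimension), already in delta form.
    chunks = [[[1.0], [2.0]], [[3.0], [6.0]]]
    stats = fit_per_timestep_stats(chunks)
    print(normalize(chunks[0], stats))
```

Offsets from the current state tend to spread out further at later positions in a chunk, so separate statistics per position keep every slot on a comparable scale.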

Inference Optimizations:
- Soft inpainting: predict 30 actions, execute 26, use the remaining 4 as input for the next chunk
- Correlation-aware guidance of inpainting to keep action chunks smooth
- 1.3× speedup via cubic spline compression
- General correction rule: reopen gripper after failed grasps
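
The chunk bookkeeping behind the soft-inpainting item (predict 30, execute 26, carry 4 forward) can be sketched as follows. Only the loop structure is shown. How the carried-over actions actually guide the next prediction is not modeled, and `predict_chunk` and `toy_predict_chunk` are hypothetical stand-ins for the policy.

```python
HORIZON = 30                  # actions predicted per chunk
EXECUTE = 26                  # actions actually executed from each chunk
OVERLAP = HORIZON - EXECUTE   # 4 actions handed to the next prediction


def run_chunked(predict_chunk, num_chunks):
    """Execute chunks back to back, passing the unexecuted tail forward."""
    executed = []
    prefix = []  # nothing to condition on before the first chunk
    for _ in range(num_chunks):
        chunk = predict_chunk(prefix)
        if len(chunk) != HORIZON:
            raise ValueError("policy must return HORIZON actions")
        executed.extend(chunk[:EXECUTE])
        prefix = chunk[EXECUTE:]  # the last OVERLAP actions become the next input
    return executed


def toy_predict_chunk(prefix):
    """Hypothetical policy: continues an integer sequence from the given prefix."""
    start = prefix[0] if prefix else 0
    return list(range(start, start + HORIZON))


if __name__ == "__main__":
    actions = run_chunked(toy_predict_chunk, num_chunks=2)
    print(len(actions))
```

In the toy policy, each new chunk begins with the actions carried over from the previous one, so the executed sequence stays continuous across chunk boundaries.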

🔗 Code and Models:
- Code: https://github.com/IliaLarchenko/behavior-1k-solution
- Weights: IliaLarchenko/behavior_submission
- Paper: Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge (2512.06951)