Community Blog & Articles

Community Articles

Introducing Falcon H1R 7B

The Optimal Architecture for Small Language Models

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

Uncensor any LLM with abliteration

Code a simple RAG from scratch

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

Deriving the PPO Loss from First Principles

Continuity as a First-Class System Property in Artificial Intelligence

Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models

KV Caching Explained: Optimizing Transformer Inference Efficiency

Small Language Models (SLM): A Comprehensive Overview

Diversity Vs Density: A data strategy comparison for fine-tuning VLMs

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

January 6, 2026

Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture

January 5, 2026

partnershipsnvidiarobotics

NVIDIA brings agents to life with DGX Spark and Reachy Mini

January 5, 2026

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

December 23, 2025

tokenizerstransformersopen-source

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+2

December 18, 2025

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

December 17, 2025

CUGA on Hugging Face: Democratizing Configurable AI Agents

December 15, 2025

New in llama.cpp: Model Management

December 11, 2025

llmfine-tuningopen-source

Codex is Open Sourcing AI models

December 11, 2025

swifthubopen-source

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

December 5, 2025

llmreasoningagents

DeepMath: A lightweight math reasoning Agent with smolagents

December 4, 2025

llmfine-tuningopen-source

We Got Claude to Fine-Tune an Open Source LLM

December 4, 2025

transformersv5community

Transformers v5: Simple model definitions powering the AI ecosystem

December 1, 2025

diffusersfluxquantization

Diffusers welcomes FLUX-2

+4

November 25, 2025

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Introducing Falcon H1R 7B

The Optimal Architecture for Small Language Models

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Building Autonomous Vehicles That Reason with the NVIDIA Alpamayo Open Ecosystem

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

Uncensor any LLM with abliteration

Code a simple RAG from scratch

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

Deriving the PPO Loss from First Principles

Continuity as a First-Class System Property in Artificial Intelligence

Understanding Low-Rank Adaptation (LoRA): A Revolution in Fine-Tuning Large Language Models

KV Caching Explained: Optimizing Transformer Inference Efficiency

Small Language Models (SLM): A Comprehensive Overview

Diversity Vs Density: A data strategy comparison for fine-tuning VLMs

View all articles