AI & ML interests

Hugging Face Inference Endpoints Images repository allows AI Builders to collaborate and engage creating awesome inference deployments

Recent Activity

AdinaY 
posted an update 1 day ago
AdinaY 
posted an update 6 days ago
view post
Post
1596
Ming-flash-omni Preview 🚀 Multimodal foundation model from AntGroup

inclusionAI/Ming-flash-omni-Preview

✨ Built on Ling-Flash-2.0: 10B total/6B active
✨ Generative segmentation-as-editing
✨ SOTA contextual & dialect ASR
✨ High-fidelity image generation
AdinaY 
posted an update 6 days ago
view post
Post
1637

Glyph 🔥 a framework that scales context length by compressing text into images and processing them with vision–language models, released by Z.ai.

Paper:https://huggingface.co/papers/2510.17800
Model:https://huggingface.co/zai-org/Glyph

✨ Compresses long sequences visually to bypass token limits
✨ Reduces computational and memory costs
✨ Preserves meaning through multimodal encoding
✨ Built on GLM-4.1V-9B-Base
AdinaY 
posted an update 11 days ago
view post
Post
2561
HunyuanWorld Mirror🔥a versatile feed forward model for universal 3D world reconstruction by Tencent

tencent/HunyuanWorld-Mirror

✨ Any prior in → 3D world out
✨ Mix camera, intrinsics, depth as priors
✨ Predict point clouds, normals, Gaussians & more in one pass
✨ Unified architecture for all 3D task
AdinaY 
posted an update 15 days ago
view post
Post
612
PaddleOCR VL🔥 0.9B Multilingual VLM by Baidu

PaddlePaddle/PaddleOCR-VL

✨ Ultra-efficient NaViT + ERNIE-4.5 architecture
✨ Supports 109 languages 🤯
✨ Accurately recognizes text, tables, formulas & charts
✨ Fast inference and lightweight for deployment
AdinaY 
posted an update 17 days ago
view post
Post
1766
Bee-8B 🐝 open 8B Multimodal LLM built on high quality data, released by
TencentHunyuan

Paper: Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs (2510.13795)
Model: https://huggingface.co/collections/Open-Bee/bee-8b-68ecbf10417810d90fbd9995

✨ Trained on Honey-Data-15M, a 15M-sample SFT corpus with dual-level CoT reasoning
✨ Backed by HoneyPipe, a transparent & reproducible open data curation suite
AdinaY 
posted an update 17 days ago
AdinaY 
posted an update 20 days ago
view post
Post
468
Ring-1T🔥 the trillion-parameter thinking model released by Ant group, the company behind Alipay

inclusionAI/Ring-1T

✨ 1T params (50B active)- MIT license
✨ 128K context (YaRN)
✨ RLVR, Icepop, and ASystem make trillion-scale RL stable
AdinaY 
posted an update 23 days ago
view post
Post
496
KAT-Dev-72B-Exp🔥 Kuaishou's ( the company behind Kring AI ) new open model for software engineering

Kwaipilot/KAT-Dev-72B-Exp

✨ 72B - Apache2.0
✨ Redesigned attention kernel & training engine for efficient context-aware RL
✨ 74.6% accuracy on SWE-Bench Verified
AdinaY 
posted an update 24 days ago
view post
Post
4400
At the close of the National Holiday🇨🇳, Antgroup drops a new SoTA model.

Ling-1T 🔥 the trillion-parameter flagship of the Ling 2.0 series.

inclusionAI/Ling-1T

✨1T total / 50B active params per token
✨20T+ reasoning-dense tokens (Evo-CoT)
✨128K context via YaRN
✨FP8 training: 15%+ faster, same precision as BF16
✨Hybrid Syntax-Function-Aesthetics reward for front-end & visual generation
  • 1 reply
·
evijit 
posted an update 27 days ago
view post
Post
2502
AI for Scientific Discovery Won't Work Without Fixing How We Collaborate.

My co-author @cgeorgiaw and I just published a paper challenging a core assumption: that the main barriers to AI in science are technical. They're not. They're social.

Key findings:

🚨 The "AI Scientist" myth delays progress: Waiting for AGI devalues human expertise and obscures science's real purpose: cultivating understanding, not just outputs.
📊 Wrong incentives: Datasets have 100x longer impact than models, yet data curation is undervalued.
⚠️ Broken collaboration: Domain scientists want understanding. ML researchers optimize performance. Without shared language, projects fail.
🔍 Fragmentation costs years: Harmonizing just 9 cancer files took 329 hours.

Why this matters: Upstream bottlenecks like efficient PDE solvers could accelerate discovery across multiple sciences. CASP mobilized a community around protein structure, enabling AlphaFold. We need this for dozens of challenges.

Thus, we're launching Hugging Science! A global community addressing these barriers through collaborative challenges, open toolkits, education, and community-owned infrastructure. Please find all the links below!

Paper: AI for Scientific Discovery is a Social Problem (2509.06580)
Join: hugging-science
Discord: https://discord.com/invite/VYkdEVjJ5J
AdinaY 
posted an update 27 days ago
AdinaY 
posted an update 30 days ago
view post
Post
602
New release from Ant Group 🔥

inclusionAI/ming-v2-68ddea4954413c128d706630

✨MingTok (Vision & Audio): continuous unified tokenizer, no quantization, preserves semantic & perceptual fidelity, enables faster convergence.

✨Ming-UniVision: MLLM unifying image understanding + generation, supports multi-round editing & visualized CoT.

✨Ming-UniAudio: unified speech LLM for ASR, TTS & free-form editing, integrates semantic + acoustic features for high-fidelity audio.
AdinaY 
posted an update about 1 month ago
view post
Post
549
🔥 September highlights from Chinese open source community

zh-ai-community/september-2025-china-open-source-highlights-68b55c9e757c439ad9dd6aba

✨ Massive releases from the two tech giants

- At Alibaba Cloud Summit, Qwen dropped at least 7 new series of models. ( some are not open sourced )
- Since June, Tencent has doubled down on open source, especially after Hunyuan gained traction

✨ Some of the community’s hottest models come from startups.

- Kimi K2-0905
- GLM v4.6
-OpenBMB MiniCPM 4.1

✨ New players are pushing hard!

- Baidu ERNIE & Qianfan: enterprise-ready focus
- Ant Group: MoE + low-activation; from small to trillion, from core to reasoning fast track
- Xiaomi MiMo: stands out with Any-to-Any audio models

✨ Robotics is joining the open-source wave

- Unitree released its first open-source model
- BAAI launched RoboBrain-X0, an open-source robotics model + dataset

👀 Each month brings cooler models. After the 8-day National Holiday, expect another wave before the end of the year.

Stay tuned!
AdinaY 
posted an update about 1 month ago
view post
Post
2785
GLM-4.6 is here🚀

zai-org/GLM-4.6

✨ 200K context window
✨ Superior coding & polished UI generation
✨ Stronger reasoning & tool use
✨ More capable agents & agent frameworks
AdinaY 
posted an update about 1 month ago
view post
Post
411
MOSS-Speech 🔊 bilingual native speech-to-speech model, from Fudan University.

fnlp/moss-speech-68dbab23bc98501afede0cd3

✨ Supports Chinese & English
✨ Layer-splitting architecture + frozen pretraining
✨ Preserves tone, emotion & prosody
AdinaY 
posted an update about 1 month ago
view post
Post
421
RoboBrain-X0- Preview 🤖 a unified cross-embodiment VLA model from
BAAI.

BAAI/robobrain-x0-68db67d3542e04c5d99f31f9

✨Zero-shot generalization across heterogeneous robots
✨Complex task decomposition & embodied reasoning
✨Unified Action Vocabulary + OmniSAT tokenizer
✨End-to-end: perception > reasoning > execution
✨Full version coming soon 🔥
AdinaY 
posted an update about 1 month ago
view post
Post
1629
Ring-1T-preview 🔥 1T thinking model released by Ant Group.

inclusionAI/Ring-1T-preview

✨ MoE architecture + 20T tokens + RLVR via ASystem
✨ Strong natural language reasoning (AIME’25: 92.6, close to GPT-5)
✨IMO tests: advanced problem-solving & reasoning
AdinaY 
posted an update about 1 month ago