AI & ML interests

large scale real-robot-based benchmark platform of embodied intelligence

Recent Activity

AdinaY 
posted an update 1 day ago
AdinaY 
posted an update 5 days ago
view post
Post
1595
Ming-flash-omni Preview 🚀 Multimodal foundation model from AntGroup

inclusionAI/Ming-flash-omni-Preview

✨ Built on Ling-Flash-2.0: 10B total/6B active
✨ Generative segmentation-as-editing
✨ SOTA contextual & dialect ASR
✨ High-fidelity image generation
AdinaY 
posted an update 6 days ago
view post
Post
1636

Glyph 🔥 a framework that scales context length by compressing text into images and processing them with vision–language models, released by Z.ai.

Paper:https://huggingface.co/papers/2510.17800
Model:https://huggingface.co/zai-org/Glyph

✨ Compresses long sequences visually to bypass token limits
✨ Reduces computational and memory costs
✨ Preserves meaning through multimodal encoding
✨ Built on GLM-4.1V-9B-Base
AdinaY 
posted an update 10 days ago
view post
Post
2560
HunyuanWorld Mirror🔥a versatile feed forward model for universal 3D world reconstruction by Tencent

tencent/HunyuanWorld-Mirror

✨ Any prior in → 3D world out
✨ Mix camera, intrinsics, depth as priors
✨ Predict point clouds, normals, Gaussians & more in one pass
✨ Unified architecture for all 3D task
AdinaY 
posted an update 15 days ago
view post
Post
611
PaddleOCR VL🔥 0.9B Multilingual VLM by Baidu

PaddlePaddle/PaddleOCR-VL

✨ Ultra-efficient NaViT + ERNIE-4.5 architecture
✨ Supports 109 languages 🤯
✨ Accurately recognizes text, tables, formulas & charts
✨ Fast inference and lightweight for deployment
AdinaY 
posted an update 16 days ago
view post
Post
1765
Bee-8B 🐝 open 8B Multimodal LLM built on high quality data, released by
TencentHunyuan

Paper: Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs (2510.13795)
Model: https://huggingface.co/collections/Open-Bee/bee-8b-68ecbf10417810d90fbd9995

✨ Trained on Honey-Data-15M, a 15M-sample SFT corpus with dual-level CoT reasoning
✨ Backed by HoneyPipe, a transparent & reproducible open data curation suite
AdinaY 
posted an update 17 days ago
AdinaY 
posted an update 19 days ago
view post
Post
468
Ring-1T🔥 the trillion-parameter thinking model released by Ant group, the company behind Alipay

inclusionAI/Ring-1T

✨ 1T params (50B active)- MIT license
✨ 128K context (YaRN)
✨ RLVR, Icepop, and ASystem make trillion-scale RL stable
AdinaY 
posted an update 23 days ago
view post
Post
496
KAT-Dev-72B-Exp🔥 Kuaishou's ( the company behind Kring AI ) new open model for software engineering

Kwaipilot/KAT-Dev-72B-Exp

✨ 72B - Apache2.0
✨ Redesigned attention kernel & training engine for efficient context-aware RL
✨ 74.6% accuracy on SWE-Bench Verified
AdinaY 
posted an update 24 days ago
view post
Post
4400
At the close of the National Holiday🇨🇳, Antgroup drops a new SoTA model.

Ling-1T 🔥 the trillion-parameter flagship of the Ling 2.0 series.

inclusionAI/Ling-1T

✨1T total / 50B active params per token
✨20T+ reasoning-dense tokens (Evo-CoT)
✨128K context via YaRN
✨FP8 training: 15%+ faster, same precision as BF16
✨Hybrid Syntax-Function-Aesthetics reward for front-end & visual generation
  • 1 reply
·
AdinaY 
posted an update 27 days ago
AdinaY 
posted an update 30 days ago
view post
Post
602
New release from Ant Group 🔥

inclusionAI/ming-v2-68ddea4954413c128d706630

✨MingTok (Vision & Audio): continuous unified tokenizer, no quantization, preserves semantic & perceptual fidelity, enables faster convergence.

✨Ming-UniVision: MLLM unifying image understanding + generation, supports multi-round editing & visualized CoT.

✨Ming-UniAudio: unified speech LLM for ASR, TTS & free-form editing, integrates semantic + acoustic features for high-fidelity audio.
AdinaY 
posted an update about 1 month ago
view post
Post
549
🔥 September highlights from Chinese open source community

zh-ai-community/september-2025-china-open-source-highlights-68b55c9e757c439ad9dd6aba

✨ Massive releases from the two tech giants

- At Alibaba Cloud Summit, Qwen dropped at least 7 new series of models. ( some are not open sourced )
- Since June, Tencent has doubled down on open source, especially after Hunyuan gained traction

✨ Some of the community’s hottest models come from startups.

- Kimi K2-0905
- GLM v4.6
-OpenBMB MiniCPM 4.1

✨ New players are pushing hard!

- Baidu ERNIE & Qianfan: enterprise-ready focus
- Ant Group: MoE + low-activation; from small to trillion, from core to reasoning fast track
- Xiaomi MiMo: stands out with Any-to-Any audio models

✨ Robotics is joining the open-source wave

- Unitree released its first open-source model
- BAAI launched RoboBrain-X0, an open-source robotics model + dataset

👀 Each month brings cooler models. After the 8-day National Holiday, expect another wave before the end of the year.

Stay tuned!
AdinaY 
posted an update about 1 month ago
view post
Post
2785
GLM-4.6 is here🚀

zai-org/GLM-4.6

✨ 200K context window
✨ Superior coding & polished UI generation
✨ Stronger reasoning & tool use
✨ More capable agents & agent frameworks