Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
andre
achang
Follow
jremmy's profile picture
1 follower
·
6 following
Andrechang
AI & ML interests
None yet
Recent Activity
reacted
to
mindchain
's
post
with 🤝
5 days ago
The Architecture of 2026: Beyond the Token Trap 🚀 We are witnessing a tectonic shift in Transformer architecture. It’s no longer just about "predicting the next token"—it’s about executing latent plans on a high-speed data highway. What happens when we combine DeepSeek’s stability with Google’s strategic intelligence? 1️⃣ The Infrastructure: DeepSeek’s mHC Moving from a single-lane residual stream to a multi-lane highway. Using the Birkhoff Polytope, mHC ensures mathematical stability (Identity Mapping) while routing specialized data through dedicated lanes. 2️⃣ The Intelligence: Google’s Meta-Controller An internal AI unit that lives inside the Transformer. It escapes the "Token Trap" by extracting data to create a latent plan, steering the model via Temporal Abstraction. The Synergy: In a Topological Transformer, the Meta-Controller finally has the "dedicated lanes" it needs to steer complex reasoning without causing gradient explosions. We aren't just making models bigger; we are making them architecturally smarter. 🧠 #MachineLearning #DeepSeek #GoogleAI #Transformer #AIArchitecture
updated
a dataset
23 days ago
achang/atradebot_collect_09_to_12_2025
published
a dataset
23 days ago
achang/atradebot_collect_09_to_12_2025
View all activity
Organizations
achang
's models
7
Sort: Recently updated
achang/llama-3.2-3b_lora
Updated
Jan 12, 2025
achang/fin_dolly7b_one_nvda_v3_weekly
Updated
Aug 17, 2023
achang/fin_dolly7b_one_nvda_v2
Updated
Aug 17, 2023
achang/fin_gpt2_one_nvda_v2
Text Generation
•
Updated
Aug 16, 2023
•
12
achang/fin_gpt2_one_nvda
Text Generation
•
Updated
Aug 13, 2023
•
6
achang/fin_alloc_small0
Text Generation
•
Updated
Aug 6, 2023
•
7
achang/fin_forecast_0
Updated
Jul 14, 2023