AI Disruption

AI Disruption

Share this post

AI Disruption
AI Disruption
Tencent, NVIDIA Launch Hybrid Models: Mamba-Transformer Rising?
Copy link
Facebook
Email
Notes
More

Tencent, NVIDIA Launch Hybrid Models: Mamba-Transformer Rising?

Tencent & NVIDIA adopt Mamba-Transformer hybrid models for faster AI. Will this architecture dominate next-gen LLMs? Explore key innovations.

Meng Li's avatar
Meng Li
Mar 24, 2025
∙ Paid
6

Share this post

AI Disruption
AI Disruption
Tencent, NVIDIA Launch Hybrid Models: Mamba-Transformer Rising?
Copy link
Facebook
Email
Notes
More
2
Share

"AI Disruption" Publication 5000 Subscriptions 20% Discount Offer Link.


Over the past one or two years, Transformer architecture has continuously faced challenges from emerging architectures.

Among the numerous non-transformer architectures, Mamba undoubtedly stands out as one with significant attention and promising subsequent development.

However, unlike the initial "irreconcilable" standoff at the time of its release, these two architectures seem to be moving toward convergence in recent times.

Last Friday, Tencent announced the official release of its self-developed deep reasoning model "Hunyuan T1," a powerful inference model capable of instant responses, rapid token generation, and excelling at processing ultra-long texts.

The reason it boasts these advantages is largely due to Tencent's adoption of a Hybrid-Mamba-Transformer architecture.

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Meng Li
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More