redditalpha logoredditalpha
← Back to dashboard
Share
31100%
r/babar/baba· u/Beneficial-Ice-6164· 24d agoNews 13

Alibaba Cloud has launched its newest AI chip, the Zhenwu M890, for training and inference tasks.

Investor summaryBullish

Alibaba Cloud launches its new Zhenwu M890 AI chip for training and inference.

Bull points
  • Alibaba Cloud is expanding its AI infrastructure capabilities with proprietary chips.
  • The new Zhenwu M890 chip supports both training and inference, enhancing their AI service offerings.
BABA半导体
Post body

https://preview.redd.it/sx65w6abj72h1.png?width=759&format=png&auto=webp&s=d51cf88e02824bd788148beacf1047fe61cb7473

Discussion · top comments13 selected
u/PepinoCholula 9· 24d ago

Stock down

u/Beneficial-Ice-6164 -1· 24d ago

relax this news is still fresh..it hasnt hit the mainstream news yet

u/Available_Chapter685 7· 24d ago

Hedgies pick this news up almost instantly btw

u/uedison728 4· 24d ago

From China banning Nvidia chips, the path is clear, they want to do themselves without US.

u/mojitosupreme 2· 24d ago

I see you also use CN Wire. A man of taste.

u/Ok_Side_2564 2· 24d ago

News right before Nvidia earnings? Good: More memory than H20. Fp4 support.

u/Inside_Radio8996 2· 24d ago

Is this causing the turnaround

https://preview.redd.it/cuuv46rso72h1.jpeg?width=1080&format=pjpg&auto=webp&s=fc7ddb9fd6db0f3721acd87f12f8248d64ae6e52

u/Beneficial-Ice-6164 2· 24d ago

yes AliCloud is holding an event now..expecting more new releases today

u/R3tardod 2· 24d ago

Is it good?

u/mlnet 3· 24d ago

Eh, depends on the price. 800GB/s memory bandwidth (what determines how fast LLMs can generate a response) is what an Apple Mac Studio with M3 Ultra chip has. This is 7 times slower than AMD's Instinct accelerator chips.

But, they're bought by the pallet, and the chips clustered together can be quite powerful--enough to run a multinational's token generation needs for an agentic inference platform, e.g. 5 trillion tokens a year. And yes, unfortunately large enterprises actually budget by a goal for decode tokens per year (it's a stupid metric).

So for the right price, this can be a good thing from Alibaba's T-Head arm that makes chips.

It's a real foot in the door as companies shift to GPU-based computing to replace headcount. Combined with closed-source Qwen models, it's a compelling offer; enterprises are looking for a one-stop shop. It's what makes Microsoft's GitHub Copilot so wildly successful in the US.

And just an aside: Qwen 3.6 is fantastic. I host the dense and mixture of experts models and they actually cut down on token use because they get it right the first time or two without needing much re-prompting.

u/Weikoko 1· 24d ago

Yes and No.

u/Comfortable-Ear1525 1· 24d ago

Jeez what a garbage stock she is

u/Comfortable-Ear1525 1· 24d ago

Is this bullish?