ISCA 2024: TCP -- Tensor Contraction Processor for AI Workloads

  Рет қаралды 281

FuriosaAI

FuriosaAI

5 күн бұрын

FuriosaAI CTO Hanjoon Kim presents “TCP: A Tensor Contraction Processor for AI Workloads" at ISCA 2024, the annual International Symposium on Computer Architecture.
The Tensor Contraction Processor (TCP) is a novel chip architecture that delivers several noteworthy technical innovations which make it easier to program and optimize, while also enabling greater data reuse and energy efficiency.
TCP is the underlying architecture used in Furiosa's second-gen chip, RNGD, which is designed to accelerate inference with a wide range of models -- in particular, LLMs and multi-modal models. RNGD (pronounced "Renegade") is designed with a 150W TDP and utilizes 48GB of the latest HBM3 memory.
RNGD supports BF16 to directly handle floating-point models, and it also provides precision options for quantization, such as INT8/INT4 and FP8.
Learn more and sign up for updates: furiosa.ai/
Paper: furiosa.ai/download/FuriosaAI-...
Blog post: furiosa.ai/blog/tensor-contra...

Пікірлер
AI’s Hardware Problem
16:47
Asianometry
Рет қаралды 619 М.
RISC-V 2024 Update: RISE, AI Accelerators & More
14:03
ExplainingComputers
Рет қаралды 83 М.
I wish I could change THIS fast! 🤣
00:33
America's Got Talent
Рет қаралды 98 МЛН
Survival skills: A great idea with duct tape #survival #lifehacks #camping
00:27
THEY made a RAINBOW M&M 🤩😳 LeoNata family #shorts
00:49
LeoNata Family
Рет қаралды 12 МЛН
eBPF: Unlocking the Kernel [OFFICIAL DOCUMENTARY]
30:00
Speakeasy Productions
Рет қаралды 88 М.
Are LLMs Just Databases? The Real Story + Apple AI Predictions
59:39
Navarre Training
Рет қаралды 1,7 М.
What is RAG? (Retrieval Augmented Generation)
11:37
Don Woodlock
Рет қаралды 105 М.
AI Hardware, Explained.
15:24
a16z
Рет қаралды 20 М.
A Systematic Approach To Designing AI Accelerator Hardware
10:49
Why the Future of AI & Computers Will Be Analog
17:36
Undecided with Matt Ferrell
Рет қаралды 532 М.
Explaining Distributed Systems Like I'm 5
12:40
HashiCorp
Рет қаралды 34 М.
How Chips That Power AI Work | WSJ Tech Behind
6:29
The Wall Street Journal
Рет қаралды 348 М.