Apple embraces Nvidia GPUs to accelerate LLM inference via its open source ReDrafter tech

  • ReDrafter delivers 2.7x more tokens per second than traditional auto-regressive decoding
  • ReDrafter could reduce latency for users while using fewer GPUs
  • Apple hasn’t said when ReDrafter will be deployed on rival AI GPUs from AMD and Intel

Apple has announced a collaboration with Nvidia to accelerate large language model (LLM) inference using Apple's open source technology, Recurrent Drafter (ReDrafter for short).

The partnership aims to address the computational cost of auto-regressive token generation, a key bottleneck for efficiency and latency in real-time LLM applications.
