Apple skips Nvidia's GPUs for its AI models, uses thousands of Google TPUs instead (2024)

Apple skips Nvidia's GPUs for its AI models, uses thousands of Google TPUs instead (1)

Apple has revealed that it didn’t use Nvidia’s hardware accelerators to develop its recently revealed Apple Intelligence features. According to an official Apple research paper (PDF), it instead relied on Google TPUs to crunch the training data behind the Apple Intelligence Foundation Language Models.

Systems packing Google TPUv4 and TPUv5 chips were instrumental to the creation of the Apple Foundation Models (AFMs). These models, AFM-server and AFM-on-device models, were designed to power online and offline Apple Intelligence features which were heralded back at WWDC 2024 in June.

Apple skips Nvidia's GPUs for its AI models, uses thousands of Google TPUs instead (2)

AFM-server is Apple’s biggest LLM, and thus it remains online only. According to the recently released research paper, Apple’s AFM-server was trained on 8,192 TPUv4 chips “provisioned as 8 × 1,024 chip slices, where slices are connected together by the data-center network (DCN).” Pre-training was a triple-stage process, starting with 6.3T tokens, continuing with 1T tokens, and then context-lengthening using 100B tokens.

Apple said the data used to train its AFMs included info gathered from the Applebot web crawler (heeding robots.txt) plus various licensed “high-quality” datasets. It also leveraged carefully chosen code, math, and public datasets.

Of course, the ARM-on-device model is significantly pruned, but Apple reckons its knowledge distillation techniques have optimized this smaller model’s performance and efficiency. The paper reveals that AFM-on-device is a 3B parameter model, distilled from the 6.4B server model, which was trained on the full 6.3T tokens.

Unlike AFM-server training, Google TPUv5 clusters were harnessed to prepare the ARM-on-device model. The paper reveals that “AFM-on-device was trained on one slice of 2,048 TPUv5p chips.”

It is interesting to see Apple has released such a detailed paper, revealing techniques and technologies behind Apple Intelligence. The company isn’t renowned for its transparency but seems to be trying hard to impress in AI, perhaps as it has been late to the game.

Stay On the Cutting Edge: Get the Tom's Hardware Newsletter

Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.

Apple skips Nvidia's GPUs for its AI models, uses thousands of Google TPUs instead (3)

According to Apple’s in-house testing, AFM-server and AFM-on-device excel in benchmarks such as Instruction Following, Tool Use, Writing, and more. We’ve embedded the Writing Benchmark chart, above, for one example.

If you are interested in some deeper details regarding the training and optimizations used by Apple, as well as further benchmark comparisons, check out the PDF linked in the intro.

Mark Tyson is a news editor at Tom's Hardware. He enjoys covering the full breadth of PC tech; from business and semiconductor design to products approaching the edge of reason.

More about artificial intelligence

Nvidia to deliver Blackwell engineering samples this week — chips on track for fourth quarter launchNew memory tech unveiled that reduces AI processing energy requirements by 1,000 times or more

Latest

New Chinese office GPU can double as a budget 1080p gaming GPU — MTT S50 wields 2,048 MUSA cores, 8GB VRAM, 85W TGP
See more latest►

9 CommentsComment from the forums

  • Heat_Fan89

    This is NO surprise as Apple is still stinging from their last encounter with Nvidia which went bad and cost Apple a lot of money many years ago with failed GPU's that Nvidia did not take responsibility for. Both sides went blaming the other. Apple is not the only company burned by Nvidia.

    Reply

  • EMI_Black_Ace

    Well, given that all Apple wants out of the hardware is AI compute, Google TPUs deliver that on a more cost efficient basis without any of the other "stuff" Nvidia offers.

    Reply

  • Kamen Rider Blade

    Apple has a "Don't do Business" with nVIDIA & it's hardware after they were burned by nVIDIA.

    Reply

  • husker

    Perhaps Apple is willing to disclose its methods in order to show others that there is an AI road that does not lead to Nvidia.

    Reply

  • ezst036

    How serious can Apple be about gaming if they refuse to bury the hatchet with Nvidia?

    Reply

  • Makaveli

    Heat_Fan89 said:

    This is NO surprise as Apple is still stinging from their last encounter with Nvidia which went bad and cost Apple a lot of money many years ago with failed GPU's that Nvidia did not take responsibility for. Both sides went blaming the other. Apple is not the only company burned by Nvidia.

    add microsoft to that list with the original xbox.

    There is a reason all the consoles are using AMD IP.

    ezst036 said:

    How serious can Apple be about gaming if they refuse to bury the hatchet with Nvidia?

    Apple is a trillion dollar company if they cared about gaming they would have been in that market along time ago.

    Reply

  • Mattzun

    Makaveli said:

    add microsoft to that list with the original xbox.

    There is a reason all the consoles are using AMD IP.

    Apple is a trillion dollar company if they cared about gaming they would have been in that market along time ago.

    Apple has already captured a lot of gaming revenue - its just not on laptops/desktops.
    55 percent of gaming revenue is on mobile devices and Apple is getting a huge cut of that from both device sales and the app store.

    Back to the article
    Its great that Apple was able to define specific goals for AI processing and that Google TPUs worked for them.
    It allowed Apple to meet its goals and it freed up general purpose NVidia units for companies that need them.
    Given that there is a huge backlog for NVidia AI processors, this seems like a win-win.

    Reply

  • Makaveli

    Mattzun said:

    Apple has already captured a lot of gaming revenue - its just not on laptops/desktops.
    55 percent of gaming revenue is on mobile devices and Apple is getting a huge cut of that from both device sales and the app store.

    This I know but i'm PCMR I don't consider mobile gaming real gaming :)

    You are 100% correct!

    Reply

  • renz496

    husker said:

    Perhaps Apple is willing to disclose its methods in order to show others that there is an AI road that does not lead to Nvidia.

    Easy. All they need to do is make as much money as Apple in quarterly basis.

    Reply

Most Popular
China's newest homegrown AI chip matches industry standard at 45 TOPS — 6nm Arm-based 12-core Cixin P1 starting mass production
Alphawave develops 3nm UCIe chiplet IP for die-to-die connectivity
China limits civilian drone exports starting September 1 — country aims to prevent use by foreign military or terror orgs
Newegg launches CPU trade-in program with low payouts — $300 for a Core i9-14900K or $220 for a Ryzen 7 7800X3D
Asus has a new laptop that comes with a fragrance dispenser — Adol Book laptop always smells good
Ampere unveils monstrous 512-core AmpereOne Aurora processor — custom AI engine, support for HBM memory
Nvidia to deliver Blackwell engineering samples this week — chips on track for fourth quarter launch
AMD Ryzen 9000 price listings now on Best Buy — costs significantly less than Ryzen 7000 launch prices
New US government rules to allow export of some equipment to China by ASML, Tokyo Electron
AMD's Ryzen 9 5900XT, Ryzen 7 5800XT launch today for $349 and $249, respectively — existing Ryzen 5000 is less expensive
The cheapest Qualcomm Snapdragon X Elite PC is its mini desktop Dev Kit, and its available for preorder now
Apple skips Nvidia's GPUs for its AI models, uses thousands of Google TPUs instead (2024)
Top Articles
Latest Posts
Article information

Author: Twana Towne Ret

Last Updated:

Views: 5698

Rating: 4.3 / 5 (44 voted)

Reviews: 83% of readers found this page helpful

Author information

Name: Twana Towne Ret

Birthday: 1994-03-19

Address: Apt. 990 97439 Corwin Motorway, Port Eliseoburgh, NM 99144-2618

Phone: +5958753152963

Job: National Specialist

Hobby: Kayaking, Photography, Skydiving, Embroidery, Leather crafting, Orienteering, Cooking

Introduction: My name is Twana Towne Ret, I am a famous, talented, joyous, perfect, powerful, inquisitive, lovely person who loves writing and wants to share my knowledge and understanding with you.