Even tech giants with their own large language models are struggling to find enough computing power. Google was forced to cap Meta's use of its Gemini AI model after Mark Zuckerberg's company exceeded its computing capacity, sources familiar with the matter told the Financial Times. The incident reveals that even tech giants with their own LLMs are having trouble finding enough computing power for themselves, let alone their customers.
Meta does not operate its own cloud business and is trying to rapidly expand its data center build-out, having pledged $600 billion in cloud computing investments over the next two years. Google reportedly warned the social media company about its capacity limits in March, forcing Meta to request that employees use tokens more efficiently. Gemini AI is being used by Meta for customer service, advertiser chatbots, and coding, alongside processes like harmful content takedowns and scam detection. Meta initially chose Gemini because it outperformed its own Llama open-source models.
Sponsored Protocol
Computing power shortage hits even the biggest players
Despite billions spent on data centers, big companies are struggling to get enough capacity. Google itself recently agreed to pay $920 million per month to SpaceX for access to xAI's data centers, due to the extra computing power required for Gemini Enterprise. AI power users are benefiting from the boom, but providers like OpenAI aren't profiting yet, since revenue from AI is a small percentage of costs. Recently, token prices have surged, forcing some companies to back off on AI usage — including the AI companies themselves.
Sponsored Protocol
For related coverage, see how OpenAI tightens API access in Europe and Google's warning on search data hacking. For more background, read the Wikipedia page on Gemini.
Source: https://www.engadget.com/2203579/google-reportedly-capped-meta-use-of-gemini-ai-for-coding-chatbots