Google's TurboQuant Algorithm Cuts Language Model Memory Use by Up to Six Times

Technology 13:40, 27-03-2026


Google has introduced a new algorithm called TurboQuant, Zamin.uz reports.

The algorithm can reduce the memory consumption of large language models by up to a factor of six. According to the company's data, the method preserves accuracy and does not significantly degrade system performance.

As a result, artificial intelligence systems could become cheaper and easier to deploy, according to the Tech.onliner.by website.

The main goal of TurboQuant is to manage more efficiently the cache that language models use during a conversation. The cache stores intermediate data so that the system does not have to repeat the same calculations for every new response.

However, the longer the interaction with the user, the larger the cache becomes. This can slow down responses and increase the demand for hardware resources.
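To make the growth concrete, here is a rough back-of-the-envelope sketch. It assumes the cache in question is what is commonly called a key-value cache in transformer models, and the model dimensions (32 layers, 8 key-value heads, 128 values per head, 16 bits per value) are hypothetical numbers chosen for illustration, not figures from Google.

```python
def kv_cache_bytes(tokens, layers=32, kv_heads=8, head_dim=128, bits_per_value=16):
    """Estimate cache size: one key and one value vector per token, head, and layer.

    All default dimensions are hypothetical and only serve the illustration.
    """
    values = 2 * layers * kv_heads * head_dim * tokens  # factor 2 = keys + values
    return values * bits_per_value // 8


# The cache grows linearly with the number of tokens in the conversation.
for tokens in (1_000, 10_000, 100_000):
    mib = kv_cache_bytes(tokens) / 2**20
    print(f"{tokens:>7} tokens -> {mib:,.0f} MiB")
```

Under these assumptions, a tenfold longer conversation needs a tenfold larger cache. Storing each value in fewer bits shrinks the total in the same proportion, and that is the lever a compression scheme pulls.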

According to Google, TurboQuant works in several stages: it compresses the stored data and then corrects the errors that the compression introduces. The algorithm reduces memory pressure while also lowering computational costs.
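The general idea of "compress, then correct the error" can be shown with a toy two-stage scheme. This is a generic sketch and not Google's actual method. The 3-bit uniform quantizer, the 1-bit sign correction, and all function names are assumptions made for this example.

```python
import random


def quantize_uniform(values, bits):
    """Stage 1: map each value to one of 2**bits evenly spaced levels."""
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, lo, scale


def dequantize_uniform(codes, lo, scale):
    """Turn integer codes back into approximate values."""
    return [lo + c * scale for c in codes]


def sign_correction(residual):
    """Stage 2: keep 1 bit (the sign) per residual plus one shared magnitude."""
    magnitude = sum(abs(r) for r in residual) / len(residual)
    signs = [1 if r >= 0 else -1 for r in residual]
    return signs, magnitude


def mse(a, b):
    """Mean squared error between two equally long lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


random.seed(0)
vector = [random.gauss(0.0, 1.0) for _ in range(256)]  # stand-in for cached data

# Stage 1: coarse 3-bit compression.
codes, lo, scale = quantize_uniform(vector, bits=3)
stage1 = dequantize_uniform(codes, lo, scale)

# Stage 2: correct part of the error left by stage 1 using one extra bit per value.
residual = [v - s for v, s in zip(vector, stage1)]
signs, magnitude = sign_correction(residual)
stage2 = [s + sg * magnitude for s, sg in zip(stage1, signs)]

error_stage1 = mse(vector, stage1)
error_stage2 = mse(vector, stage2)
print(f"error after stage 1: {error_stage1:.5f}")
print(f"error after stage 2: {error_stage2:.5f}")
```

In this toy setup the second stage always lowers the mean squared error, by exactly the square of the shared magnitude, at a cost of one extra bit per value. Whether TurboQuant's stages resemble this is not stated in the report.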

Importantly, TurboQuant can be applied to existing models without additional preparation. This should make it especially useful for artificial intelligence tools that run on smartphones and other devices with limited resources.

If TurboQuant is widely adopted, it could reduce the operating costs of AI services and make it practical to run advanced models on smaller, less powerful devices.

That, in turn, would lay the groundwork for broader use of artificial intelligence technologies.


Nodirbek Razzokov

Reporter
