Flash News

Google Gemini 3.5 Launches Live Translate Feature, Supporting Real-Time Translation in 70+ Languages with Only a Few Seconds of Delay

Google Gemini 3.5 has launched a Live Translate feature that supports real-time, bidirectional translation in more than 70 languages with only a few seconds of delay. It preserves the speaker's original tone, rhythm, and pitch, and its automatic noise filtering makes it suitable for noisy environments.

The Translate app has introduced a "Receiver Mode" for ear-to-ear translation. Developers can access the feature directly through the Gemini Live API and Google AI Studio. It detects the spoken language automatically, so the language does not need to be specified in advance. Platforms such as Grab are already testing it for multilingual communication between drivers and passengers.

Source: Public Information

ABAB AI Insight

Google has previously strengthened the audio multimodal capabilities of the Gemini series, and this Live Translate upgrade continues its shift from text translation to fully real-time voice interaction. Similar technologies have been rolled out rapidly in Google Meet, Pixel devices, and NotebookLM to deepen ecosystem penetration.

On the capital front, Google is using the Gemini model family and its cloud infrastructure to turn low-latency translation into a core advantage for Workspace, enterprise APIs, and consumer applications. This move increases global user engagement and creates a stable revenue stream through API calls, while also supplying more real-time multilingual data to its search, advertising, and collaboration businesses.

Like Microsoft Translator and DeepL, which continue to iterate on real-time scenarios, Gemini 3.5 Live Translate is currently in an expansion phase, moving from an experimental translation tool toward mainstream productivity and communication infrastructure. Noise filtering and tone retention are the features it relies on to solidify a leading position among global real-time communication tools.

Essentially, this represents a shift toward technological substitution and capital concentration. Real-time, low-latency translation directly replaces traditional human translation and inefficient tool chains. Automatic language detection and native audio retention, in turn, accelerate the concentration of global communication capital on the Google AI platform, reshaping the cost structure and pricing power of international business, education, travel, and other scenarios.

ABAB News · Cognitive Law

The lower the translation delay, the greater the leverage for cross-language collaboration.
The more complex the noise environment, the rarer the competitive advantage of retaining native tone.
The stronger the automatic detection, the more thoroughly language barriers are broken.

Source: ABAB News