Imagine live dubbing for your audio or video content. With the Gemini API, you can now build it. Watch Gemini 3.5 Live Translate perform speech-to-speech translation in near real time, streaming translated audio directly from a video source and going from English to Hindi in seconds!
👇 Watch the full video to see it in action: https://goo.gle/4oeB47z
📝 Read the full blog announcement: https://goo.gle/4fuRlmG
Google for Developers
Technology, Information and Internet
Mountain View, CA 4,101,593 followers
Join a community of creative developers and learn how to use the latest in technology—from AI and cloud, to mobile & web
About us
Discover the latest technologies, resources, events, and announcements to help you build smarter and ship faster. Explore more at developers.google.com
- Website: http://developers.google.com
- Industry: Technology, Information and Internet
- Company size: 10,001+ employees
- Headquarters: Mountain View, CA
- Specialties: coding, engineering, firebase, android, cloud, web development, and mobile development
Updates
Google for Developers reposted this
Lighthouse is getting a new agentic browsing category. Instead of just auditing for human users, it tracks WebMCP schemas and layout shifts to see if AI agents can actually navigate your site. If machine-readability is on your roadmap, this is worth a look → https://goo.gle/3ReK3JD #GoogleIO
Introducing DiffusionGemma, an experimental open 26B Mixture of Experts model that moves beyond traditional sequential generation to process and generate entire blocks of text simultaneously. DiffusionGemma unlocks new value for developers:
🏃‍♀️ Generates 1,000+ tokens/sec on an NVIDIA H100 and 700+ tokens/sec on an RTX 5090
🔄 Optimizes non-linear workflows like code infilling, inline editing, and real-time self-correction
📦 Fits comfortably within the 18GB VRAM limits of high-end dedicated consumer GPUs when quantized
🔧 Supports native integration with MLX, vLLM, Hugging Face, and Unsloth, with advanced NVIDIA NVFP4 kernel optimization
📥 Download the Apache 2.0 weights on Hugging Face: https://goo.gle/4uCwBgQ
📖 Explore the developer guide on the blog: https://goo.gle/3RX6zHn
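As a rough sanity check on the VRAM figure above, the weight footprint of a 26B-parameter model can be estimated from the bits stored per weight. This is a back-of-the-envelope sketch only: the function name is hypothetical, the 4-bit setting is an assumption (the post does not say which quantization level the 18GB figure refers to), and the estimate ignores activations, caches, and quantization metadata.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes).

    Hypothetical helper for illustration; not part of any library.
    """
    # billions of parameters * bits per weight / 8 bits per byte = GB
    return params_billion * bits_per_weight / 8


# 26B comes from the post; the bit widths are assumed settings.
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: ~{weight_memory_gb(26, bits):.1f} GB")
```

Under these assumptions, 4-bit weights come to about 13 GB, which would leave some headroom under an 18GB budget for runtime overhead.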
Your code shouldn't just compile, it should communicate 🗣️ Ensure your identifier names stay clean with these practices:
📏 Use brief names for local scopes
🛑 Drop redundant context
💻 Rely on modern editor type detection
Paraphrase your naming conventions to find an easier way to solve readability problems.
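The three naming practices above might look like this in Python. It is a minimal sketch, not an official style guide: the `User` class and both functions are made-up examples, and type hints stand in for "editor type detection".

```python
from dataclasses import dataclass


@dataclass
class User:
    # Drop redundant context: `name`, not `user_name`, because the
    # class already tells the reader what the name belongs to.
    name: str
    email: str


def total_length(words: list[str]) -> int:
    # Brief names for local scopes: `w` lives for one expression,
    # so a longer name would add noise rather than clarity.
    return sum(len(w) for w in words)


def active_emails(users: list[User], active: set[str]) -> list[str]:
    # Rely on editor type detection: the annotations carry the types,
    # so names like `users_list` or `active_set` are unnecessary.
    return [u.email for u in users if u.name in active]
```

For example, `active_emails([User("ana", "ana@example.org")], {"ana"})` returns the one matching address, and hovering over `users` in a modern editor shows `list[User]` without encoding that in the name.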
Gemini 3.5 Live Translate is now in public preview via the Gemini API, delivering low-latency speech-to-speech translation across more than 70 languages and 2,000 language pairs! 🌍 Try the challenge in the comments: What is the most “uncommon” language pair your voice application needs to translate? Tell us, and we will let you know in the comments if the model supports it! Read the full blog announcement: https://goo.gle/3QzaHwN
💬 Gemini 3.5 Live Translate is now in public preview via the Gemini API and Google AI Studio. While voice translation applications often rely on combining separate speech-to-text, translation, and text-to-speech tools, this new release offers a completely streamlined alternative. Gemini 3.5 Live Translate unifies the entire stack into a single, natively multimodal model. By processing speech as it streams, it delivers low-latency performance while beautifully preserving the speaker’s original tonality, pitch, and pacing for a more natural audio experience.
Key developer features:
🛠️ Streamlined architecture: Receive translated audio and text transcriptions in a single model call
🌍 Broad language support: Build workflows across more than 70 languages and 2,000 language pairs
🧠 Automatic language detection: Translate multiple languages in a single stream for seamless performance across long sessions
Explore how you can build digital dubbing tools, conversational experiences, and more in AI Studio: https://goo.gle/4vYctH5
Start building with Gemini Live API: https://goo.gle/43kCfcd
Get more in the blog: https://goo.gle/3QzaHwN
Gemini models are now accessible to millions of Apple developers through Apple’s Foundation Models framework and natively within Xcode. This partnership provides seamless cloud-hosted inference to build dynamic, agentic app experiences and boost development velocity. Additionally, you can use agentic coding assistance from Gemini in Xcode to accelerate multi-step development tasks.
Key developer features:
🔄 Shared API surface: Swap effortlessly between local and cloud inference with a single code change to optimize costs and latency.
🛡️ No backend required: Use Firebase AI Logic to connect apps directly to Gemini models securely without maintaining a separate backend server.
💻 Gemini in Xcode: Perform complex, multi-step tasks like reviewing code, fixing bugs, and building features faster right from the Intelligence side panel.
🔑 Flexible authentication: Connect easily using self-serve keys from Google AI Studio or corporate quotas via the Gemini Enterprise Agent Platform.
Explore the full announcement to start building next-generation AI experiences on Apple platforms today: https://goo.gle/3Q1YDnD
Catch up on announcements and highlights from I/O by selecting an audio overview, powered by Gemini in NotebookLM. Get all of the NotebookLM podcasts:
AI → https://goo.gle/4vgknM1
Android podcast → https://goo.gle/4xgW1D8
Chrome podcast → https://goo.gle/4ogir3j
Cloud podcast → https://goo.gle/4fsDihs
Check out the full notebook for even more I/O nuggets → https://goo.gle/3SbBLma