Imagine a world where you could ask a dolphin for directions or discuss the latest fish trends. Sounds like science fiction? Well, Google is turning this fantasy into reality with its groundbreaking AI model, DolphinGemma. ππ€
For decades, humans and dolphins have been engaged in a one-sided conversation. We talk, they squeak, and we pretend to understand each other. But now, Google, in collaboration with Georgia Tech and the Wild Dolphin Project (WDP), is bridging this communication gap. DolphinGemma, based on Google’s open Gemma family of models, is trained to understand and even generate dolphin sounds. πΆπ¬
The WDP has been collecting data on a group of wild Atlantic spotted dolphins since 1985. This treasure trove of audio, video, and behavioral notes is now being fed into DolphinGemma. The model processes dolphin sounds using audio tokenizers like SoundStream and predicts what vocalization might come next. Think of it as autocomplete, but for dolphins. ππ‘
But Google isn’t stopping at passive listening. The team is also developing CHAT (Cetacean Hearing Augmentation Telemetry), a two-way communication system. CHAT assigns synthetic whistles to objects dolphins like, such as seagrass and floating scarves, and waits to see if the dolphins mimic these sounds to request them. It’s like inventing a shared language, but with underwater microphones instead of flashcards. π€π
DolphinGemma is slim enough to run on a Google Pixel, and WDP is already deploying it in the field this summer using Pixel 9s in waterproof rigs. The ultimate goal? To accelerate progress in interspecies communication. Google plans to open-source DolphinGemma later this year, allowing other researchers to apply it to their own acoustic datasets. π±π
Of course, we’re still a long way from discussing philosophy with dolphins. There’s no guarantee their vocalizations map neatly to human-like language. But DolphinGemma will help sift through years of audio for meaningful patterns. And who knows? Maybe someday, you’ll be able to ask a dolphin for directions while sailingβjust don’t drop your phone in the water. β΅π±
Dolphins aren’t the only animals humans are trying to communicate with using AI. Scientists have also developed algorithms to decode pigs’ emotions based on their sounds. But let’s be honest, dolphins are way more charismatic. ππ