Google Gemini's AI Avatar Tool Creates Unnervingly Realistic Digital Clones

Google's Gemini app now allows users to create AI avatars, digital clones of themselves that can be inserted into AI-generated videos. The feature offers a glimpse into the future of personalized content creation but raises uncanny questions about digital identity.

By Jason Young

Jason Young covers green tech for Techawave.

May 23, 20263 min read0 views

SAN FRANCISCO – A digital doppelganger of the author is seen singing to a dinosaur in a sun-drenched park, and moments later, surfing beneath the Golden Gate Bridge. These scenarios, seemingly pulled from a science fiction film, were generated using Google's new AI avatar feature within the Gemini app. This technology allows subscribers to create a digital replica of themselves that can then be placed into AI-generated video content, powered by the company's Omni video model.

The feature, available to Google's AI Pro plan subscribers who pay $20 per month, offers a novel way to interact with artificial intelligence. However, the experience of seeing oneself recreated so realistically can be both impressive and disquieting. While the generated clips exhibited occasional glitches, such as nonsensical outfits or jumbled moments, the uncanny resemblance to the user was undeniable. The author noted the digital clone was a near-perfect replica, down to subtle facial features, raising profound questions about digital identity and self-representation.

The setup process for creating an avatar is straightforward. Users are instructed to sit in a well-lit room with their phone's camera focused on their face. The process involves reading a series of two-digit numbers aloud, then slowly turning the head from side to side. This five-minute procedure effectively births a digital twin, ready to star in personalized AI creations. The author advised paying attention to clothing choices during setup, as apparel is likely to be replicated in AI generations.

A Digital Reflection of Reality

One generated clip depicted the AI avatar singing a birthday song to an aging dinosaur at Dolores Park, a well-known San Francisco landmark. The prompt specified the setting, and the resulting video impressively captured the park's photorealistic details, including palm-tree-lined sidewalks and the distant Salesforce Tower. This attention to environmental detail is likely a result of Google's extensive mapping capabilities.

As the AI avatar began the song, the initial bars flowed naturally, with gestures mirroring a conductor. However, the scene then devolved into a more chaotic sequence, featuring a sudden appearance of a cupcake and the avatar exhaling smoke after blowing out a candle. A second generated clip showed the avatar surfing under the Golden Gate Bridge, though oddly clad in denim instead of a wetsuit. This clip even incorporated dynamic shots mimicking those from a GoPro attached to a surfboard.

While the AI-generated videos offered moments of impressive realism, they also highlighted the current limitations of generative AI. The digital clone, dubbed "Reece 2.0," was described as seamless and always ready for any task, a stark contrast to the author's own human limitations. The experience was noted as feeling more real than rewatching personal voicemails or past video clips, due to the hyper-realized nature of the digital self. This raises a critical point about how we perceive and interact with our digital representations in an increasingly AI-driven world.

Google emphasizes its commitment to safety with the rollout of this new feature. Nicole Brichtova, who leads the product team for Omni at Google DeepMind, stated, "We try to prevent harm. And, we try to do it in a way where we're not blocking benign things." This approach aims to balance the potential for misuse, such as the creation of nonconsensual deepfakes which increasingly target women using less-regulated AI models, with the desire to enable creative and harmless applications. The ability to create such lifelike digital versions of oneself, even with imperfections, represents a significant step in AI's capabilities and brings to the forefront discussions about digital ethics and the future of personal media.

SourceWIRED

Topicsai avatar google gemini digital clone generative ai nicole brichtova omni video model