Google Launches AI Video Creation Feature on Gemini
Google has just announced that users can now create videos using artificial intelligence through its Gemini chatbot and the recently launched experimental tool Whisk.
AI image generation tools have been delighting us for years now, thanks to OpenAI, Imagen, Adobe Firefly, DALL-E-3 , and more. As the technology has advanced, we’ve had more and more options to refine our results. Now, Google Labs has released Whisk, a tool that lets you upload images as instructions instead of text prompts.
Google Labs' Whisk creates images from other images
If you live in the US, you'll now have access to Whisk from Google Labs, a "generative AI experiment," according to Google's blog. With Whisk, instead of relying solely on a descriptive text prompt, you can add images as references. The platform asks for three main characteristics: Subject, scene, and style. The tool then blends those elements together and creates the perfect image for you.
Note : Whisk uses Imagen 3, Google's latest image generation model.
Google hasn’t completely killed off the text prompt with Whisk. You still have the option to write an image prompt for each of the three categories or add a general note. You can also tweak the image after seeing Whisk’s initial test. For example, let’s say you create a vintage greeting card of a cat lying in the snow. After seeing the results, you might be tempted to add snowflakes to complete the look.
Every time you add or create an image in any of Whisk's three categories, the platform does the work of generating a detailed text description of that image, so if you want to add or edit an existing image, all you have to do is customize the text.
Finally, if you're lacking inspiration, you can randomize your visual elements by selecting a dice icon. For more complex creations, you can also add more than one theme, scene, or style reference.
Once you're happy with your masterpiece, you can save it to the platform or download it for local access.
Is it worth using Whisk?
With all the advanced AI image-making options out there for enhancing photos or creating “original” art, Google’s new tool might seem like a gimmick. But the way Whisk leverages visual references in its image-making process is unique, and you can see how it could be valuable in creative and professional situations.
Let's say you're working on a pitch deck and need images that look similar to a reference you already have. Instead of trying to reverse engineer that reference verbally, you simply upload the file, along with a brief text description of how you want your new image to be different.
To differentiate Whisk from other existing AI visualization software, Google has made it clear that the platform is designed for exploration, not refinement. While other products may be better suited to fine-tuning, Whisk is best suited for brainstorming:
"We built it for rapid visual exploration, not pixel-perfect editing. Whisk is about exploring ideas in new and creative ways, letting you play with dozens of options and download your favorites."
Honestly, sometimes it's hard to describe things with words. Whisk opens up some new possibilities when you simply "want an image to look like this".
Installing more RAM is the most effective solution to speed up your computer. Even if your computer is new, after only a few years of use, you will have to install more RAM to ensure better speed. In addition, new operating systems also require more memory. When a computer does not have enough RAM, it will exchange data streams with the hard drive, and that is the reason why your system runs slowly.
Gemini Advanced is a paid subscription from Google that gives you access to more advanced AI models. After signing up for the Gemini Advanced plan, if you no longer need to use it, you should cancel the Gemini Advanced plan, according to the article below.
In this article, WebTech360 will guide you how to install and experience Windows 11 on VMWare virtual machine.
Grouping layers in Canva makes your design more professional and also makes it easier to edit and work with your design.
Safe Communication will blur sensitive images received on your child's iPhone via Messages, AirDrop. Here's how to use Safe Communication on iPhone.
Marksmen return in TFT season 14 and are still a powerful class with outstanding long-range physical damage.
Some computers after upgrading to Windows 10 version have the problem of losing sound. We can check the audio device connections to the computer, or adjust the sound settings on the operating system.
In this article students will learn how to add sounds and use sounds in ScratchJR for each of their characters.
ScratchJR helps students create command-based programs for characters, and you can use it to build a foreign language learning program on ScratchJR.
The iPhone iMessage group chat feature helps us text and chat more easily with many people, instead of sending individual messages.
TikTok has an option to set a nickname for your friends' accounts to choose a name that is easier to remember in your friend list. This article will guide you to set a nickname for your friends' accounts on TikTok.
Search and service improvement is a setting in Microsoft Edge that lets the company use your web search data to improve your search and web experience.
Weibo accounts also have options to edit the account, such as changing the Weibo password. Here are instructions for changing the Weibo password.
Claude AI now allows you to choose from a variety of text writing styles so users get the text they need.
There are several ways to recover deleted messages on iPhone, using iCloud, using iTunes, and using third-party apps.