When ChatGPT made its debut, it left people astonished with its remarkably human-like comprehension of inquiries and its adept responses. This AI chatbot swiftly became an overnight sensation, dominating conversations across social media platforms. Notably, worldwide Google searches for the phrase “artificial intelligence” skyrocketed to an unprecedented peak, underscoring the profound consumer fascination with this cutting-edge technology.
Just as people’s attention was starting to wane, OpenAI dropped a bombshell update: ChatGPT can now browse the web. The most impressive new feature was its ability to analyze images with a level of detail that rivals human vision.
As interest in ChatGPT was rekindled, we’ve gathered some of the most noteworthy instances showcasing the innovative applications of its new image recognition feature.
Recognising complicated diagrams
Diagrams are a powerful tool for simplifying complex information, but what if the diagrams themselves are too complex to understand? ChatGPT’s new image capabilities can help, by breaking down even the most convoluted diagrams into language that is easy for anyone to understand, even a child. For example, one Twitter user asked ChatGPT to explain a flow diagram with hundreds of elements, and the AI chatbot was able to provide a clear and concise explanation.
Helping you learn
Conversely, it also excels in the reverse role. Whether you require supplementary context or annotations for a straightforward diagram or flowchart, or simply seek to decipher its content and purpose, ChatGPT proves highly adept in this regard.
Finding the sources of images
One Twitter user posted a screenshot from the movie Gladiator and asked ChatGPT to identify the source and transcribe the dialogue. ChatGPT responded as if it had actually watched the movie, providing not only the answers to the original query, but also additional context that enriched the user’s understanding of the scene.
The extent of this feature’s effectiveness with random movie stills remains to be confirmed, as it may be optimized for well-known scenes. Nonetheless, this tool holds significant potential for enhancing reverse image searches, particularly when utilized in conjunction with ChatGPT web-browsing capabilities.
Understanding memes and ideas
The enigma of viral memes often lies in their context or in the subjective nature of humor. Sometimes, a meme can appear utterly nonsensical or cliché, leaving you perplexed by its popularity. When you find yourself unable to fathom why a meme has garnered hundreds of thousands of likes, ChatGPT can step in to provide insights and clarity.
While tools like Google Lens and Microsoft Visual Lens are great for translating text on images, they can sometimes fail to provide accurate results. ChatGPT can be a valuable alternative when other translation tools struggle, such as when trying to translate text on a billboard, road sign, storefront, or other public display.
Generating code from images
One of the most remarkable applications of this feature lies in its capacity to decipher website and project code solely from screenshots, then replicate it accurately.
ChatGPT’s new abilities to see, hear, and speak have only been available for a few days, so it’s clear that we’ve only just begun to scratch the surface of its potential. As people continue to experiment with different types of inputs, we can expect to see a host of cool new applications emerge in the near future.
The implementation of ChatGPT’s new image and voice features is still ongoing, and Plus and Enterprise users are the only ones who can currently utilise them.