What Can ChatGPT Vision Do? Complex Coding, PowerPoint Slides ... And Even Finding Waldo

Zinger Key Points
  • OpenAI gave ChatGPT the power of vision, hearing, and voice, recently.
  • Now, ChatGPT Vision can help with things like coding, translating, deciphering manuscripts and even memes.
  • Here are the most shocking things people are doing with ChatGPT Vision.

OpenAI's ChatGPT has a new power – vision. ChatGPT Vision is its new image analyzer tool that can help you with a range of things like generating code from a flow chart to understanding complex PowerPoint slides and memes.

Now, instead of explaining ChatGPT something, ChatGPT Vision can simply look at the object itself and answer all kinds of questions you might have.

This could be anything, even food – imagine you are at a fancy diner and have ordered something that sounds fancy but you don't really know what it is. Once the food arrives, you can pull out your phone and use ChatGPT Vision to figure out what exactly is on your plate.

While that sounds quite basic, ChatGPT Vision can be a brilliant coder, a math genius, an expert PowerPoint designer, and even your meme partner.

See Also: OpenAI CEO Sam Altman Tells Joe Rogan Future GPT Could Display ‘Word Soup' In Your Head: ‘A Very Valuable Tool'

Shocking Things You Can Do With ChatGPT Vision

Here are some of the shocking things that people are doing using the Microsoft Corp.-backed MSFT OpenAI's ChatGPT Vision:

Coding – Entry-Level To Complex

Regardless of your expertise in coding, you can take ChatGPT's assistance to help you do some entry-level JSON generation to write apps and use Figma screenshots to write far more complex code.

You can even simply draw a basic app and ChatGPT will spit out the code for you to begin with.

Understanding PowerPoint Slides

Simplicity is the essence of a good PowerPoint slide, but unfortunately, not everyone gets it, or sometimes the topic is too complex.

Now, whether the slide is talking about a complex Pentagon system or your ten-generation family tree, you can ask ChatGPT about it and it will explain everything to you like you're five.

While this is not quite a PowerPoint slide, it is still more complex than what you might find at a traffic signal. ChatGPT Vision boiled it down to just one single line.

Art Critiques

If you are an artist and need someone's opinion on your latest creation or a work-in-progress, you can now ask ChatGPT what it thinks about it.

The icing on the cake is that ChatGPT will even give you recommendations on how you can improve your creation.

Analyzing Nutrition

ChatGPT can also be used to analyze how much nutrition a food item consists of. All you have to do is point ChatGPT Vision at the food item and viola, it will break down details like calories, carbohydrates, protein, fat, and more.

Translating Manuscripts

Manuscripts can be some of the hardest pieces of text to translate – some details can simply be lost due to wear and tear, while others might not be easily decipherable. It requires expert-level knowledge for an accurate translation.

But ChatGPT can do even that, quite effortlessly, too.

It remains to be seen if ChatGPT Vision is good enough for more complex translations, but for now, it has shown the potential.

Decoding Christopher Nolan's Outline For Inception

Christopher Nolan's Inception has quite a few people puzzled even now, but that wouldn't stop ChatGPT, would it?

ChatGPT managed to decipher the multiple levels of dreams that Nolan explains in Inception and even broke it down in simple terms.

If you don't understand Inception yet, this might actually help!

Explaining Memes

ChatGPT can understand pop culture references even if you can't.

If you're ever stuck with a meme that you don't understand and are too afraid to ask anyone what it means, you can use ChatGPT Vision and ask it to explain to you what is going on.

Are you not entertained?

Finding Waldo

Playing a fun game of Finding Waldo but don't have the patience to actually find him yourself?

You can now ask ChatGPT Vision!

Check out more of Benzinga's Consumer Tech coverage by following this link.

Read Next: Google Japan's Funky Invention Lets You Wear Your Keyboard As A Cap — Here’s How You Can Make One

Photo via Shutterstock

Market News and Data brought to you by Benzinga APIs
Posted In: NewsTechAIartificial intelligenceChatGPTConsumer TechOpenAiSoftware & Apps
Benzinga simplifies the market for smarter investing

Trade confidently with insights and alerts from analyst ratings, free reports and breaking news that affects the stocks you care about.

Join Now: Free!

Loading...