What Can ChatGPT Vision Do? Complex Coding, PowerPoint Slides ... And Even Finding Waldo


27% profits every 20 days?

This is what Nic Chahine averages with his options buys. Not selling covered calls or spreads... BUYING options. Most traders don't even have a winning percentage of 27% buying options. He has an 83% win rate. Here's how he does it.


OpenAI's ChatGPT has a new power – vision. ChatGPT Vision is its new image analyzer tool that can help you with a range of things like generating code from a flow chart to understanding complex PowerPoint slides and memes.

Now, instead of explaining ChatGPT something, ChatGPT Vision can simply look at the object itself and answer all kinds of questions you might have.

This could be anything, even food – imagine you are at a fancy diner and have ordered something that sounds fancy but you don't really know what it is. Once the food arrives, you can pull out your phone and use ChatGPT Vision to figure out what exactly is on your plate.

While that sounds quite basic, ChatGPT Vision can be a brilliant coder, a math genius, an expert PowerPoint designer, and even your meme partner.

See Also: OpenAI CEO Sam Altman Tells Joe Rogan Future GPT Could Display ‘Word Soup' In Your Head: ‘A Very Valuable Tool'

Shocking Things You Can Do With ChatGPT Vision

Here are some of the shocking things that people are doing using the Microsoft Corp.-backed (NASDAQ:MSFT) OpenAI's ChatGPT Vision:

Coding – Entry-Level To Complex

Regardless of your expertise in coding, you can take ChatGPT's assistance to help you do some entry-level JSON generation to write apps and use Figma screenshots to write far more complex code.

ChatGPT Vision can take in screenshots from Figma and generate code.

Building with AI is getting wild. pic.twitter.com/D8yeJW1kGR

— Mckay Wrigley (@mckaywrigley) September 29, 2023

You can even simply draw a basic app and ChatGPT will spit out the code for you to begin with.

Hello World coding using nothing but a drawing for GPT-4V multimodal.

Coding an app is now closer to drawing an app…

Welcome to the future. pic.twitter.com/bFQ7QoXBLv

— Brian Roemmele (@BrianRoemmele) September 27, 2023

Understanding PowerPoint Slides

Simplicity is the essence of a good PowerPoint slide, but unfortunately, not everyone gets it, or sometimes the topic is too complex.

Now, whether the slide is talking about a complex Pentagon system or your ten-generation family tree, you can ask ChatGPT about it and it will explain everything to you like you're five.

ChatGPT image recognition vs "Crazy Pentagon PowerPoint Slides:"

(h/t @jonst0kes 🫡) pic.twitter.com/MX3NhTpG1n

— Sean Spriggens (@seanspriggens) September 26, 2023

While this is not quite a PowerPoint slide, it is still more complex than what you might find at a traffic signal. ChatGPT Vision boiled it down to just one single line.

Art Critiques

If you are an artist and need someone's opinion on your latest creation or a work-in-progress, you can now ask ChatGPT what it thinks about it.

The icing on the cake is that ChatGPT will even give you recommendations on how you can improve your creation.

I've been really excited about the potential for AI to make people better at painting, and I think we just made a big leap with GPT-4V.

It identified the main thing to fix in the flower painting (darkening the shadows) and made multiple good suggestions for the cow painting 🤯 pic.twitter.com/uKSVCSHKVR

— Marissa Montgomery (@marissamary) September 27, 2023

Analyzing Nutrition

ChatGPT can also be used to analyze how much nutrition a food item consists of. All you have to do is point ChatGPT Vision at the food item and viola, it will break down details like calories, carbohydrates, protein, fat, and more.

ChatGPT Vision takes an image of groceries and converts it to JSON based on the instructions.

GPT-4V is an image processing supertool. pic.twitter.com/Vx7loyvJNi

— Mckay Wrigley (@mckaywrigley) October 1, 2023

Translating Manuscripts

Manuscripts can be some of the hardest pieces of text to translate – some details can simply be lost due to wear and tear, while others might not be easily decipherable. It requires expert-level knowledge for an accurate translation.

But ChatGPT can do even that, quite effortlessly, too.

GPT-4V will be able to transcribe and translate manuscripts and texts.

I am excited to try out Arabic manuscripts to see how well it does. It does a phenomenal job with transcription even better than most humans. pic.twitter.com/K6y6WffLvz

— muin (@qamarunshadow) September 27, 2023

It remains to be seen if ChatGPT Vision is good enough for more complex translations, but for now, it has shown the potential.

Decoding Christopher Nolan's Outline For Inception

Christopher Nolan's Inception has quite a few people puzzled even now, but that wouldn't stop ChatGPT, would it?

ChatGPT managed to decipher the multiple levels of dreams that Nolan explains in Inception and even broke it down in simple terms.

ChatGPT Vision breaks down Christopher Nolan's early diagram for Inception.

Best part?

The diagram doesn't mention the word "Inception" once.

Crazy. pic.twitter.com/grPpTjvg3d

— Mckay Wrigley (@mckaywrigley) September 30, 2023

If you don't understand Inception yet, this might actually help!

Explaining Memes

ChatGPT can understand pop culture references even if you can't.

If you're ever stuck with a meme that you don't understand and are too afraid to ask anyone what it means, you can use ChatGPT Vision and ask it to explain to you what is going on.

Are you not entertained?

Finding Waldo

Playing a fun game of Finding Waldo but don't have the patience to actually find him yourself?

You can now ask ChatGPT Vision!

It's game over for Waldo 😂

#ChatGPT #GPT4V pic.twitter.com/rurEAZYkeP

— ‏‏‎ ‎+ (@tiredbyte) September 29, 2023

Check out more of Benzinga's Consumer Tech coverage by following this link.

Read Next: Google Japan's Funky Invention Lets You Wear Your Keyboard As A Cap — Here’s How You Can Make One

Photo via Shutterstock


27% profits every 20 days?

This is what Nic Chahine averages with his options buys. Not selling covered calls or spreads... BUYING options. Most traders don't even have a winning percentage of 27% buying options. He has an 83% win rate. Here's how he does it.


ENTER TO WIN $500 IN STOCK OR CRYPTO

Enter your email and you'll also get Benzinga's ultimate morning update AND a free $30 gift card and more!

Posted In: NewsTechAIartificial intelligenceChatGPTConsumer TechOpenAiSoftware & Apps