I tested whether Gemini, ChatGPT, and Claude can analyze videos - this one wins
Briefly

"Gemini can watch YouTube, MP4, and MOV files. Claude still can't process video directly. ChatGPT needs Codex help for deeper video work."
"I fed each AI a set of three videos. One is a YouTube video I published last year about the scientific process of annealing (yes, I am as exciting on video as I am on ZDNET). I tested the AIs to see if they can understand what's in the video. Then, I tried to see if they could create a better thumbnail than I used on my YouTube channel."
"The second video is a motion test for the DJI Neo 2 drone. It's just a video of me standing in front of the drone, using gestures to control how the drone flies. No audio. I wanted to see if the AIs understand what's happening there. That's in MP4 format. Finally, I have the original MOV file that I uploaded to YouTube for a walk-and-talk about my YouTube posting strategy."
"I'm using the local version for my AI test, though, because I wanted to see how well the AIs could ascertain what I'm talking about without any metadata, transcripts, or hints provided by YouTube. It's just the video itself. If you want to see the after-uploaded version, here's a link. I tested the latest and best models. I tested the $20-per-month ChatGPT Plus plan, the $20-per-month Gemini Pro plan, and the $100-per-month Claude Max plan, which I use for Claude Code."
Gemini can watch YouTube videos and local MP4 and MOV files. Claude cannot process video directly. ChatGPT can watch video, but deeper video work requires Codex help. The tests used three videos: a YouTube video about the scientific process of annealing, a silent MP4 motion test of a DJI Neo 2 drone controlled by gestures, and a local MOV file of a walk-and-talk about YouTube posting strategy. Using the local MOV file meant the AIs had no metadata, transcripts, or hints from YouTube and had to rely on the video content alone. Prompts were kept simple: both “Can you watch this video?” and “watch this video” worked.
Read at ZDNET