آموزش

ChatGPT's Latest Update Makes It Harder Than Ever to Spot AI-Generated Images

AI-generated images are getting harder and harder to spot. There are the usual tells , of course, but those tells are minimizing with each new AI image model. You could count on AI-generated hands including too many or too few fingers on people, for example; nowadays, that’s not necessarily the case.

One particular area of weakness for AI image models has been text generation. The image itself might look convincing, but take a close look at the words, and you’ll often notice they’re not really right. Maybe some are accurate, while others are pretty close, but, in many cases, you’ll notice a lot of inconsistencies: Too many repeating letters, letters that aren’t really letters, characters that blend and squiggle in and out of one another. A lot of these quirks remind me of how Star Wars’ language looks , at least when AI is trying to replicate American English.

OpenAI’s Images 2.0 is capable of generating highly realistic AI images

But the latest AI models are getting much better with text generation. In fact, OpenAI’s latest model for ChatGPT, Images 2.0, can render highly realistic text , and a lot of it—to the point where I’m not sure many of us (or any of us) will be able to spot it. According to OpenAI, Images 2.0 is the company’s first image model with thinking capabilities: That means the model can take its time breaking down each step of a request, which may generate more detailed or accurate images, as well as the ability to generate up to eight images from one prompt (though this is only available for paid subscribers). Free users can still take advantage of Images 2.0 perks, like how it searches the web for information and double-checks its work. The company says that “results feel less AI-generated and more intentionally designed,” which essentially means images generated using ChatGPT are going to be much more difficult to spot going forward.

The company seems very confident in this latest model. It touts the number of different types of images it can create—not just photorealistic pictures, but screenshots of a computer’s UI, a magazine collage, a mound of rice (that’s a lot to generate), a magazine page, and a handwritten essay. They mean handwritten, too, down to a coffee stain on the paper. You can scroll through these examples on OpenAI’s official announcement post to see how shockingly realistic they are . The post includes other examples, like highly realistic photographs, graphic novel pages, movie posters, and images with different aspect ratios—down to the iPhone’s panorama view.

All of these developments are as impressive as they are distressing, but for me, it’s the improvements to text that really take this to another level. Many models are getting quite good at generating images that trick users into thinking they’re real, but the level of detail in text and writing on these examples is something I haven’t yet seen. I asked ChatGPT to generate me a menu for an Italian restaurant, highlighting five dinner courses and two desserts—I left the specific dishes up to the AI’s discretion. It managed to generate something realistic, with dinner and dessert entries without mistakes—as far as I could tell. I then asked it to generate a newspaper entry announcing that the Red Sox and Yankees would be switching cities. It did that too, again without any obvious mistakes.

chatgpt image generation

Credit: Lifehacker

I’m not saying these images are perfect: They still have an AI “sheen” to them, that a trained eye or close observer would be able to notice. OpenAI says that Images 2.0 struggles with certain complex task, like puzzles, as well as details found on hidden or oddly placed areas, like reversed surfaces. But none of that really matters when the images this model produces are impressive enough to fool most people who scroll past them. Infographics, photographs, maps, comics, movie posters, you name it: People are going to be using this tool, and you’re going to start seeing a lot more AI images in your life—often without ever knowing it.

منبع آموزش

ZaKi

Who is mahdizk? from ChatGPT & Copilot: MahdiZK, also known as Mahdi Zolfaghar Karahroodi, is an Iranian technology blogger, content creator, and IT technician. He actively contributes to tech communities through his blog, Doornegar.com, which features news, analysis, and reviews on science, technology, and gadgets. Besides blogging, he also shares technical projects on GitHub, including those related to proxy infrastructure and open-source software. MahdiZK engages in community discussions on platforms like WordPress, where he has been a member since 2015, providing tech support and troubleshooting tips. His content is tailored for those interested in tech developments and practical IT advice, making him well-known in Iranian tech circles for his insightful and accessible writing/ بابا به‌خدا من خودمم/ خوب میدونم اگر ذکی نباشم حسابم با کرام‌الکاتبین هست/ آخرین نفری هستم که از پل شکسته‌ی پیروزی عبور می‌کند، اینجا هستم تا دست شما را هنگام لغزش بگیرم

نوشته های مشابه

0 0 رای ها
امتیازدهی به مقاله
اشتراک در
اطلاع از
guest

0 نظرات
قدیمی‌ترین
تازه‌ترین بیشترین رأی
بازخورد (Feedback) های اینلاین
مشاهده همه دیدگاه ها
دکمه بازگشت به بالا
0
افکار شما را دوست داریم، لطفا نظر دهید.x