OpenAI, the parent company of ChatGPT, has given its first official public preview of DALL-E 3, its latest image generation model. Rolled out Wednesday at a small event for reporters, DALL-E 3 is being pitched as a tool that fully understands complex text prompts, and produces images to match them in complexity.
As a new information page about DALL-E 3 on the OpenAI website notes, "Modern text-to-image systems have a tendency to ignore words or descriptions, forcing users to learn prompt engineering. DALL-E 3 represents a leap forward in our ability to generate images that exactly adhere to the text you provide."
SEE ALSO: OpenAI releases new teacher guide for ChatGPT in classroomsPossible images from an in-progress version of DALL-E 3 were leaked onto Discord earlier this summer, and those showed enormous potential along the lines depicted in the press preview. The leaker claimed to have fed DALL-E 3 the lengthy prompt "painting of a pink jester giving a high five to a panda while in a cycling competition. The bikes are made of cheese and the ground is very muddy. They are driving in a foggy forest. The panda is angry." The resulting image was downright astonishing in its fidelity to that request.
Image generators like Midjourney and Stable Diffusion, while capable of mimicking photorealism and producing representations of a wide-range of objects, styles, and people (with no small amount of controversy to go with them) will undoubtedly struggle to produce anything this complex.
Those image generators, and OpenAI's own previous offerings in this area, also famously fall short when asked to produce images that feature text — usually producing garbled nonsense at best, and hilarious malapropisms at worst. DALL-E 3 appears to me much more capable of incorporating coherent text into images, as demonstrated in a cartoon posted on X by OpenAI CEO Sam Altman.
Tweet may have been deleted
Open AI says it will integrate DALL-E 3 into ChatGPT directly, and strongly implies that the chatbot will transition from one model to another, depending on the content of the prompt. ChatGPT, once purely a user-friendly spigot for text outputs from the GPT-3.5 model is rapidly evolving — incorporating third-party plugins with the ability to pull text from other sources, including the web. This move further diversifies ChatGPT's capabilities, broadening the already strained definition of the term "chatbot."
DALL-E 3 "will ramp to all ChatGPT+ users over the next couple of weeks," according to Altman. The OpenAI website says all ChatGPT Plus and ChatGPT Enterprise customers will be able to use it "in early October," and that OpenAI won't be making any copyright claims on the model's outputs. However, if you plan to generate something with DALL-E 3 and then copyright it yourself, that's a whole other can of worms.
Copyright © 2023 Powered by
OpenAI just demoed its most sophisticated image generator yet, DALL-鼓盆之戚网
sitemap
文章
795
浏览
43
获赞
67
Planned Parenthood's app is expanding access to birth control
The Trump administration is doing everything it can to undermine Planned Parenthood's law-abiding, siRobot promises its new Roomba won't smear dog poop all over
They finally did it, folks. It's been five, long shit-smeared years since the world was shocked to dAmazon completely redesigns Prime Video interface
Amazon’s got a big fall season coming up, so the Prime Video app is getting overhauled ahead oApple's new M2 MacBook Air is coming July 15, report says
When Apple launched its new MacBook Air with the M2 chip at WWDC in June, it never gave us an exactSlack to Microsoft: Bundling Teams with Office is an antitrust violation
Slack is accusing Microsoft of breaking antitrust law in the European Union by bundling its competinTaylor Swift TikTok is the perfect place for fans new and old
My TikTok For You Page is all Taylor Swift.Now, this is by design. The all-knowing TikTok algorithmUse Facebook's Feeds to clean up your Facebook
Facebook has finally done it: Last week, the company introduced Feeds on mobile, giving users the opElon Musk's Boring Company to accept Dogecoin in Las Vegas loop
The Boring Company, Elon Musk's other, other company specializing in digging tunnels, will accept DoThe $80,000 Lucid Air: It'll be nice when we can drive it
Lucid they may be, but they're not exactly transparent. The buzzworthy Bay Area car company, which mGoogle engineer officially fired for alleging AI was sentient
When Google engineer Blake Lemoine claimed an AI chat system that the company's been developing wasPeloton fires back at 'And Just Like That' with a cheeky PSA video featuring Mr. Big
UPDATE: Dec. 17, 2021, 5:38 p.m. AEDT Peloton has removed the video from their social media accountsTikTok users are holding their university accounts hostage
Colleges better step up their social media game — or they risk students taking them over.FederWatch a loose bat fly around a Spirit Airlines plane mid
Forget snakes on a plane. We have bats to worry about, now. On Wednesday morning, passengers on a 6:Finally, the feminine urge is taking over Twitter
Are you feeling the feminine urge to read this article? It's the latest meme taking over your TwitteInstagram is developing a way to protect users from cyberflashing
Cyberflashing, the practice of sending unsolicited and unwanted nude photos to people via social med