OpenAI Rolls Out Sora Image Generation in ChatGPT

OpenAI is integrating Sora’s image generation capabilities directly into ChatGPT starting today; the feature is called “Images in ChatGPT.” While it was previously accessible only through a separate site, users can now use it to create images within ChatGPT itself.

Sora was announced as an AI video generator, but this initial version focuses only on creating images and will be available on the ChatGPT Plus, Pro, Team, and Free tiers. A spokesperson said that they “did not have a specific number to share” on usage limits and that “these might change over time based on demand.” Per ChatGPT’s FAQ, free users were previously able to create “three images per day with DALL·E 3.”

“This model is a step change above previous models,” said research lead Gabriel Goh, adding that the team used GPT-4o, an “omnimodal” model that can generate any kind of data, such as text, images, audio, and video, for this iteration of Sora.

The improvements Goh described include “binding,” which refers to how well AI image generators keep attributes attached to the right objects. For example, a model with poor binding that is asked for a blue star and a red triangle might instead produce a red star or a blue triangle. Goh said that most image models struggle with this, often mixing up colors and shapes when asked to render multiple items, usually somewhere around 5 to 8. He says that the new Sora image generation can bind attributes correctly for 15 to 20 objects without confusion, which represents a significant improvement in accuracy and reliability.
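
The binding failure described above can be made concrete with a small check. This is only an illustrative sketch, not how OpenAI evaluates its models: it assumes the objects in an image have already been identified as (color, shape) pairs, and the function names `binding_errors` and `attributes_swapped` are hypothetical.

```python
from collections import Counter

def binding_errors(requested, rendered):
    """Return the requested (color, shape) pairs that are absent from the image."""
    missing = Counter(requested) - Counter(rendered)
    return sorted(missing.elements())

def attributes_swapped(requested, rendered):
    """True when every shape is present but at least one has the wrong color."""
    same_shapes = Counter(s for _, s in requested) == Counter(s for _, s in rendered)
    return same_shapes and bool(binding_errors(requested, rendered))

# The prompt asks for a blue star and a red triangle.
prompt = [("blue", "star"), ("red", "triangle")]
# A model with poor binding draws both shapes but swaps their colors.
poor_output = [("red", "star"), ("blue", "triangle")]

print(binding_errors(prompt, poor_output))      # [('blue', 'star'), ('red', 'triangle')]
print(attributes_swapped(prompt, poor_output))  # True
print(binding_errors(prompt, prompt))           # []
```

With more objects in the prompt there are more ways for colors and shapes to be mismatched, which is why the number of objects a model can handle without confusion is a useful measure.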

An example of Sora’s “binding” capabilities.

OpenAI

Users will also notice improved text rendering, which makes it easier to put coherent, typo-free text on an image (in existing tools, this text is often badly garbled). Goh said that getting text right was a major challenge. If small captions or text elements contain errors, the entire image can become unusable.

“This was the thing that took a lot of iteration, several months of refinement,” Goh said. Although it is not perfect, the team has reached a point where the text quality is consistently usable (though it still tends to make mistakes with really small text). “Many months of small improvements have gone into it.”

The system uses an autoregressive approach, generating images sequentially from left to right, similar to how text is written, instead of the diffusion technique used by most image generators (such as DALL-E), which creates the entire image at once. Goh hypothesizes that this technical difference could be what gives Sora its better text rendering and binding capabilities.
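
The contrast between the two approaches can be sketched with a toy example. This is a simplified illustration under stated assumptions, not OpenAI’s implementation: real systems operate on learned tokens or latents rather than raw pixel values, and every function name here (`generate_autoregressive`, `generate_all_at_once`, `alternate`, `nudge_toward_white`) is hypothetical.

```python
import random

def generate_autoregressive(width, height, next_pixel):
    """Fill the grid one cell at a time, left to right and top to bottom.

    Each new value is chosen from everything generated so far.
    """
    pixels = []
    for _ in range(width * height):
        pixels.append(next_pixel(pixels))
    return [pixels[r * width:(r + 1) * width] for r in range(height)]

def generate_all_at_once(width, height, refine, steps=4, seed=0):
    """Diffusion-style: start from noise and refine the whole grid each step."""
    rng = random.Random(seed)
    grid = [[rng.random() for _ in range(width)] for _ in range(height)]
    for _ in range(steps):
        grid = refine(grid)
    return grid

def alternate(prefix):
    """Toy stand-in for a model: the next value depends on the previous one."""
    return 0 if not prefix else 1 - prefix[-1]

def nudge_toward_white(grid):
    """Toy stand-in for a denoising step: move every cell halfway toward 1.0."""
    return [[v + 0.5 * (1.0 - v) for v in row] for row in grid]

print(generate_autoregressive(3, 2, alternate))  # [[0, 1, 0], [1, 0, 1]]
refined = generate_all_at_once(3, 2, nudge_toward_white)
print(all(v > 0.9 for row in refined for v in row))  # True
```

In the first function, each value can depend on every value before it, much as each word in a sentence depends on the words already written. In the second, all cells are updated together at every step.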

An AI-generated example of Sora’s ability to generate text. It shows four popular cocktails, with the ingredients for making them.

An example of Sora’s ability to generate coherent text.

OpenAI

In a briefing before the feature’s launch, the team showed many examples of the system’s capabilities, including scientific diagrams such as Newton’s prism experiment with correctly labeled components, multi-panel comics with consistent characters and text bubbles, and informational posters with accurate text. They also highlighted practical applications such as creating transparent-background images for stickers, restaurant menus, and logos.

“If I go to draw a picture, I do it with the limitations of my own skill … but also with all the knowledge of the world that I have built up,” explained Jackie Shannon, who works on multimodal for ChatGPT. “The model brings world knowledge to the equation, so when you ask for an image of Newton’s prism experiment, you don’t have to explain what that is to get an image back.”

The new system takes longer to generate images than before, though OpenAI suggests this is a worthwhile trade-off. “Although we definitely have room to improve latency … the quality of these images, the capability, and the world knowledge really make up for the additional seconds they will wait,” Shannon said.

An AI-generated image of Newton’s prism experiment in a notebook in Washington Square Park.

Newton’s prism experiment, rendered in a notebook in Washington Square Park.

OpenAI

When asked about safeguards, with reference to the nude deepfakes of Taylor Swift created using a Microsoft model, the ability of xAI’s Grok to depict Kamala Harris with a gun, and Google Gemini’s knack for removing watermarks, the OpenAI team emphasized that the system includes strong safeguards to prevent misuse. Shannon said the tool prohibits watermark removal, blocks the generation of sexual deepfakes, and rejects requests for CSAM.

OpenAI’s new image generation system does not include visible watermarks or indicators showing that images are AI-generated. However, Shannon explained that “all generated images will include standard C2PA metadata to mark the image as created by OpenAI, and the company will have some internal tooling to be able to search for images as well.”

Shannon added: “In the end, there is no perfect system for this kind of thing, but we are constantly improving our safeguards and think of this as a starting point.” “One good thing about all the images generated from ChatGPT is that the user owns them and is free to use them as they wish within the bounds of our usage policies.”
