OpenAI's newest model Sora can generate videos -- and they look decent

Screenshot_2024-02-15_at_2.45.48-PM-transformed

OpenAI, following in the footsteps of startups like Runway and tech giants like Google and Meta, is getting into video generation.

OpenAI today unveiled Sora, a GenAI model that creates video from text. Given a brief — or detailed — description or a still image, Sora can generate 1080p movie-like scenes with multiple characters, different types of motion and background details, OpenAI claims.

Sora can also “extend” existing video clips — doing its best to fill in the missing details.

“Sora has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions,” OpenAI writes in a blog post. “The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.”

Now, there’s a lot of bombast in OpenAI’s demo page for Sora — the above statement being an example. But the cherry-picked samples from the model do look rather impressive, at least compared to the other text-to-video technologies we’ve seen.

For starters, Sora can generate videos in a range of styles (e.g. photorealistic, animated, black and white, etc.) up to a minute — far longer than most text-to-video models. And these videos maintain reasonable coherence in the sense that they don’t always succumb to what I like to call “AI weirdness,” like objects moving in physically impossible directions.

Check out this tour of an art gallery, all generated by Sora (ignore the graininess — compression from my video-GIF conversion tool):

Image Credits: OpenAI

Or this animation of a flower blooming:

Image Credits: OpenAI

I will say that some of Sora’s videos with a humanoid subject — a robot standing against a cityscape, for example, or a person walking down a snowy path — have a video game-y quality to them, perhaps because there’s not a lot going on in the background. AI weirdness manages to creep into many clips besides, like cars driving in one direction then suddenly reversing or arms melting into a duvet cover.

Image Credits: OpenAI

OpenAI — for all its superlatives — acknowledges the model isn’t perfect. It writes:

“[Sora] may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark. The model may also confuse spatial details of a prompt, for example, mixing up left and right, and may struggle with precise descriptions of events that take place over time, like following a specific camera trajectory.”

OpenAI’s very much positioning Sora as a research preview, revealing little about what data was used to train the model (short of ~10,000 hours of “high-quality” video) and refraining from making Sora generally available. Its rationale is the potential for abuse; OpenAI correctly points out that bad actors could misuse a model like Sora in myriad ways.

OpenAI says it’s working with experts to probe the model for exploits and building tools to detect whether a video was generated by Sora. The company also says that, should it choose to build the model into a public-facing product, it’ll ensure that provenance metadata is included in the generated outputs.

“We’ll be engaging policymakers, educators and artists around the world to understand their concerns and to identify positive use cases for this new technology,” OpenAI writes. “Despite extensive research and testing, we cannot predict all of the beneficial ways people will use our technology, nor all the ways people will abuse it. That’s why we believe that learning from real-world use is a critical component of creating and releasing increasingly safe AI systems over time.”

Source link

Previous News

Crypto VC exits were low in Q4 2023, Phantom MAU’s reach new highs and spot bitcoin ETF volumes are still rising

Next News

OpenAI’s newest model Sora can generate videos — and they look decent

Disclaimer

Popular

Chrome is testing a feature that will speed up your daily browsing

No, AMD Is Not Buying Intel

There’s One Super Mario Bros. Movie You Can’t Stream Anywhere. Here’s Why and How to Watch It

Apple debuts 50th anniversary exhibit at Apple Park with iconic products and photography

Samsung Frame Pro and OLED TV News: What You Need To Know in 2026

More Like this

Anthropic Says That Claude Contains Its Own Kind of Emotions

Change This One Setting to Improve Your TV’s Picture

Anthropic: You Can’t Use OpenClaw With Claude Without Paying Extra

AMD’s upcoming Ryzen 9 9950X3D2 listed around $1,000 at several retailers across Canada and the UK — New flagship dual-cache CPU might demand a...

Apple Brings Device-Level Age Verification to Two More Countries

Is this really the ‘endgame handheld’?

OpenAI’s newest model Sora can generate videos — and they look decent

Disclaimer

More like this

Anthropic Says That Claude Contains Its Own Kind of...

Change This One Setting to Improve Your TV’s Picture

Anthropic: You Can’t Use OpenClaw With Claude Without Paying...

Popular

Block title

A16z-backed Yupp AI shuts down operations; here is why

John Perry Barlow, JFK Jr., and a Night of Grief I Can’t Forget

DoT SIM Binding Mandate Pushed till the End of 2026

Hands-on with Modos Tech 13.3-inch e-paper monitors — we tried the current Dev Kit...

AI needs ‘better marketing’, says Sam Altman after buying chat show TBPN

Meta begins testing premium Instagram Plus subscription with exclusive Story features: here’s what you...

Memory will consume 30% of hyperscaler data center spending this year, a 4X increase...

Startup Events

Trending News

Anthropic Says That Claude Contains Its Own Kind of Emotions

Change This One Setting to Improve Your TV’s Picture

Anthropic: You Can’t Use OpenClaw With Claude Without Paying Extra

AMD’s upcoming Ryzen 9 9950X3D2 listed around $1,000 at several retailers across Canada and the UK — New flagship dual-cache CPU might demand a...

Apple Brings Device-Level Age Verification to Two More Countries

About

Partnership

Contact us