尝试OpenAI Sora
从文本/图像创建视频，生成循环视频，向前和向后延长视频
Sora 上线时第一个知道！

关于OpenAI Sora

什么是Sora

OpenAI的文本到视频模型。Sora可以生成长达一分钟的视频，同时保持视觉质量并遵循用户的文本指令。

是帮助用户更好地利用数字货币和区块链技术。

Sora作为可以理解和模拟现实世界的模型的基础，帮助人们解决需要现实世界互动的问题。

进展

仅限红队成员和受邀的视觉艺术家、设计师和电影制作人。

特点

支持多个角色、特定动作类型、主题和背景细节的准确呈现；模型了解这些事物在现实世界中的存在方式，在单个视频中进行多次拍摄。

限制

在准确模拟复杂物理过程方面存在困难，空间细节混乱，物体和角色的突然出现，物理建模不准确和物体变形不自然。

安全

与红队合作进行对抗性测试，以识别和解决模型中的安全问题，构建工具来帮助检测使用检测分类器和C2PA元数据的误导内容。

展示 - 每日更新

Prompt

a brown and white border collie stands on a skateboard, wearing sunglasses

Prompt

1st person view taking the longest zip-line in the world through Dubai

Prompt

Style: Modern cinematic realism with vivid visual accents. A summer evening. A group of young friends is gathered on a rooftop, overlooking the glowing city lights. They’re laughing, chatting, and enjoying the vibe with soft music playing in the background. The camera slowly zooms in on a bottle of YOMI beer on the table. Cold condensation drips down the glass, highlighting the vibrant golden hue of the drink. The focus shifts to a hand reaching for the bottle. The camera follows the motion, capturing the crisp sound of the bottle cap popping open. A sip. A deep breath. A smile. In the background, a voice speaks: ‘YOMI — the taste of the moment. Capture your inspiration.’ Final scene: A bottle of YOMI stands against the backdrop of a setting sun, its golden light refracting through the beer. The brand logo and tagline appear on screen: ‘YOMI. The time of your story.

Prompt

The camera follows behind a white vintage SUV with a black roof rack as it speeds up a steep dirt road surrounded by pine trees on a steep mountain slope, dust kicks up from its tires, the sunlight shines on the SUV as it speeds along the dirt road, casting a warm glow over the scene

Prompt

POV, ACTION SHOTS, JUMPCUTS, Montage,, tracking shot, from the side hyperspeed, 30x speed, cinematic atmosphere, person having a futuristic neon beachpunk in punkexosuit form around them, suiting up, glow and light, Phanto-Cinematic still, beachpunk gigadream, kodak etkar 100, hypersurrealist retrowave religiouscience fiction, Southern California, emocore, hyperfuturistic, beachpunk ISO: T2.8, compression: ARRIRAW, lighting_conditions: ultraviolet blacklight, backlit,

Prompt

Close-up shot of a freeride skier carving through deep, untouched powder snow during a vibrant sunset in the Alps. The camera starts low, tracking alongside the skier as they make a powerful turn, sending a spray of fine snow into the air. The spray catches the warm golden-pink light of the setting sun, creating a stunning glow and sparkling reflections. The camera then pans upward and slightly rotates, revealing the majestic alpine peaks bathed in the sunset’s hues. The skier continues gracefully downhill, leaving a glowing trail of light and snow in their wake as the scene fades into the serene mountain landscape.

Prompt

An elegant scene set in Egypt featuring a female anthropomorphic fox character. She has vibrant red-orange fur and vivid green eyes, posing gracefully near ancient Egyptian ruins with the iconic pyramids in the background. She is wearing a flowing, semi-transparent, culturally inspired robe with golden patterns. The setting includes sandy terrain, scattered palm trees, and hints of ancient stone structures adorned with hieroglyphics. The sky is clear, and the sun casts a warm glow over the scene, emphasizing the mystique of the Egyptian desert landscape.

Prompt

A stylish woman walks down a Seoul street filled with warm glowing neon and animated city signage. She wears a black leather jacket, a long red dress, and black boots, and carries a black purse. She wears sunglasses and red lipstick. She walks confidently and casually. The street is damp and reflective, creating a mirror effect of the colorful lights. Many pedestrians walk about.

Prompt

Company	Generation Type	Max Length	Extend?	Camera Controls? (zoom, pan)	Motion Control? (amount)	Other Features	Format
Runway	Text-to-video, image-to-video, video-to-video	4 sec	Yes	Yes	Yes	Motion brush, upscale	Website
Pika	Text-to-video, image-to-video	3 sec	Yes	Yes	Yes	Modify region, expand canvas, upscale	Website
Genmo	Text-to-video, image-to-video	6 sec	No	Yes	Yes	FX presets	Website
Kaiber	Text-to-video, image-to-video, video-to-video	16 sec	No	No	No	Sync to music	Website
Stability	Image-to-video	4 sec	No	No	Yes		WebsiteLocal model, SDK
Zeroscope	Text-to-video	3 sec	No	No	No		Local model
ModelScope	Text-to-video	3 sec	No	No	No		Local model
Animate Diff	Text-to-video, image-to-video, video-to-video	3 sec	No	No	No		Local model
Morph	Text-to-video	3 sec	No	No	No		Discord bot
Hotshot	Text-to-video	2 sec	No	No	No		Website
Moonvalley	Text-to-video, image-to-video	3 sec	No	Yes	No		Discord bot
Deforum	Text-to-video	14 sec	No	Yes	No	FX presets	Discord bot
Leonardo	Image-to-video	4 sec	No	No	Yes		Website
Assistive	Text-to-video, Image-to-video	4 sec	No	No	Yes		Website
Neural Frames	Text-to-video, image-to-video, video-to-video	Unlimited	No	No	No	Sync to music	Website
MagicHour	Text-to-video, image-to-video, video-to-video	Unlimited	No	No	No	Face swap, sync to music	Website
Vispunk	Text-to-video	3 sec	No	Yes	No		Website
Decohere	Text-to-video, Image-to-video	4 sec	No	No	Yes		Website
Domo Al	Image-to-video, video-to-video	3 sec	No	No	Yes		Discord bot

博客

人们在x上谈论Sora

SoraAI by OpenAI is wild.

These are 100% generated only from text and take just 1 minute 🤯

10 wild examples ( 2nd is WOW ) pic.twitter.com/NLetbJVa2v
— Alamin (@iam_chonchol) February 18, 2024

If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all… pic.twitter.com/pRuiXhUqYR
— Jim Fan (@DrJimFan) February 15, 2024

"this close-up shot of a futuristic cybernetic german shepherd showcases its striking brown and black fur..."

Video generated by Sora. pic.twitter.com/Bopbl0yv0Y
— Bill Peebles (@billpeeb) February 18, 2024

Sora and Stable Video, text to video compare. pic.twitter.com/pZzSeSXPtN
— Retropunk (@RetropunkAI) February 17, 2024

OpenAI's Sora is the most advanced text-to-video tool yet. 💡

It can generate compellingly realistic characters, create multiple dynamic shots in a single video, with accurate details of both subjects and background.

Here's the 10 best generations so far
🧵👇 pic.twitter.com/FHp0cxt0Ll
— Escher (@Escher_AI) February 16, 2024

OpenAI's Sora is going to change marketing forever, enabling anyone to unleash his inner creativity.

Check this 100% AI-generated video of Mammoth generated with the new "text-to-video" OpenAI model: pic.twitter.com/DcDGPjpBXC
— William Briot (@WilliamBriot) February 15, 2024

"a photorealistic video of a butterfly that can swim navigating underwater through a beautiful coral reef"

Video generated by Sora pic.twitter.com/nebCKLa09U
— Tim Brooks (@_tim_brooks) February 17, 2024

Another Sora video, Sora can generate multiple videos side-by-side simultaneously.

This is a single video sample from Sora. It is not stitched together; Sora decided it wanted to have five different viewpoints all at once! pic.twitter.com/q2rfxh61CQ
— 🅱️WhiteAfricanSpaceJesus (@zespacejesus) February 18, 2024

Sora can also generate stories involving a sequence of events, although it's far from perfect.

For this video, I asked that a golden retriever and samoyed should walk through NYC, then a taxi should stop to let the dogs pass a crosswalk, then they should walk past a pretzel and… pic.twitter.com/OhqVFqR5vA
— Bill Peebles (@billpeeb) February 17, 2024

https://t.co/uCuhUPv51N pic.twitter.com/nej4TIwgaP
— Sam Altman (@sama) February 15, 2024

https://t.co/P26vJHlw06 pic.twitter.com/AW9TfYBu3b
— Sam Altman (@sama) February 15, 2024

https://t.co/rPqToLo6J3 pic.twitter.com/nPPH2bP6IZ
— Sam Altman (@sama) February 15, 2024

https://t.co/WJQCMEH9QG pic.twitter.com/Qa51e18Vph
— Sam Altman (@sama) February 15, 2024

a wizard wearing a pointed hat and a blue robe with white stars casting a spell that shoots lightning from his hand and holding an old tome in his other hand
— biden or buster (@willofdoug) February 15, 2024

常见问题解答

Sora是由OpenAI开发的AI模型，可以根据文本指令创建逼真且富有想象力的视频场景。它旨在模拟运动中的物理世界，生成长达一分钟的视频，同时保持视觉质量并遵循用户的提示。
Sora是一个扩散模型，它从类似静态噪音的视频开始，并逐步通过多个步骤去除噪音来转换它。它使用了类似于GPT模型的变压器架构，并将视频和图像表示为称为补丁的较小数据单元的集合。
Sora可以生成各种视频，包括具有多个角色的复杂场景、特定类型的动作以及主题和背景的精确细节。它还可以将现有静止图像动画化，或通过填补缺失的帧来延长现有视频。
Sora可能会在准确模拟复杂场景的物理、理解特定的因果关系实例以及在时间上保持空间细节方面遇到困难。有时会产生物理上不合理的运动或混淆空间细节。
OpenAI正在与红队合作对模型进行对抗性测试，并正在构建工具来检测误导性内容。他们计划在未来将C2PA元数据纳入其中，并利用其其他产品中现有的安全方法，如文本分类器和图像分类器。
Sora目前可供红队人员评估危害或风险的关键领域，并为视觉艺术家、设计师和电影制作人提供反馈，以推动创意专业人士的模型发展。
如果您是一名创意专业人士，您可以通过OpenAI申请访问Sora。一旦获得访问权限，您可以使用该模型根据您的文本提示生成视频，为您的创意项目增添独特而富有想象力的场景。
Sora作为能够理解和模拟现实世界的模型的基础，OpenAI认为这是实现人工通用智能（AGI）的重要里程碑。
Sora对语言有着深刻的理解，能够准确解释文本提示，并生成生动的角色和场景，表达丰富的情感。它可以在单个视频中创建多个镜头，同时保持一致的角色和视觉风格。
Sora使用了类似于GPT模型的变压器架构，并将视频和图像表示为称为补丁的较小数据单元的集合。这种数据表示的统一使得模型可以在更广泛范围的视觉数据上进行训练。
通过一次性给模型多帧的预见，Sora可以确保主体即使暂时离开视野，也能保持一致。
Sora使用了来自DALL·E 3的重新字幕技术，这涉及为视觉训练数据生成高度描述性的字幕。这有助于模型更忠实地遵循用户的文本指令在生成的视频中。
OpenAI计划在将Sora整合到其产品之前采取几项安全措施，包括对抗性测试、开发检测分类器，并利用来自其他产品（如DALL·E 3）的现有安全方法。
Sora可以被电影制作人、动画师、游戏开发者和其他创意专业人士使用，以快速高效地生成视频内容、分镜头，甚至用于快速有效地原型设计想法。
OpenAI正在积极与政策制定者、教育工作者和艺术家合作，以了解关注点并确定技术的积极应用案例。他们承认虽然他们无法预测所有有益的用途或滥用，但从现实世界的使用中学习对于随着时间推移创建更安全的人工智能系统至关重要。
OpenAI拥有文本分类器，用于检查和拒绝违反使用政策的文本输入提示，例如请求极端暴力、性内容、仇恨图像或未经授权使用知识产权的内容。
在AI中，“世界模型”指的是一个计算模型，模拟物理世界及其动态，使AI能够理解和预测其中的物体和实体如何相互作用。在Sora的背景下，这意味着该模型已经经过训练，能够生成视频，不仅遵循文本提示，还遵守真实世界的物理定律和行为，如重力、运动和物体相互作用。这种能力对于从文本描述中创建逼真和连贯的视频内容至关重要。

尝试OpenAI Sora
从文本/图像创建视频，生成循环视频，向前和向后延长视频
Sora 上线时第一个知道！

关于OpenAI Sora

什么是Sora

是帮助用户更好地利用数字货币和区块链技术。

进展

特点

限制

安全

展示 - 每日更新

Other AI video products

博客

OpenAI Sora泄露：艺术家批评剥削和不合理的补偿

AI改变了视频制作的永恒命运

OpenAI发布文本转视频工具Sora

你能分辨出什么是真实的吗？- 人工智能生成的视频

人们在x上谈论Sora

常见问题解答

尝试OpenAI Sora从文本/图像创建视频，生成循环视频，向前和向后延长视频Sora 上线时第一个知道！

关于OpenAI Sora

什么是Sora

是帮助用户更好地利用数字货币和区块链技术。

进展

特点

限制

安全

展示 - 每日更新

Other AI video products

博客

OpenAI Sora泄露：艺术家批评剥削和不合理的补偿

AI改变了视频制作的永恒命运

OpenAI发布文本转视频工具Sora

你能分辨出什么是真实的吗？- 人工智能生成的视频

人们在x上谈论Sora

常见问题解答

Sora是什么？

Sora如何工作？

Sora能生成哪些类型的视频？

Sora的一些限制是什么？

OpenAI如何确保Sora内容的安全性？

谁可以访问Sora?

如何在我的创意项目中使用Sora？

Sora在研究方面的未来是什么

Sora如何处理文本提示？

Sora的架构技术细节是什么？

Sora如何确保生成视频中主题的一致性？

在Sora的训练中，重述技术扮演着什么角色？

OpenAI计划如何将Sora整合到其产品中？

Sora在创意产业中的潜在应用有哪些？

使用Sora时的道德考虑是什么？

Sora如何处理潜在风险内容的生成？

在AI和Sora的背景下，&#39;世界模型&#39;是什么？

尝试OpenAI Sora
从文本/图像创建视频，生成循环视频，向前和向后延长视频
Sora 上线时第一个知道！