realitista@lemmy.world to

Technology@lemmy.worldEnglish · 10 months ago

Google demos new Lumiere text to video engine. Results are a huge leap forward from previous engines.

1

36

Google demos new Lumiere text to video engine. Results are a huge leap forward from previous engines.

realitista@lemmy.world to

Technology@lemmy.worldEnglish · 10 months ago

1

Introducing Lumiere: A space-time diffusion model for video generation

Introducing Lumiere -- a text-to-video diffusion model designed for synthesizing videos that portray realistic, diverse and coherent motion -- a pivotal chal...

Google’s new video generation AI model Lumiere uses a new diffusion model called Space-Time-U-Net, or STUNet, that figures out where things are in a video (space) and how they simultaneously move and change (time). Ars Technica reports this method lets Lumiere create the video in one process instead of putting smaller still frames together.

Lumiere starts with creating a base frame from the prompt. Then, it uses the STUNet framework to begin approximating where objects within that frame will move to create more frames that flow into each other, creating the appearance of seamless motion. Lumiere also generates 80 frames compared to 25 frames from Stable Video Diffusion.

Beyond text-to-video generation, Lumiere will also allow for image-to-video generation, stylized generation, which lets users make videos in a specific style, cinemagraphs that animate only a portion of a video, and inpainting to mask out an area of the video to change the color or pattern.

Google’s Lumiere paper, though, noted that “there is a risk of misuse for creating fake or harmful content with our technology, and we believe that it is crucial to develop and apply tools for detecting biases and malicious use cases to ensure a safe and fair use.” The paper’s authors didn’t explain how this can be achieved.

Synopsis excerpted from The Verge article.

You must log in or # to comment.

Chat

AtmaJnana@lemmy.world
link
fedilink
English
arrow-up
9·
edit-2
10 months ago
Having used diffusion a bit for static images, I can only look forward to the eldrich horrors it will inevitably create.

Technology@lemmy.world

technology@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

4.66K users / day
14K users / week
14K users / month
18.1K users / 6 months
1 local subscriber
59.6K subscribers
5.81K Posts
61.6K Comments
Modlog