NeoBabel: A Multilingual Open Tower for Visual Generation
Select an example to see Input, Mask, and Output.
Input
Mask
Output
The model extends the image based on different left/right prompts.
Left Prompt
...
Right Prompt
...
One prompt, multiple languages.
Multilingual Prompt:
English Translation: