Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers Paper • 2012.15840 • Published Dec 31, 2020 • 3
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT Paper • 2402.15746 • Published Feb 24, 2024
VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation Paper • 2502.07531 • Published Feb 11 • 14
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context Paper • 2407.09774 • Published Jul 13, 2024