Text-Based Adventure Game in Java Visual Output

Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking

Abstract: Visual-Language Tracking (VLT) is emerging as a promising paradigm to bridge the human-machine performance gap. For single objects, VLT broadens the problem scope to text-driven video ...

IEEE

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Abstract: Diffusion models have shown excellent performance in text-to-image generation. Nevertheless, existing methods often suffer from performance bottlenecks when handling complex prompts that ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Mono3DVLT: Monocular-Video-Based 3D Visual Language Tracking

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Trending now