A Survey of AI Text-to-Image and AI Text-to-Video Generators

November 10, 2023 · The Cartographer · 🏛 2023 4th International Conference on Artificial Intelligence, Robotics and Control (AIRC)

📚 The Cartographer
Survey/review paper: maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of AI Text-to-Image and AI Text-to-Video Generators"

Evidence collected by the PWNC Scanner

Authors: Aditi Singh
arXiv ID: 2311.06329
Category: cs.CV (Computer Vision)
Cross-listed: cs.AI, cs.CL, cs.LG, eess.IV
Citations: 36
Venue: 2023 4th International Conference on Artificial Intelligence, Robotics and Control (AIRC)
Last Checked: 2 days ago
Abstract
Text-to-Image and Text-to-Video AI generation models are revolutionary technologies that use deep learning and natural language processing (NLP) techniques to create images and videos from textual descriptions. This paper investigates cutting-edge approaches in the discipline of Text-to-Image and Text-to-Video AI generations. The survey provides an overview of the existing literature as well as an analysis of the approaches used in various studies. It covers data preprocessing techniques, neural network types, and evaluation metrics used in the field. In addition, the paper discusses the challenges and limitations of Text-to-Image and Text-to-Video AI generations, as well as future research directions. Overall, these models have promising potential for a wide range of applications such as video production, content creation, and digital marketing.

📜 Similar Papers

In the same crypt: Computer Vision

🌅 Old Age

Fast R-CNN

Ross Girshick

cs.CV · 🏛 ICCV · 📚 27.7K cites · 11 years ago