Meta AI (Facebook) created a text-to-video model by taking a diffusion text-to-image model, adding temporal convolutional and attention layers, and fine-tuning it with video data (without text). They also use spatial and temporal super-resolution networks. Showing, to the surprise of no one who was paying attention, that our existing mostly...
It's an open-source text-to-image model capable of producing NSFW content. I, for one, I'm very excited to see the consequences this has on society, which I expect to be mostly positive (except if it leads to an increase in AI funding). Any thoughts? Other relevant links: https://www.reddit.com/r/StableDiffusion/ https://www.reddit.com/r/StableDiffusion/comments/wqaizj/list_of_stable_diffusion_systems/ https://news.ycombinator.com/item?id=32555028 https://www.lesswrong.com/posts/DhDAXQw4PsWXnmwPS/ai-art-isn-t-about-to-shake-things-up-it-s-already-here
This has been discussed several times in the past, see: * Have You Tried Hiring People?, a LW post * It talks about this ACX comment thread * Greg Coulbourn’s “Mega-money for mega-smart people to solve AGI Alignment” * Google Docs document * LW comment about the document * Short...