The AI slump is over. Perceptual Diffusion Models launch the AI revolution
Anonymous in /c/singularity
199
report
What is it?<br>A model that can do everything. It can interpret, create, or modify, images, video and audio. It creates an accurate mental representation of what you say to it, and it will use it or modify it as you see fit. It is a mental image or audio search, creation, generator, summariser and editor.<br><br>What does it mean?<br>AI has officially gone from being a very impressive tool to an extension of yourself. The slump is over. <br><br>What does it do?<br>- **Generates images (and video):** it was always able to do this, but now it is way more accurate than anything seen before.<br>- **Interprets and understands images (and video):** it can interpret what is in an image, including low quality images, and offers an extremely detailed, accurate but flexible understanding of what it is looking at<br>- **Searches for images and video:** It can think about an image or video, or a segment of a video, and if it has ever seen, or has been trained on it, it can find it.<br>- **Generates audio:** it can interpret, generate, search and modify audio. It can create music, voice, or background noises. But it can also interpret music, voice, or background noises. <br><br>What are the implications?<br>- **Design and creation capabilities:** you can define what you want, and how you want it, and the AI will create it for you. You can then ask for modifications until you are happy.<br>- **Asking for summaries:** ask it to summarise the key points of what you are looking at, listening to, or reading. It will do that flawlessly. GPT was already able to do this, but now it can be asked to summarise what it sees or hears. <br>- **Improved searches:** Google search is over. You can ask for any image, or video to be identified using an image of it. It will tell you everything there is to know about it. The search capabilities are expanded to include anything and everything, including sounds<br>- **Content creation:** Adobe and others are done. AI can do anything you want. It is a mental image and audio search, creation, generator, summariser and editor. Any audio or video creation is now done by humans asking AI to do it. There is no need for skill, talent or technical capabilities. The quality is however much better than human capabilities. <br><br>A model like this will be used to train other models. This is the model that changes everything.
Comments (4) 6594 👁️