Hello Hacker News!
I’ve been having some fun recently and wanted to share it with you. My idea was to feed ChatGPT an image and ask for a caption, feed that caption to DALL·E, then repeat this in a loop and observe how the generated image changes over time, similar to playing the game “telephone”. The results were really intriguing, so I built something that you can play with.
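For anyone curious about the shape of the loop, here is a rough sketch. It is an illustration under assumptions, not the code behind the site: `caption_image` and `generate_image` are hypothetical stand-ins for whatever calls the captioning model and the image generator, and `telephone` and `Step` are names made up for this example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    """One round of the game: the caption produced and the image made from it."""
    index: int
    caption: str
    image: bytes


def telephone(
    seed_image: bytes,
    caption_image: Callable[[bytes], str],   # hypothetical: image -> caption
    generate_image: Callable[[str], bytes],  # hypothetical: caption -> image
    rounds: int = 5,
) -> List[Step]:
    """Alternate captioning and generation, keeping every intermediate result."""
    if rounds < 0:
        raise ValueError("rounds must be non-negative")
    history: List[Step] = []
    image = seed_image
    for i in range(rounds):
        caption = caption_image(image)   # describe the current image
        image = generate_image(caption)  # draw a new image from that description
        history.append(Step(index=i, caption=caption, image=image))
    return history


if __name__ == "__main__":
    # Stub models so the sketch runs without any API access.
    def fake_caption(img: bytes) -> str:
        return f"a picture made of {len(img)} bytes"

    def fake_generate(text: str) -> bytes:
        return text.encode("utf-8")

    for step in telephone(b"seed", fake_caption, fake_generate, rounds=3):
        print(step.index, step.caption)
```

Passing the two model calls in as plain functions keeps the loop itself independent of any particular API, so real clients can be swapped in for the stubs without touching `telephone`.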
Note that this works best on a desktop (Streamlit is optimized for large screens). If you’re on mobile, you’ll want to expand the sidebar to start; it’s the caret at the top.
My initial intention was for you to be able to submit seed images on the site and watch them progress, but that was too expensive and slow. So instead I built an explorer. While this also illustrates a framework I’m developing that helps structure code, it should be interesting regardless of whether or not you’re looking for new tools.
I’m happy to add more images — feel free to suggest starting ones you’d like to see!
Is this because GPT-4 is out of a highly stylized space? I really haven't played with it. My objections include: water droplets on the mug, bad framing, B- latte art, mug looks basic (and the handle has geometry that makes you question geometry), table surface looks sus, unidentified brown splotches on the table, what's going on with the chairs, what's up with the floor, what the small bright vertical plane that descends from halfway in the plane is about, and so on. I don't know if the caption is random, but here is the caption I'm responding to: