The thing is, the “good” models can’t reconstruct the image in detail. It’s considered a sign of “overfitting” if you reconstruct the input exactly. Even if you put the exact query that was associated with that image, you’ll get the weighted average (feature-wise) image associated with the query. This applies to all like machine learning models without loss of generality.
Sure, but I could write a program to spew out an unbounded number of images containing random pixels. It could create an image that is identical to a copyrighted image, but if I just keep that image on my hard drive, have I violated copyright? I don't think I would be, but if I started distributing them, yes I would.