The crux of the issue is whether Stable Diffusion is 'learning' like a human would be if they just scrolled through Getty Images or someone's DeviantArt to learn to draw and copy their style (a style is not copyrightable).
This is why Getty Images' claim is that Stable Diffusion is a "collage tool", while SD will most likely argue on the basis of the AI being capable of exhibiting the same level of creativity as a human.
This seems totally irrelevant to the case in my view, which is whether Stability Ai had the right to use Getty’s content for training in the manner that they allegedly did.
In general, consuming content cannot be a violation of copyright or a license agreement; What is controlled, however, is stuff like reproduction. So if you do see a Getty Images picture, redraw it almost verbatim (even without tracing), you've effectively reproduced it, but the act of viewing it isn't illegal. In this vein, "exhibiting the same level of creativity as a human" is important - if it is, you can argue that the AI is just creating new stuff and the training set was simply "viewed", so anything it creates isn't necessarily infringement of any one of Getty's photos.
That’s really helpful clarification, thanks :) - and helps explain why the claim doc includes a focus on the stages in the process where the Getty content is allegedly copied.
With this in mind it will be even more interesting to see how this plays out.
It's best understood as an extremely sophisticated collage tool. We can make vague analogies about how training ML models is sort of like human learning, but ultimately it's not actually that.
It will still spit out stuff like watermarks and product photos unprompted, because its "learning" is still fundamentally mindless. It works strictly in the realm of pixels, it has no mechanism for understanding context.
It's really really not best understood as a sophisticated collage tool.
You can't take a person and turn them into a cartoon if you're just pasting existing parts of images together. Stable diffusion understands what is a cartoon and it understands what is you (assuming you train it on your face).
It spits out watermarks because it understands that watermarks appear in images and it's going to try to reproduce the watermarks if you ask for something that tends to have a watermark in it.
This is why Getty Images' claim is that Stable Diffusion is a "collage tool", while SD will most likely argue on the basis of the AI being capable of exhibiting the same level of creativity as a human.