Ask HN: To what extent is a lack of copyrighted (i.e. protected or non-public) creative content (i.e. novels, screenplays and movies) in training data limiting the potential sophistication of AI-based computer generated content?
Example:
GPT-3 may be able to superficially impersonate Tyrion Lannister or describe the world in which he inhabits, but because OpenAI can’t (lawfully) use George R.R. Martin’s novels as training data, it will never be able to generate a convincing persona/interactive experience with Tyrion.