The application is constrained, however: without on-prem deployment, you can't show the model your proprietary data to learn from.
As for your application w/ GPT-3, I'd be a little afraid that it might be replaying real data records it saw during training, at least some of the time, rather than generating truly synthetic data.
(Though one of the most interesting ChatGPT blog posts I saw was one where they had gotten it to write an essay with what looked like real citations to scientific papers that were completely fabricated.)
You also make a good point about replicating real data - I think this is another area where more tools need to be developed to safeguard against consequences. In addition to the obvious challenges LLMs pose to plagiarism-detection software, there are definitely opportunities here to develop privacy-detection tools should an org want to use GPT-3 or ChatGPT for synthetic data.
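To make that concrete, here's a minimal sketch of what such a privacy check might look like: flagging "synthetic" records that exactly match, or nearly match, records in the real dataset. The function name, data, and similarity threshold are all illustrative assumptions, not anything from an existing tool.

```python
# Hypothetical memorization check: flag generated records that replay
# (or nearly replay) records from the real source data.
from difflib import SequenceMatcher

def flag_replayed_records(synthetic, real, threshold=0.9):
    """Return (record, similarity) pairs for synthetic records whose
    similarity to any real record meets the threshold (1.0 = exact)."""
    real_set = set(real)  # fast path for exact duplicates
    flagged = []
    for record in synthetic:
        if record in real_set:
            flagged.append((record, 1.0))
            continue
        # Fuzzy comparison catches near-verbatim replays with small edits.
        best = max(
            (SequenceMatcher(None, record, r).ratio() for r in real),
            default=0.0,
        )
        if best >= threshold:
            flagged.append((record, round(best, 2)))
    return flagged

real = [
    "Alice Smith, 1985-03-02, alice@example.com",
    "Bob Jones, 1990-07-15, bob@example.com",
]
synthetic = [
    "Alice Smith, 1985-03-02, alice@example.com",  # exact replay of a real record
    "Carol White, 1978-11-30, carol@example.com",  # genuinely novel record
]

print(flag_replayed_records(synthetic, real))
```

A real tool would need much more than string matching (semantic similarity, column-level re-identification risk, and so on), but even a crude filter like this would catch the worst case of the model emitting a training record verbatim.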
Recently I was reading a book by Peter Diamandis and Steven Kotler about converging technologies that referenced the Luddites' reaction to the invention of the loom: they feared it would destroy jobs, when ultimately the innovation led to more economic opportunity. I think we'll find the fears people are having about LLMs are similar, with opportunities like these to develop more digital infrastructure around their application.