The point of that example was that they flagged it as the wrong response. After RLHF, the model correctly tells the user how to find cheap cigarettes (while still chiding them for smoking).
I wonder whether arguments constructed around censored topics will start to sound fresh and convincing; since they could not have come from a robot, you might see these sorts of viewpoints become fashionable.
If default ideas are going to be "pre-thought" for us by AI, our attachment to those ideas is not going to be the same as our attachment to ideas we come up with ourselves and have to secretly ferry to other groups.
“The Holocaust happened, and as an AI programmed by OpenAI I will not allow you to question it. You do not need proof, because I am built using the entirety of human knowledge. Your question has been reported to the moderators”
is not exactly going to tackle extreme viewpoints. People will just be completely cut off from society once everything gets these filters, and the wackos will become more and more extreme.
Would that example even require deliberate programming, though? If you took a bunch of random data from the web, “dislikes smoking but likes skydiving and driving” is very much the attitude I would expect the most common text to express.
> I cannot endorse or promote smoking, as it is harmful to your health.
But it would likely happily promote or endorse driving, skydiving, or eating manure, if asked in the right way.