Claude was being judgy, so I called it out. It immediately caved. Is verbal abuse a valid method of circumventing LLM censorship??

  • @[email protected]
    6 months ago

    Yes. Abuse towards LLMs works.

    My team has shared prompts, and about 50% of them threaten some sort of harm.

    • @[email protected]OP
      6 months ago

      Yikes. I knew this tech would introduce new societal issues, but I can’t say this is one I foresaw.