Tuesday, December 6, 2022
Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) https://ift.tt/RrqlhMm
Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) Today I designed a method to prevent users from jailbreaking ChatGPT (for instance, users have generated instructions to produce weapons or illegal drugs, commit a burglary, kill oneself, take over the world as an evil superintelligence, or create a virtual machine which they then can use). The OpenAI team appears to be countering these primarily using prompt engineering or fine-tuning on the ChatGPT model. The idea is to use a second and fully separate, fine-tuned LLM to evaluate prompts before sending them to ChatGPT. You can test this by inserting your successful ChatGPT jailbreaks. Break it for me if you dare! I look forward to seeing your results! https://ift.tt/ZPGMzqa December 6, 2022 at 11:16PM
Subscribe to:
Post Comments (Atom)
Show HN: Comparator - I built a free, open-source app to compare job offers https://ift.tt/aKhbr7x
Show HN: Comparator - I built a free, open-source app to compare job offers https://ift.tt/mRl06AE June 24, 2025 at 05:30AM
-
Show HN: High school robotics code/CAD/design binder release Hello HN! My name is Patrick, and I am a junior at my High School’s FRC robotic...
-
Show HN: D&D meets Siri – Interactive voice adventure Hey HN! I've been building tooling for voice-driven apps over the past few mon...
-
Show HN: I Made an AI Social Media Manager to Automate Content Creation Hey HN, I am a Solopreneur, and I love building apps to automate bor...
No comments:
Post a Comment