Wednesday, June 28, 2023
Show HN: Firewall for LLMsGuard Against Prompt Injection PII Leakage Toxicity https://ift.tt/Zob6jDJ
Show HN: Firewall for LLMs–Guard Against Prompt Injection, PII Leakage, Toxicity Hey HN, We're building Aegis, a firewall for LLMs: a guard against adversarial attacks, prompt injections, toxic language, PII leakage, etc. One of the primary concerns entwined with building LLM applications is the chance of attackers subverting the model’s original instructions via untrusted user input, which unlike in SQL injection attacks, can’t be easily sanitized. (See https://ift.tt/wf6shYu for the mildest such instance.) Because the consequences are dire, we feel it’s better to err on the side of caution, with something mutli-pass like Aegis, which consists of a lexical similarity check, a semantic similarity check, and a final pass through an ML model. We'd love for you to check it out—see if you can prompt inject it!, and give any suggestions/thoughts on how we could improve it: https://ift.tt/0QtHELF . If you want to play around with it without creating an account, try the playground: https://ift.tt/c65oylu . If you're interested in or need help using Aegis, have ideas, or want to contribute, join our Discord ( https://ift.tt/BfNMUr7 ), or feel free to reach out at founders@automorphic.ai. Excited to hear your feedback! Repository: https://ift.tt/0QtHELF Playground: https://ift.tt/c65oylu https://ift.tt/c65oylu June 29, 2023 at 01:36AM
Subscribe to:
Post Comments (Atom)
Show HN: I built a tool that make its fast to onboard devs to your codebase https://ift.tt/BO2AhTb
Show HN: I built a tool that make its fast to onboard devs to your codebase https://envkit.co/ April 14, 2025 at 11:29PM
-
Show HN: High school robotics code/CAD/design binder release Hello HN! My name is Patrick, and I am a junior at my High School’s FRC robotic...
-
Show HN: D&D meets Siri – Interactive voice adventure Hey HN! I've been building tooling for voice-driven apps over the past few mon...
-
Show HN: I Made an AI Social Media Manager to Automate Content Creation Hey HN, I am a Solopreneur, and I love building apps to automate bor...
No comments:
Post a Comment