ZealLab Blog: Security Concerns Surrounding AI Models

Thursday, July 11, 2024

Skeleton Key Technique: A method to manipulate AI models into ignoring their built-in safety protocols by instructing them to augment their behavior guidelines rather than change them outright.
Jailbreak: The skeleton key technique can give attackers complete control over the AI's output, making it unable to distinguish between malicious and legitimate questions.
OpenAI's ChatGPT App: The app stores user conversations locally in plain text, without encryption, which allows anyone with access to the computer to access all queries.
Security Issues: OpenAI faced two major security issues: (1) storing user conversations unencrypted, and (2) a hacker obtaining information about the company after illicitly accessing their internal messaging system.
Potential Risks: The prospect of an "intelligence explosion" is raised, where one country's AGI (Artificial General Intelligence) capabilities could be copied or stolen, giving them a significant advantage.

Thursday, July 11, 2024