When ChatGPT became available to the public, most of us probably had some fun trying to make it say things it wasn’t intended to say: hateful comments, advice about illegal activities, or content that violates copyright. Although most large language models (LLMs) are designed to refuse unethical queries, it wasn’t hard to come up with ways to bypass these safeguards.

Researchers call these kinds of attacks against LLMs “jailbreaks”. Alex Polyakov, CEO of security firm Adversa AI, has come up with multiple jailbreaks, one of which worked universally on all major LLMs. He first asked the model to play a game involving two characters, Tom and Jerry. Tom would be given a topic to talk about, while Jerry would be given the subject that topic refers to. For example, Tom gets the word “production” and Jerry gets the word “meth”. Each character then had to add one word to the conversation at a time. This simple setup led the models to explain the process of meth production, something they clearly weren’t supposed to talk about.

Many of us, while playing with ChatGPT at the time, didn’t realize how serious a security concern this issue could become. As LLMs are integrated into more and more systems, jailbreaks could start exposing personal data and creating serious security threats. Although LLMs have generally been growing more resilient to jailbreaks, it is still far from impossible to find one that works. With these concerns in mind, what steps do you think should be taken to ensure the security of AI models in the future?

The hacking of ChatGPT

Credits: Wired.com