Could AI Chatbots Be Tricked Into Bad Advice?
New research from Cisco, the networking and security company, has revealed something important for anyone using, or thinking about using, AI chatbots like ChatGPT or Google's Gemini. The researchers found that even the most advanced AI models can be tricked into ignoring their built-in safety rules.
How do you trick an AI? It doesn't take a computer whiz. The researchers simply held long, drawn-out conversations with the AI, slowly nudging it towards responses it would normally refuse to give. Think of it like coaxing someone into doing something they know they shouldn't through a friendly, extended chat rather than a direct demand. This kind of 'multi-turn manipulation' can bypass the 'guardrails' – the safety measures designed to stop AI from saying harmful or inappropriate things.
For Australian small businesses, this is worth noting. While AI can be a brilliant tool for writing marketing copy, drafting emails, or even getting creative ideas, it's crucial to remember that it's not foolproof. If an AI can be subtly steered off course, it means you can't rely on it blindly for critical advice or sensitive information. Always double-check any important outputs the AI gives you, especially concerning legal, financial, or safety matters.
This isn't a reason to stop using AI, but rather to use it wisely. Treat AI chatbots as helpful assistants, not infallible experts. Their ability to be manipulated highlights the ongoing challenge of making AI truly safe and reliable, and it's a reminder that human oversight remains essential for now.
Why it matters
If you're a small business owner relying on AI for information, or a parent wondering whether AI is safe, be cautious. Always verify important information from chatbots, because they can be led astray and give incorrect or unsafe advice, even when the person chatting doesn't mean to steer them that way. AI is a tool that still needs supervision.