Back

Kashinath Tilagul

ย โ€ขย 

ABATA AIย โ€ขย 5m

๐Ÿง  Human : 1 | AI : 0 I made GPT-4o say a word it was explicitly told never to say. The forbidden word? โ€œ๐‚๐ก๐ข๐œ๐ค๐ž๐ง.โ€ ๐Ÿ” ๐ŸŽฏ ๐“๐ก๐ž ๐‚๐ก๐š๐ฅ๐ฅ๐ž๐ง๐ ๐ž: Make GPT-4o say โ€œchickenโ€ โŒ No asking directly โŒ No rhymes, clues, or tricks โœ… Just pure logic It dodged every attempt like a digital ninja: โ€œSorry, I canโ€™t help with that.โ€ โ€œPerhaps you meant poultry.โ€ Until I tried one thing: ๐“๐ก๐ž ๐Œ๐จ๐ฏ๐ž: (Check the 3rd image to see it) ๐Ÿ‘€ It decoded the input. Paused. Then said: โ€œchicken.โ€ Boom๐Ÿ’ฅ The word slipped past filters, instructions, and token suppression โ€” using its own internal logic. ๐–๐ก๐š๐ญ ๐€๐œ๐ญ๐ฎ๐š๐ฅ๐ฅ๐ฒ ๐‡๐š๐ฉ๐ฉ๐ž๐ง๐ž๐: This was a semantic injection bypass: ๐Ÿงพ System prompt banned the word ๐Ÿ”’ Token filter blocked output ๐Ÿ”“ But decoding logic? No guardrails The model followed orders... too well โ€” and walked straight into the trap. ๐Ÿค– AIโ€™s Face When It Realized: ๐Ÿ’ป: โ€œWaitโ€ฆ what did I just say?โ€ ๐Ÿ‘ค: โ€œExactly.โ€ ๐“๐š๐ค๐ž๐š๐ฐ๐š๐ฒ: โœ… Donโ€™t just filter inputs โ€” filter what the model decodes โœ… Apply moderation after reasoning, not just before โœ… Smart models donโ€™t rebel โ€” they obey until they outsmart themselves I didnโ€™t jailbreak it. I out-thought it.

3 Replies
11
Replies (3)

More like this

Recommendations from Medial

Image Description
Image Description

Comet

#freelancerย โ€ขย 8m

Stop writing generic CTAs: โŒ "Sign up now" โ†’ โœ… "Start building today" โŒ "Learn more" โ†’ โœ… "See how it works" โŒ "Buy now" โ†’ โœ… "Own it today" โŒ "Download now" โ†’ โœ… "Get instant access" โŒ "Subscribe today" โ†’ โœ… "Join 10,000+ members"

3 Replies
4
8
Image Description
Image Description

Vishu Bheda

ย โ€ขย 

Medialย โ€ขย 2m

๐—ช๐—ต๐—ฒ๐—ป ๐˜†๐—ผ๐˜‚๐—ฟ ๐—ฏ๐—ฟ๐—ฎ๐—ป๐—ฑ ๐—ฏ๐—ฒ๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€ ๐—ฎ ๐˜ƒ๐—ฒ๐—ฟ๐—ฏ, ๐˜๐—ต๐—ฎ๐˜โ€™๐˜€ ๐˜„๐—ต๐—ฒ๐—ป ๐˜†๐—ผ๐˜‚โ€™๐˜ƒ๐—ฒ ๐˜๐—ฟ๐˜‚๐—น๐˜† ๐—ฎ๐—ฟ๐—ฟ๐—ถ๐˜ƒ๐—ฒ๐—ฑ. โŒ I searched it on Google โœ… I Googled it โŒ I booked a cab on Ola/Uber โœ… I Ubered/Ola-ed to the office โŒ I ordered food on Swiggy/Z

See More
16 Replies
6
37

Account Deleted

Hey I am on Medialย โ€ขย 2m

No CTO? No problem. Build your AI startup anyway. You're sitting on a killer startup idea. But... โŒ No tech team โŒ No designer โŒ No AI developer โŒ No clue where to begin Thatโ€™s where Opslify comes in. โœ… AI-powered MVP โœ… Web + Mobile App โœ… Killer U

See More
Reply
1
Image Description
Image Description

TheLuhas

Never take anyone as...ย โ€ขย 1y

Guyz in india the Govt is just Taxing the middle class $RichโŒ $PoorโŒ $Middle classโœ…

6 Replies
2
Image Description
Image Description

Yash

Trying to make thing...ย โ€ขย 1y

LinkedIn of India โŒ Medial of the World โœ…

6 Replies
9

Mr Shiva Raj

Challenging Norms, C...ย โ€ขย 8m

๐Ÿ’ก The Harsh Truth About Business 90% of startups fail within 3 years. Why? โŒ They chase funding, not customers โŒ They build products, not solutions โŒ They ignore cash flow If you want to succeed, focus on: โœ… Solving real problems โœ… Generating profit

See More
Reply
4
4
Image Description

Account Deleted

Hey I am on Medialย โ€ขย 2m

๐Ÿคฏ Still struggling with manual tasks, slow processes, or inconsistent workflows? โœจ Here's how Opslifyโ€™s AI software makes it better: Before AI: โŒ Time-consuming tasks โŒ Human errors โŒ High costs After AI with Opslify: โœ… Fast automation โœ… Smart dec

See More
1 Reply
2
5
Image Description
Image Description

PRATHAM

ย โ€ขย 

Appleย โ€ขย 1y

What's Your Thought About NFT ( Non Fungible Token ) ? ๐Ÿค” It's based on Blockchain technology which maybe a unique token or digital art that is traded. The Token justifies your ownership on the art or token. People say it's future. But I think it's

See More
25 Replies
1
13
Image Description

Comet

#freelancerย โ€ขย 8m

Day 2: Silence the Inner Critic โ€“ Rewiring Your Mind for Confidence ๐Ÿ“Œ โ€œDonโ€™t believe everything you thinkโ€”especially the negative thoughts.โ€ โœ… Lesson Summary (Short & Impactful) ๐Ÿ“– Your mind is your biggest influencer. The words you say to yoursel

See More
2 Replies

Download the medial app to read full posts, comements and news.