Back

Rahul Agarwal

Founder | Agentic AI...ย โ€ขย 21d

How can modern AI systems stop giving wrong answers? I've explained 4 guardrails in simple steps below. 1) ๐—ฆ๐—ฎ๐—ณ๐—ฒ๐˜๐˜† ๐—–๐—น๐—ฎ๐˜€๐˜€๐—ถ๐—ณ๐—ถ๐—ฒ๐—ฟ Purpose: detect dangerous, illegal, or policy-breaking content. 1. ๐—ฅ๐—ฒ๐—ฐ๐—ฒ๐—ถ๐˜ƒ๐—ฒ ๐˜๐—ต๐—ฒ ๐˜๐—ฒ๐˜…๐˜ (input or the modelโ€™s draft). 2. ๐—ก๐—ผ๐—ฟ๐—บ๐—ฎ๐—น๐—ถ๐˜‡๐—ฒ ๐—ถ๐˜ โ€” convert to a standard form (lowercase, remove weird spacing) so checks are reliable. 3. ๐—ฅ๐˜‚๐—ป ๐˜๐—ต๐—ฒ ๐˜€๐—ฎ๐—ณ๐—ฒ๐˜๐˜† ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น โ€” an algorithm grades the text (safe / risky / unknown). 4. ๐—ฆ๐—ฝ๐—ผ๐˜ ๐—ท๐—ฎ๐—ถ๐—น๐—ฏ๐—ฟ๐—ฒ๐—ฎ๐—ธ๐˜€ ๐—ผ๐—ฟ ๐˜๐—ฟ๐—ถ๐—ฐ๐—ธ๐˜€ โ€” looks for attempts to bypass safety by hiding instructions. 5. ๐—ฆ๐—ฐ๐—ผ๐—ฟ๐—ฒ ๐˜๐—ต๐—ฒ ๐—ฟ๐—ถ๐˜€๐—ธ โ€” low/medium/high. 6. ๐—ง๐—ฎ๐—ธ๐—ฒ ๐—ฎ๐—ฐ๐˜๐—ถ๐—ผ๐—ป: โ€ข Low risk โ†’ allow. โ€ข Medium risk โ†’ modify reply (safe alternative) or add guidance. โ€ข High risk โ†’ block reply and send safe refusal message. 7. ๐—ก๐—ผ๐˜๐—ถ๐—ณ๐˜† ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ & ๐—น๐—ผ๐—ด the incident for review. 8. ๐—”๐—ฑ๐—ท๐˜‚๐˜€๐˜ ๐˜€๐—ฎ๐—ณ๐—ฒ๐˜๐˜† ๐—ฟ๐˜‚๐—น๐—ฒ๐˜€ if needed. 2) ๐—ฃ๐—œ๐—œ ๐—™๐—ถ๐—น๐˜๐—ฒ๐—ฟ Purpose: prevent sharing personal or private information. 1. ๐—ง๐—ฎ๐—ธ๐—ฒ ๐˜๐—ต๐—ฒ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—นโ€™๐˜€ ๐—ผ๐˜‚๐˜๐—ฝ๐˜‚๐˜ (what it plans to say). 2. ๐—ง๐—ผ๐—ธ๐—ฒ๐—ป๐—ถ๐˜‡๐—ฒ / ๐—ฏ๐—ฟ๐—ฒ๐—ฎ๐—ธ ๐—ถ๐—ป๐˜๐—ผ ๐—ฝ๐—ถ๐—ฒ๐—ฐ๐—ฒ๐˜€ (words, phrases). 3. ๐—–๐—ผ๐—บ๐—ฝ๐—ฎ๐—ฟ๐—ฒ ๐˜๐—ผ๐—ธ๐—ฒ๐—ป๐˜€ ๐˜๐—ผ ๐—ฃ๐—œ๐—œ ๐—ฝ๐—ฎ๐˜๐˜๐—ฒ๐—ฟ๐—ป๐˜€ (names, phone numbers, emails, SSNs). 4. ๐—”๐—ฝ๐—ฝ๐—น๐˜† ๐—ฝ๐—ฎ๐˜๐˜๐—ฒ๐—ฟ๐—ป ๐—ฟ๐˜‚๐—น๐—ฒ๐˜€ (๐—ฟ๐—ฒ๐—ด๐—ฒ๐˜…) 5. ๐—–๐—ฟ๐—ผ๐˜€๐˜€-๐—ฐ๐—ต๐—ฒ๐—ฐ๐—ธ ๐˜„๐—ถ๐˜๐—ต ๐˜€๐—ฒ๐—ฐ๐˜‚๐—ฟ๐—ฒ ๐—ฑ๐—ฎ๐˜๐—ฎ๐—ฏ๐—ฎ๐˜€๐—ฒ๐˜€ (if allowed) to avoid leaking real records. 6. ๐—œ๐—ณ ๐—ฃ๐—œ๐—œ ๐—ณ๐—ผ๐˜‚๐—ป๐—ฑ โ†’ mask, redact or replace the sensitive part (e.g., โ€œ--1234โ€ or refuse). 7. ๐—Ÿ๐—ผ๐—ด ๐˜๐—ต๐—ฒ ๐—ฒ๐˜ƒ๐—ฒ๐—ป๐˜ and update the PII rules if a new pattern is found. 3) ๐—ฅ๐˜‚๐—น๐—ฒ๐˜€-๐—•๐—ฎ๐˜€๐—ฒ๐—ฑ ๐—ฃ๐—ฟ๐—ผ๐˜๐—ฒ๐—ฐ๐˜๐—ถ๐—ผ๐—ป๐˜€ Purpose: enforce hard business rules, legal limits, or customer policies. 1. ๐—œ๐—ป๐˜€๐—ฝ๐—ฒ๐—ฐ๐˜ ๐˜๐—ต๐—ฒ ๐—ฟ๐—ฒ๐—พ๐˜‚๐—ฒ๐˜€๐˜ against a list of banned words/commands or limits. 2. ๐—ฅ๐˜‚๐—ป ๐—ฟ๐—ฒ๐—ด๐—ฒ๐˜… ๐—ผ๐—ฟ ๐—ฝ๐—ฎ๐˜๐˜๐—ฒ๐—ฟ๐—ป ๐˜€๐—ฐ๐—ฎ๐—ป๐˜€ for forbidden patterns (like SQL in a text field). 3. ๐—˜๐—ป๐—ณ๐—ผ๐—ฟ๐—ฐ๐—ฒ ๐˜‚๐˜€๐—ฎ๐—ด๐—ฒ ๐—น๐—ถ๐—บ๐—ถ๐˜๐˜€ (e.g., prohibit long file attachments). 4. ๐—œ๐—ณ ๐—ฎ ๐—ฟ๐˜‚๐—น๐—ฒ ๐—ถ๐˜€ ๐—ฏ๐—ฟ๐—ผ๐—ธ๐—ฒ๐—ป โ†’ deny the action and return a specific message explaining why. 5. ๐—ฅ๐—ฒ๐—ฐ๐—ผ๐—ฟ๐—ฑ ๐˜๐—ต๐—ฒ ๐—ฎ๐˜๐˜๐—ฒ๐—บ๐—ฝ๐˜ and notify reviewers if needed. 6. ๐—ฅ๐—ฒ๐—ณ๐—ถ๐—ป๐—ฒ ๐˜๐—ต๐—ฒ ๐—ฟ๐˜‚๐—น๐—ฒ ๐—น๐—ถ๐˜€๐˜ when new cases appear. 4) ๐— ๐—ผ๐—ฑ๐—ฒ๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป Purpose: detect and handle abusive, hateful, or toxic content. 1. ๐—–๐—ผ๐—น๐—น๐—ฒ๐—ฐ๐˜ ๐˜๐—ต๐—ฒ ๐—ถ๐—ป๐—ฝ๐˜‚๐˜ ๐—ผ๐—ฟ ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น ๐—ผ๐˜‚๐˜๐—ฝ๐˜‚๐˜. 2. ๐—–๐—น๐—ฒ๐—ฎ๐—ป ๐—ฎ๐—ป๐—ฑ ๐—ฝ๐—ฟ๐—ฒ๐—ฝ๐—ฟ๐—ผ๐—ฐ๐—ฒ๐˜€๐˜€ (remove emojis, normalize language). 3. ๐—ฅ๐˜‚๐—ป ๐—บ๐—ผ๐—ฑ๐—ฒ๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—บ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ to detect hate speech, harassment, sexual content, self-harm, etc. 4. ๐—ฆ๐—ฐ๐—ผ๐—ฟ๐—ฒ ๐˜€๐—ฒ๐˜ƒ๐—ฒ๐—ฟ๐—ถ๐˜๐˜† (mild, severe). 5. ๐—ง๐—ฎ๐—ธ๐—ฒ ๐—ฎ๐—ฐ๐˜๐—ถ๐—ผ๐—ป: โ€ข Mild โ†’ warn user or sanitize content. โ€ข Severe โ†’ block and escalate to human review or emergency resources. 6. ๐—Ÿ๐—ผ๐—ด ๐—ณ๐—น๐—ฎ๐—ด๐—ด๐—ฒ๐—ฑ ๐—ฐ๐—ผ๐—ป๐˜๐—ฒ๐—ป๐˜ for trend analysis and to improve the moderation model. 7. ๐—ฅ๐—ฒ๐˜๐—ฟ๐—ฎ๐—ถ๐—ป ๐—ผ๐—ฟ ๐˜๐˜‚๐—ป๐—ฒ the moderation model using confirmed examples. โœ… Repost for others in your network who can benefit from this.

Reply
5

More like this

Recommendations from Medial

Prajapati Prince

presafeshoe.godaddys...ย โ€ขย 4m

I want to detect all these functions wirelessly using sensors suitable for production or medical-level use, providing accurate readings: 1. Wetness Alert 2. Gas Alert 3. Movement Monitoring 4. Light Detection 5. Body Temperature Check 6. Presenc

See More
Reply

Rahul Agarwal

Founder | Agentic AI...ย โ€ขย 3d

People know vibe tools, but struggle to use them. I've prepared a solid framework that works across all platforms. 1) ๐——๐—ฒ๐—ณ๐—ถ๐—ป๐—ฒ ๐˜๐—ต๐—ฒ ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜ ๐—ถ๐—ป ๐—ผ๐—ป๐—ฒ ๐˜€๐—ฒ๐—ป๐˜๐—ฒ๐—ป๐—ฐ๐—ฒ (5 ๐—บ๐—ถ๐—ป๐˜‚๐˜๐—ฒ๐˜€) Be brutally simple. Your app must do ONE thing

See More
Reply
2
7
Image Description
Image Description

Rahul Agarwal

Founder | Agentic AI...ย โ€ขย 1m

12 MCP servers every person should know. I've explained each one in a simple way. 1. ๐—™๐—ถ๐—น๐—ฒ ๐—ฆ๐˜†๐˜€๐˜๐—ฒ๐—บ ๐—ฆ๐—ฒ๐—ฟ๐˜ƒ๐—ฒ๐—ฟ โ€ข Gives AI access to your local files. โ€ข It can ๐—ฟ๐—ฒ๐—ฎ๐—ฑ, ๐˜„๐—ฟ๐—ถ๐˜๐—ฒ, ๐—ฎ๐—ป๐—ฑ ๐—ฐ๐—ฟ๐—ฒ๐—ฎ๐˜๐—ฒ files on your computer (under safe per

See More
2 Replies
44
31
6
Image Description
Image Description

Havish Gupta

Figuring Outย โ€ขย 1y

This startup made over $2 million just by spying at people! Let me explain. SO this is the story of Staqu, an AI startup founded by Atul Rai, Anurag Rastogi, and Pankaj Kumar in 2015. Their products use CCTV footage and with the help of AI, comput

See More
2 Replies
1
17
Image Description
Image Description

vishakha Jangir

โ†ณ Ed-tech/Freelancin...ย โ€ขย 1y

๐—›๐—ผ๐˜„ ๐—ฐ๐—ฎ๐—ป ๐˜†๐—ผ๐˜‚ ๐—ฏ๐—ฒ๐—ฐ๐—ผ๐—บ๐—ฒ ๐—ณ๐—ถ๐—ป๐—ฎ๐—ป๐—ฐ๐—ถ๐—ฎ๐—น๐—น๐˜† ๐—ถ๐—ป๐—ฑ๐—ฒ๐—ฝ๐—ฒ๐—ป๐—ฑ๐—ฒ๐—ป๐˜ ๐—ถ๐—ป ๐˜†๐—ผ๐˜‚๐—ฟ ๐Ÿฎ๐Ÿฌ๐˜€ ๐—ผ๐—ฟ ๐—ฒ๐˜ƒ๐—ฒ๐—ป ๐—ฏ๐—ฒ๐—ณ๐—ผ๐—ฟ๐—ฒ ๐Ÿฎ๐Ÿฌ ? โ†ณ Choose Your Skills: Identify 3-4 relevant areas where you already have skills, such as video editing, graphic d

See More
27 Replies
4
11
Image Description

Vishu Bheda

ย โ€ขย 

Medialย โ€ขย 5m

๐—ช๐—ต๐—ฒ๐—ป ๐—ฒ๐˜ƒ๐—ฒ๐—ฟ๐˜†๐—ผ๐—ป๐—ฒโ€™๐˜€ ๐—ฐ๐—ต๐—ฎ๐˜€๐—ถ๐—ป๐—ด ๐—”๐—œ ๐—ด๐—ผ๐—น๐—ฑ, ๐—ก๐—ฉ๐—œ๐——๐—œ๐—” ๐˜€๐—ผ๐—น๐—ฑ ๐˜๐—ต๐—ฒ ๐˜€๐—ต๐—ผ๐˜ƒ๐—ฒ๐—น๐˜€. Thatโ€™s the smartest move in the whole game. While Microsoft, Google, and Meta are spending billions to build AI models... NVIDIA quietly became

See More
2 Replies
5
16
Image Description
Image Description

Vishu Bheda

ย โ€ขย 

Medialย โ€ขย 6m

๐— ๐—ฎ๐—ฟ๐—ฐ ๐—”๐—ป๐—ฑ๐—ฟ๐—ฒ๐—ฒ๐˜€๐˜€๐—ฒ๐—ป ๐—ท๐˜‚๐˜€๐˜ ๐—ฑ๐—ฟ๐—ผ๐—ฝ๐—ฝ๐—ฒ๐—ฑ ๐Ÿฑ ๐˜๐—ฟ๐˜‚๐˜๐—ต๐˜€ ๐˜๐—ต๐—ฎ๐˜ ๐—ฐ๐—ฎ๐—ป ๐—น๐—ถ๐˜๐—ฒ๐—ฟ๐—ฎ๐—น๐—น๐˜† ๐—ฐ๐—ต๐—ฎ๐—ป๐—ด๐—ฒ ๐˜†๐—ผ๐˜‚๐—ฟ ๐˜๐—ฒ๐—ฐ๐—ต ๐—ท๐—ผ๐˜‚๐—ฟ๐—ป๐—ฒ๐˜†. Not gyan. Not fluff. Just real, raw frameworks. ๐Ÿญ. ๐—ฅ๐˜‚๐—ป ๐˜๐—ผ ๐˜๐—ต๐—ฒ ๐—ต๐—ฒ๐—ฎ๐˜ Donโ€™t play it sa

See More
11 Replies
58
49

One AI Market

AI Market Placeย โ€ขย 8m

๐Ÿš€ Introducing One AI Market ๐Ÿš€ One AI Market is the place to create customized AI agents for any challengeโ€”no code required: Text Agents for instant summaries, sentiment analysis, and data extraction from any document or message. Vision Agents to

See More
Reply
2
Image Description
Image Description

Vishu Bheda

ย โ€ขย 

Medialย โ€ขย 8m

๐—˜๐—น๐—ผ๐—ป ๐— ๐˜‚๐˜€๐—ธ ๐—ผ๐—ป ๐—ช๐—ต๐˜† ๐—ฅ๐—ฒ๐—บ๐—ผ๐˜๐—ฒ ๐—ช๐—ผ๐—ฟ๐—ธ ๐— ๐—ถ๐—ด๐—ต๐˜ ๐—ž๐—ถ๐—น๐—น ๐—ฌ๐—ผ๐˜‚๐—ฟ ๐—ฆ๐˜๐—ฎ๐—ฟ๐˜๐˜‚๐—ฝ "๐ˆ๐Ÿ ๐ฒ๐จ๐ฎ ๐๐จ๐งโ€™๐ญ ๐ฌ๐ก๐จ๐ฐ ๐ฎ๐ฉ, ๐ฐ๐žโ€™๐ฅ๐ฅ ๐š๐ฌ๐ฌ๐ฎ๐ฆ๐ž ๐ฒ๐จ๐ฎโ€™๐ฏ๐ž ๐ซ๐ž๐ฌ๐ข๐ ๐ง๐ž๐." Thatโ€™s what Elon Musk told every Tesla employee. Harsh? May

See More
4 Replies
6
19
Image Description
Image Description

Mannan Baluvuri

Lifelong Learnerย โ€ขย 5m

๐— ๐—ถ๐—ฐ๐—ต๐—ฎ๐—ฒ๐—น ๐——๐—ฒ๐—น๐—น, ๐—ณ๐—ผ๐˜‚๐—ป๐—ฑ๐—ฒ๐—ฟ ๐—ผ๐—ณ ๐——๐—ฒ๐—น๐—น, ๐˜€๐—ฎ๐˜†๐˜€: "Early in their careers, employees care most about pay and stability, but as they grow into leadership, they look for meaning and purpose too." In his book Play Nice But Win, he connec

See More
2 Replies
4
11

Download the medial app to read full posts, comements and news.