
Rahul Agarwal

Founder | Agentic AI... • 3m

How can modern AI systems stop giving wrong answers? I've explained 4 guardrails in simple steps below.

1) Safety Classifier
Purpose: detect dangerous, illegal, or policy-breaking content.
1. Receive the text (input or the model's draft).
2. Normalize it: convert to a standard form (lowercase, remove weird spacing) so checks are reliable.
3. Run the safety model: an algorithm grades the text (safe / risky / unknown).
4. Spot jailbreaks or tricks: look for attempts to bypass safety by hiding instructions.
5. Score the risk: low/medium/high.
6. Take action:
• Low risk → allow.
• Medium risk → modify reply (safe alternative) or add guidance.
• High risk → block reply and send safe refusal message.
7. Notify system & log the incident for review.
8. Adjust safety rules if needed.

2) PII Filter
Purpose: prevent sharing personal or private information.
1. Take the model's output (what it plans to say).
2. Tokenize / break into pieces (words, phrases).
3. Compare tokens to PII patterns (names, phone numbers, emails, SSNs).
4. Apply pattern rules (regex).
5. Cross-check with secure databases (if allowed) to avoid leaking real records.
6. If PII found → mask, redact or replace the sensitive part (e.g., "--1234"), or refuse.
7. Log the event and update the PII rules if a new pattern is found.

3) Rules-Based Protections
Purpose: enforce hard business rules, legal limits, or customer policies.
1. Inspect the request against a list of banned words/commands or limits.
2. Run regex or pattern scans for forbidden patterns (like SQL in a text field).
3. Enforce usage limits (e.g., prohibit long file attachments).
4. If a rule is broken → deny the action and return a specific message explaining why.
5. Record the attempt and notify reviewers if needed.
6. Refine the rule list when new cases appear.

4) Moderation
Purpose: detect and handle abusive, hateful, or toxic content.
1. Collect the input or model output.
2. Clean and preprocess (remove emojis, normalize language).
3. Run moderation models to detect hate speech, harassment, sexual content, self-harm, etc.
4. Score severity (mild, severe).
5. Take action:
• Mild → warn user or sanitize content.
• Severe → block and escalate to human review or emergency resources.
6. Log flagged content for trend analysis and to improve the moderation model.
7. Retrain or tune the moderation model using confirmed examples.

✅ Repost for others in your network who can benefit from this.

