Back

Rahul Agarwal

Founder | Agentic AI... • 2m

How do Voice, Coding & Computer Agents work? I've explained each one in a very simple way below. 1. 𝗩𝗼𝗶𝗰𝗲 𝗔𝗴𝗲𝗻𝘁𝘀 AI systems that talk with people using speech. Examples: Vapi, Retell AI, OpenAI TTS etc. 𝗦𝘁𝗲𝗽𝘀: 1. 𝗨𝘀𝗲𝗿 𝘀𝗽𝗲𝗮𝗸𝘀 𝗮 𝗾𝘂𝗲𝗿𝘆 → Example: “What’s the stock price of nvidia today?” 2. 𝗧𝗲𝗹𝗲𝗽𝗵𝗼𝗻𝘆 𝘀𝘆𝘀𝘁𝗲𝗺 captures the voice. 3. 𝗦𝗧𝗧 (𝗦𝗽𝗲𝗲𝗰𝗵-𝘁𝗼-𝗧𝗲𝘅𝘁) converts the spoken words into text. 4. 𝗔𝗴𝗲𝗻𝘁 𝗽𝗿𝗼𝗰𝗲𝘀𝘀𝗲𝘀 𝗶𝘁 using: • 𝗘𝗺𝗯𝗲𝗱𝗱𝗶𝗻𝗴 𝗠𝗼𝗱𝗲𝗹 & 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗔𝗣𝗜 (finds relevant knowledge). • 𝗩𝗲𝗰𝘁𝗼𝗿 𝗗𝗕 (stores knowledge for quick search). 5. 𝗔𝗴𝗲𝗻𝘁 𝗱𝗲𝗰𝗶𝗱𝗲𝘀 𝗮𝗻 𝗮𝗻𝘀𝘄𝗲𝗿 using tools. 6. 𝗧𝗧𝗦 (𝗧𝗲𝘅𝘁-𝘁𝗼-𝗦𝗽𝗲𝗲𝗰𝗵) converts text back into speech. 7. 𝗢𝘂𝘁𝗽𝘂𝘁 is spoken back to the user. 𝗜𝗻 𝘀𝗵𝗼𝗿𝘁: Voice → Text → Keywords → Search → Answer → Voice Back. _____________________________________________ 2. 𝗖𝗼𝗱𝗶𝗻𝗴 𝗔𝗴𝗲𝗻𝘁𝘀 AI systems that help developers write, debug, and test code faster. Examples: GitHub Copilot, Cursor AI etc. 𝗦𝘁𝗲𝗽𝘀: 1. 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗲𝗿 𝗴𝗶𝘃𝗲𝘀 𝗮 𝗾𝘂𝗲𝗿𝘆 → Example: “Write a function to sort numbers.” 2. 𝗔𝗴𝗲𝗻𝘁 receives the request. 3. Agent uses: • 𝗖𝗼𝗱𝗲 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗼𝗿 → writes new code. • 𝗖𝗼𝗱𝗲 𝗗𝗲𝗯𝘂𝗴𝗴𝗲𝗿 → finds and fixes errors. • 𝗧𝗲𝘀𝘁 𝗥𝘂𝗻𝗻𝗲𝗿 → checks if code works. 4. 𝗘𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁 provides a workspace where the code runs. 5. 𝗢𝘂𝘁𝗽𝘂𝘁 → working code returned to the developer. 𝗜𝗻 𝘀𝗵𝗼𝗿𝘁: Query → Keywords → AI writes code → Debug + Test → Final Code. _____________________________________________ 3. 𝗖𝗨𝗔 (𝗖𝗼𝗺𝗽𝘂𝘁𝗲𝗿 𝗨𝘀𝗶𝗻𝗴 𝗔𝗴𝗲𝗻𝘁𝘀) AI systems that 𝘂𝘀𝗲 𝗮 𝗰𝗼𝗺𝗽𝘂𝘁𝗲𝗿 𝗹𝗶𝗸𝗲 𝗮 𝗵𝘂𝗺𝗮𝗻 𝘄𝗼𝘂𝗹𝗱 (e.g., clicking, typing, browsing). Examples: Manus AI, Saner AI etc. 𝗦𝘁𝗲𝗽𝘀: 1. 𝗨𝘀𝗲𝗿 𝗴𝗶𝘃𝗲𝘀 𝗮 𝗾𝘂𝗲𝗿𝘆 → Example: “Book the cheapest flight from Hong Kong to Paris.” 2. 𝗩𝗶𝘀𝗶𝗼𝗻 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗠𝗼𝗱𝗲𝗹 (𝗩𝗟𝗠) → understands what’s on the screen. 3. 𝗚𝗲𝗻𝗲𝗿𝗮𝗹 𝗣𝘂𝗿𝗽𝗼𝘀𝗲 𝗟𝗟𝗠 → decides actions (click, type, drag, etc.). 4. Uses: • 𝗩𝗲𝗰𝘁𝗼𝗿 𝗗𝗕 → recall past actions or data. • 𝗠𝗲𝗺𝗼𝗿𝘆 → remember ongoing tasks. • 𝗧𝗵𝗶𝗿𝗱-𝗣𝗮𝗿𝘁𝘆 𝗧𝗼𝗼𝗹𝘀 → connect with apps (email, Excel, etc.). • 𝗗𝗲𝘀𝗸𝘁𝗼𝗽 𝗦𝗮𝗻𝗱𝗯𝗼𝘅 → safe test environment to interact with the computer. 5. 𝗢𝘂𝘁𝗽𝘂𝘁 → task completed automatically (like a digital assistant doing your computer work). 𝗜𝗻 𝘀𝗵𝗼𝗿𝘁: You instruct → AI sees screen + plans → clicks/types → finishes the task like a human would. All these different agents are shaping the future of AI. What’s your take? ✅ Repost for others who want to understand how they work.

2 Replies
5
16
Replies (2)

More like this

Recommendations from Medial

Image Description

Rahul Agarwal

Founder | Agentic AI... • 2m

How Multi-Agent AI systems actually work? Explained in a very simple way. Read below: -> 𝗧𝗵𝗲 𝗠𝗮𝗶𝗻 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 The main 𝗔𝗜 𝗔𝗴𝗲𝗻𝘁 is the 𝗼𝗿𝗰𝗵𝗲𝘀𝘁𝗿𝗮𝘁𝗼𝗿. It has several capabilities: • 𝗗𝗮𝘁𝗮𝗯𝗮𝘀𝗲 – Stores knowledge o

See More
Reply
6
19
1
Image Description
Image Description

Vishu Bheda

 • 

Medial • 6m

𝗬𝗼𝘂 𝗱𝗼𝗻’𝘁 𝗻𝗲𝗲𝗱 𝗮 𝘁𝗲𝗮𝗺, 𝗩𝗖 𝗰𝗮𝘀𝗵, 𝗼𝗿 𝗮 𝘆𝗲𝗮𝗿 𝗼𝗳 𝗰𝗼𝗱𝗶𝗻𝗴 𝘁𝗼 𝗹𝗮𝘂𝗻𝗰𝗵 𝗮 𝘀𝘁𝗮𝗿𝘁𝘂𝗽 𝗮𝗻𝘆𝗺𝗼𝗿𝗲. Now? One person + a weekend + no-code tools = real money while you sleep. Someone just built an AI tool us

See More
13 Replies
51
54

Vishu Bheda

 • 

Medial • 5m

𝗔𝗜 𝗶𝘀 𝘁𝘂𝗿𝗻𝗶𝗻𝗴 𝗲𝘃𝗲𝗿𝘆 𝗰𝘂𝗿𝗶𝗼𝘂𝘀 𝗽𝗲𝗿𝘀𝗼𝗻 𝗶𝗻𝘁𝗼 𝗮 𝗺𝗼𝗱𝗲𝗿𝗻-𝗱𝗮𝘆 𝗽𝗼𝗹𝘆𝗺𝗮𝘁𝗵. Want to code? AI walks you through it. Trying to grasp finance? AI explains it step by step. Curious about history? AI becomes your p

See More
Reply
8

Rahul Agarwal

Founder | Agentic AI... • 1m

3 core protocols behind AI agents. I’ve explained each one in simple steps. 1. 𝗠𝗖𝗣 (𝗠𝗼𝗱𝗲𝗹 𝗖𝗼𝗻𝘁𝗲𝘅𝘁 𝗣𝗿𝗼𝘁𝗼𝗰𝗼𝗹) • 𝗠𝗮𝗶𝗻 𝗣𝘂𝗿𝗽𝗼𝘀𝗲: It helps AI models (LLMs) use extra information by connecting them to tools or databases.

See More
Reply
3
5
Image Description
Image Description

Rahul Agarwal

Founder | Agentic AI... • 2m

Well, Lovable is great for building apps. But how does Lovable actually produce full apps? I'll break down the entire process of how lovable works step by step. 1. 𝗨𝘀𝗲𝗿 𝗜𝗻𝗽𝘂𝘁 (𝗣𝗿𝗼𝗺𝗽𝘁 𝗦𝘁𝗮𝗴𝗲) • You type your idea in Lovable (e.g.

See More
2 Replies
4
14
Image Description
Image Description

Vishu Bheda

 • 

Medial • 4m

𝗟𝗼𝘃𝗮𝗯𝗹𝗲 𝗷𝘂𝘀𝘁 𝗵𝗶𝘁 $𝟭𝟬𝟬𝗠 𝗔𝗥𝗥 𝗶𝗻 𝟴 𝗺𝗼𝗻𝘁𝗵𝘀. Faster than Shopify, OpenAI, Slack, Cursor — all of them. And now, they launched their boldest bet yet: 𝗟𝗼𝘃𝗮𝗯𝗹𝗲 𝗔𝗴𝗲𝗻𝘁 — an AI that doesn’t feel like a tool. It feels

See More
20 Replies
29
64
2
Image Description

Vishu Bheda

 • 

Medial • 6m

“𝗟𝗲𝗮𝗿𝗻 𝘁𝗼 𝗰𝗼𝗱𝗲 𝗮𝗻𝗱 𝘆𝗼𝘂’𝗹𝗹 𝗮𝗹𝘄𝗮𝘆𝘀 𝗵𝗮𝘃𝗲 𝗮 𝗷𝗼𝗯.” That advice used to be gold. But times changed. Software dev job postings in the US are down 𝟕𝟎% since 2022. 𝗛𝗲𝗿𝗲’𝘀 𝘄𝗵𝘆: •AI is quietly replacing junior rol

See More
1 Reply
7
17
Image Description

Vishu Bheda

 • 

Medial • 6m

𝗢𝗽𝗲𝗻𝗔𝗜 𝗷𝘂𝘀𝘁 𝗹𝗮𝘂𝗻𝗰𝗵𝗲𝗱 𝘀𝗼𝗺𝗲𝘁𝗵𝗶𝗻𝗴 𝘄𝗶𝗹𝗱 — 𝗖𝗼𝗱𝗲𝘅. And it’s not your average code-suggestion tool anymore. It’s a full-blown 𝐀𝐈 𝐭𝐞𝐚𝐦𝐦𝐚𝐭𝐞 for anyone who builds software — even if you don’t write much code. �

See More
1 Reply
9
32

Rahul Agarwal

Founder | Agentic AI... • 2m

Fine-tune vs Prompt vs Context Engineering. Simple step-by-step breakdown for each approach. 𝗙𝗶𝗻𝗲-𝗧𝘂𝗻𝗶𝗻𝗴 (𝗠𝗼𝗱𝗲𝗹-𝗟𝗲𝘃𝗲𝗹 𝗖𝘂𝘀𝘁𝗼𝗺𝗶𝘇𝗮𝘁𝗶𝗼𝗻) 𝗙𝗹𝗼𝘄: 1. Collect Data → Gather domain-specific info (e.g., legal docs). 2. Sta

See More
Reply
2
13
Image Description

Rahul Agarwal

Founder | Agentic AI... • 16d

9 Steps to Build AI Agents from Scratch. I've given a simple step by step explanation. 𝗦𝘁𝗲𝗽 1: 𝗘𝘀𝘁𝗮𝗯𝗹𝗶𝘀𝗵 𝗠𝗶𝘀𝘀𝗶𝗼𝗻 & 𝗥𝗼𝗹𝗲 • Decide what problem the agent will solve. • Figure out who will use it. • Plan how users will interact

See More
Reply
4
15
1

Download the medial app to read full posts, comements and news.