Nilesh Kashid

Hey I am on Medial • 10d

On the first point, voice agents: do we really need separate TTS and STT steps? GPT-4o Realtime and the Gemini Live models take voice input and produce voice output directly. Is there any problem with this approach? I need opinions, as I am building a voice-bot-based solution. In my opinion, cost is currently the only major differentiator.

