Back

Nilesh Kashid

Hey I am on Medial • 2m

on first point, voice agents - do we really need to do tts and stt. gpt 4o real-time or gemini live models directly takes voice input and voice output. is there any problem with this approach. need opinion as I am building a voice bot based solution. As per my opinion, currently, the cost is the only major differentiator.

Reply

More like this

Recommendations from Medial

NIKUNJ TULSYAN

Building Flex Aura |... • 5m

I have been trying to include live voice Convo for my app. I tried gemini live api, open ai stt tts, some have latency issues, some have very bad voice recognition and interruption pattern. Has anyone implemented this or know a cheap way to implemen

See More
Reply
1
Image Description
Image Description

Aryan patil

Intern at YourStory ... • 1y

"Google apologize for calling modi a fascist by it's Ai chat bot Gemini" What do you think?

10 Replies
13
Image Description
Image Description

Kimiko

Startups | AI | info... • 6m

google has dropped a feature that lets you describe a speaker’s voice in plain english, stuff like accent, dialect, tone, even language. it nails it effortlessly. you can try this in ai studio in a model called "Gemini 2.5 Flash Preview TTS"

3 Replies
9
Image Description
Image Description

Shaurya Jha

Building upbot.space • 11m

Since AI agents are trending in the market, why not build a bot that is your google meet companion? whether it's summarizing, TTS or Recording, it's your one go model to do everything It can include features like: - Meeting Recording - Optimized

See More
9 Replies
12

zaheer

Husler • 5m

I’m not building a chatbot. I’m building presence. Crux AI creates real-time voice agents that speak like you, think with GPT-4o, and reply through ElevenLabs — deployed via web or phone. Perfect for coaches, closers, and founders who want to autom

See More
Reply
2
Image Description

Mukund

Building Future • 10m

Hello Friends, is there any AI Voice Specialist, do let me know need some suggestions

1 Reply
1

Sairaj Kadam

Student & Financial ... • 7m

This is why most people fail in sales or anything else. Let’s take sales: Your goal is 1,000 sales in 3 months. You close 40 out of 100? That’s a 40% rate. So you need 2,500 leads. Simple. You control the input. And that’s what decides the output. Ge

See More
Reply
8
Image Description

Rahul Kalyankar

Building Tech at Urb... • 7m

🚀 Exploring Conversational AI + Automation: My journey with Voiceflow, Retell AI & n8n Over the past few weeks, I’ve been diving into chatbots, voice agents & workflow automation — and it’s been a rewarding learning curve! 🔹 Built a chatbot on Vo

See More
1 Reply
1
4
Image Description
Image Description

utkarsh kothari

Building happening |... • 7m

Need everyone's opinion on my startup idea 👇🏻 it'll really help me validate my idea! Also do upvote☺️ https://medial.app/idea/happening-be97b10c40013

1 Reply
7
1

Download the medial app to read full posts, comements and news.