Hey I am on Medial • 10d
on first point, voice agents - do we really need to do tts and stt. gpt 4o real-time or gemini live models directly takes voice input and voice output. is there any problem with this approach. need opinion as I am building a voice bot based solution. As per my opinion, currently, the cost is the only major differentiator.
Building @frendle.ap... • 3m
I have been trying to include live voice Convo for my app. I tried gemini live api, open ai stt tts, some have latency issues, some have very bad voice recognition and interruption pattern. Has anyone implemented this or know a cheap way to implemen
See MoreHey I am on Medial • 9m
One Good Thing Noticed in Gemini "Double-Check Response" It will check the output again and validate and fact-check it. As we all know, generative AI has limitations, and we can't always rely on its facts. However, this feature will be a good step
See MoreBuilding Tech at Urb... • 5m
🚀 Exploring Conversational AI + Automation: My journey with Voiceflow, Retell AI & n8n Over the past few weeks, I’ve been diving into chatbots, voice agents & workflow automation — and it’s been a rewarding learning curve! 🔹 Built a chatbot on Vo
See MoreDownload the medial app to read full posts, comements and news.