Moshi AI chatbot with real-time voice features launched by Kyutai Labs as a rival to GPT-4o

Date:

Kyutai Labs on Wednesday launched Moshi AI, an artificial intelligence (AI) chatbot that responds verbally in real-time. The French AI firm has announced that Moshi’s entire audio language model was developed in-house. It can also modify the voice to express emotions and respond to different speaking styles. The AI ​​model can be accessed by the public for free. Currently, the AI ​​model limits conversations to five minutes. Interestingly, OpenAI also announced similar speech features with the release of GPT-4o, but it is yet to be released.

Moshi AI Features

The company says the AI ​​model was developed over six months with a team of eight people. Unveiling the AI ​​model at an event in Paris, Kyutai Labs said Moshi is not an AI assistant but a prototype that can be used to develop tools for various use cases. It has also made the chatbot publicly available here. Users can enter their email and join the queue, but Gadgets 360 staff were able to get instant access to the platform without any waiting time.

The platform interface is quite simple. It has a simplified AI design where users can check the loudness of their voice while speaking. There is a text box where only the responses of the AI ​​are visible. Another box near the top displays technical details such as audio duration, latency, and missed audio.

At the top, there is a button to disconnect the call. Currently, the maximum call duration can be five minutes. The description page states that Moshi can think, speak and listen at the same time to maximize the flow of conversation.

Gadgets 360 found that latency is extremely low, and the AI ​​often responds instantly. However, there are some instances where the response time delay can exceed 10-15 seconds. But this could be due to heavy server load. However, sometimes verbal prompts did not register at all even after the volume meter was three-quarters full.

Moshi Aye Voice Moshi Aye

Moshi AI Interface
Photo credit: Kyutai Labs

Gadgets 360 also found that the AI ​​model can respond in an emotional voice, and can speak in different styles and using different voice modulations. The AI ​​model is also connected to the internet and can get answers to questions that require looking up the web. Notably, the chatbot does not allow text prompts, and voice is the only means to interact with it.

Kyutai Labs has said that the AI ​​model will be open-source. However, the AI ​​firm has not yet hosted the model weights and code on the portal. Once available, users will be able to download and install it locally, and it can be run on an unconnected device.

For the latest tech news and reviews, follow Gadgets 360 XFacebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who’sThat360 on Instagram and YouTube.

Lava Blaze X 5G price leaked ahead of India launch; expected to feature MediaTek Dimensity 7050 SoC

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Share post:

Subscribe

spot_imgspot_img

Popular

More like this
Related

A Side Sleeper Test & Review Popular Body Pillows (2024)

being a party This can be challenging for the...

How to Protect Your Startup from Email Scams

For years, it's been claimed that the "end of...