Chat GPT can now speak and sing in real time Chat GPT现在可以实时说话和唱歌了| DW 德国之声

The AI race has just shifted into high gear, with US artificial intelligence pioneers OpenAI rolling out its new interface that works with audio and vision as well as text. The new model, called GPT-4o, has gone beyond the familiar chat-bot features and is capable of real-time, near-natural voice conversations. The developer OpenAI will also make it available to free users.

ChatGPT was already able to talk to users, but with long pauses to process the data. It often seemed a bit sluggish. This was because the feature required three internal applications, the company explained: transcribing the spoken text, processing and generating, and converting the response to speech. This caused delays.

We talk to computer scientist Mike Cook from the renowned Kings College London about the new Chat GPT-4o development.

人工智能竞赛刚刚进入高速阶段,美国的人工智能先锋OpenAI推出了其新界面,该界面不仅可以处理文本,还可以处理音频和视觉。新的模型称为GPT-4o,超越了传统的聊天机器人功能,能够进行实时、近乎自然的语音对话。开发商OpenAI还将其向免费用户开放。

ChatGPT此前已经能够与用户对话,但由于需要长时间处理数据,常显得有些迟钝。公司解释说,这是因为该功能需要三个内部应用程序:转录语音文本、处理和生成内容,以及将响应转换为语音。这导致了延迟。

我们采访了来自著名伦敦国王学院的计算机科学家迈克·库克,讨论新的Chat GPT-4o的发展。

编后语:其实现在大家使用的越多,就会帮助OpenAI去训练他们的大模型,所以说他们最重要的资源不是网上的开源数据,而是用户每个人的生活真实数据和案例。

※新西兰全搜索©️版权所有

敬请关注新西兰全搜索New Zealand Review 在各大社交媒体平台的公众号。从这里读懂世界!️

如果您喜欢我们的文章,请支持我们的新闻工作者和创作者!请打赏一杯咖啡给他们(注明栏目或文章题目),或支持我们每月的服务器费用,非常感谢!
订阅我们,Paypal每月赞助5纽币:
http://bit.ly/47fUCPS