Features:

  • No dependencies on other paid cloud services

  • One time payment

  • The server can handle multiple clients at the same time

  • Easy to setup

  • No internet required

Code Modules:

  • VoskPlugin (Runtime)

Number of Blueprints: 0

Number of C++ Classes: 3

Network Replicated: No

Supported Development Platforms: Windows, Mac, Linux

Supported Target Build Platforms: Windows, Mac, Linux

Documentation: https://github.com/IlgarLunin/VoskPlugin-docs

Discord: https://discord.gg/Tkf7xe2

特征:

  • 不依赖于其他付费云服务

  • 一次付款

  • 服务器可以在同一时间处理多个客户端

  • 易于安装

  • 不需要互联网

代码模块:

  • VoskPlugin(运行时)

蓝图数目:0

C++类数:3

网络复制:没有

支持的开发平台:Windows、Mac、Linux

支持的目标构建平台:Windows、Mac、Linux

文件: https://github.com/IlgarLunin/VoskPlugin-docs

不和谐: https://discord.gg/Tkf7xe2


Allows you to recognize speech from more than 15 languages, without relying on any cloud service or subscription. Instead, a language server is a separate process on your machine, which talks with your game. The language server app is public ( https://github.com/IlgarLunin/vosk-language-server ), you can fork it and customize, distribute with your game, run it without any user interface.

Unreal engine client is dead simple communication with language server. It connects to it, records, and feeds your voice to the language server, the server sends recognized voices as text back to unreal.

This is streaming voice recognition, and you can implement simple conversations with your NPC without any user input except voice. “Ok robot, do this”, “Ok robot, do that” etc.

Download latest language server: https://github.com/IlgarLunin/vosk-language-server/releases

Using language server as a separate app is optional! Your game itself can act like language server.

Visit discord and documentation for more info

Video demonstration: https://youtu.be/iJVCsuuC5A4

Example project for Unreal 5.3: here

允许您从多个识别语音 15种语言,而不依赖于任何云服务或订阅。 相反,语言服务器是您的机器上的一个单独的进程,它与您的游戏对话。 语言服务器应用程序是公共的( https://github.com/IlgarLunin/vosk-language-server ),您可以叉它和自定义,与您的游戏分发,运行它没有任何用户界面。

虚幻引擎客户端是死与语言服务器的简单通信。 它连接到它,记录并将您的声音反馈到语言服务器,服务器将识别的声音作为文本发送回虚幻。

这是流式语音识别,您可以实现与NPC的简单对话,除了语音之外没有任何用户输入。 “Ok机器人,做到这一点”,”Ok机器人,做到这一点”等等。

下载最新语言服务器: https://github.com/IlgarLunin/vosk-language-server/releases

使用语言服务器作为一个单独的应用程序是可选的! 您的游戏本身可以像语言服务器一样。

访问不和谐和文档以获取更多信息

视频演示: https://youtu.be/iJVCsuuC5A4

虚幻5.3的示例项目: 这里

声明:本站所有资源都是由站长从网络上收集而来,如若本站内容侵犯了原著者的合法权益,可联系站长删除。