Features:
- Fast recognition speed
- English-only and multilingual models available; the multilingual models support 100 languages
- Multiple model sizes available (from 75 MB to 2.9 GB)
- Automatic download of language models in the Editor
- Optional translation of recognized speech to English
- Customizable properties
- Easy selection of model size and language in settings
- No static libraries or external dependencies
- Cross-platform compatibility (Windows, Mac, Linux, Android, iOS, etc.)
Please be aware that speech recognition has known errors in the 4.27 engine version, particularly in streaming mode. Using engine version 5.0 or later is therefore highly recommended.
NOTE: The images showing plugin examples and the demo project were made in conjunction with RuntimeAudioImporter. To follow these examples, you’ll need to have that plugin installed. However, you can also use your own audio input implementation with RuntimeSpeechRecognizer.
Runtime Speech Recognizer is a plugin that enables real-time, offline speech recognition. It is based on OpenAI’s Whisper technology, specifically the whisper.cpp library, and supports multiple language models, which are pre-selected in the plugin’s settings.