SpeechEvalPro Frequently Asked Questions

SpeechEvalPro Frequently Asked Questions. SpeechEvalPro: High-quality, multi-dimensional AI tool for accurate Chinese and English pronunciation assessment, combining voice evaluation, speech recognition, and core technologies.

FAQ from SpeechEvalPro

What is SpeechEvalPro?

SpeechEvalPro is a state-of-the-art pronunciation assessment and scoring API solution. It combines voice evaluation, speech recognition, and other core technologies to provide accurate and reliable pronunciation assessment for educational purposes.

How to use SpeechEvalPro?

To use SpeechEvalPro, simply sign up for a free trial or choose a pricing plan that suits your needs. Once you have access, integrate our API into your learning product or application using HTTP or WebSocket requests. Our API supports recommended audio formats and a variety of question types. Detailed instructions and guidelines can be found in our documentation.

Is there an SDK available for SpeechEvalPro?

At the moment, an SDK is not available. However, you can directly call our WebAPI, which offers streaming capabilities and is lightweight and cross-platform.

What audio formats are supported for pronunciation evaluation?

We recommend sending audio files in 16-bit sample size, 16K sample rate, 1 channel opus_raw, pcm, wav, or mp3 format. Other formats may affect scoring results.

What question types are supported, and what are the time and text length restrictions?

SpeechEvalPro supports phoneme, word, sentence, and chapter (paragraph) modes. Time and text length restrictions vary for each mode. In phoneme & word mode, the duration is up to 20 seconds. In sentence mode, the duration is up to 40 seconds, with a text length limit of 300 characters. In chapter mode, the duration is up to 300 seconds, with a text length limit of 10,000 characters. Please consult our documentation for specific details.