Language Learning

Reimagined through

Pronunciation Assessment APIs

Deep learning-powered speech and pronunciation assessment of pronunciation, fluency, and grammar.
Try demo
Start free trial
Why SpeechSuper
Detailed Scoring
SpeechSuper API scores paragraphs, sentences, words, phonemes, and tones. It also scores pronunciation, fluency, and completeness for spoken sentences and paragraphs.
Syllable Stress
SpeechSuper API detects whether the user laid the stress on the right syllable. It is useful for users speaking non-syllabic languages as their first language.
Mispronunciation Diagnosis
SpeechSuper API diagnoses what phonemes users say so that products can give instructions for mispronunciation correction.
Customized Scoring
SpeechSuper API supports scoring for children, young adolescents, and adults; customizing pronunciation for specific words; returning scores at different grading scales and decimal places. The scoring strictness is flexible based on your needs.
Closed-Ended Assessment
We support word-level, sentence-level, and paragraph-level closed-end assessments. It’s useful for prompted read-aloud speaking activities and practice.
Semi-Open-Ended Assessment
Semi-open-ended assessment needs multiple reference texts to give scores. It also supports scoring with key points. We’re now working towards an open-ended assessment API.
Support for Accents
SpeechSuper API supports American, British, and Australian English. It is not biased toward any accents and supports specifying a particular accent to score.
Phoneme Spelling Alignment
Reading paragraphs could be boring. SpeechSuper API’s real-time feedback can make the experience interactive and engaging. It can light up the words users just read to behave like Karaoke.
How it Works
1
API input
  • Choose the language and word assessment / sentence assessment / paragraph assessment API.
  • Send reference text and audio to API.
HTTP
WebSocket
2
API output
What We Served
10 B+
API requests
100 +
Business customers
99.9 %
Uptime
Trusted and Used by
Brazil
France
France
China
China
China
China
Vietnam
US
Pay ahead. Save money.
Our pricing strategy covers 8 languages:
English, Mandarin Chinese, German, French, Spanish, Korean, Japanese and Russian.
Pay As You Go
No minimum.
Scripted
Pronunciation Assessment
Word
$ 0.004/request
up to 20 secs, single word
Sentence
$ 0.006/request
up to 90 secs, 2-200 word
Paragraph
$ 0.008/request
up to 3 mins, over 200 words
Spontaneous
Speech Assessment
Basic
$ 0.030/request
up to 2 mins
Includes basic scores
Pro
$ 0.035/request
up to 2 mins
Includes scores and details like
grammar corrections
Start for free
Starter
Prepay $500-$1,999
credits for the year.
Scripted Pronunciation Assessment
Word
$ 0.0034/request
up to 20 secs, single word
Sentence
$ 0.0051/request
up to 90 secs, 2-200 word
Paragraph
$ 0.0068/request
up to 3 mins, over 200 words
Spontaneous
Speech Assessment
Basic
$ 0.025/request
up to 2 mins
Includes basic scores
Pro
$ 0.030/request
up to 2 mins
Includes scores and details like
grammar corrections
Start for free
Growth
Prepay $2,000-$4,999
credits for the year.
Scripted Pronunciation Assessment
Word
$ 0.0028/request
up to 20 secs, single word
Sentence
$ 0.0042/request
up to 90 secs, 2-200 word
Paragraph
$ 0.0056/request
up to 3 mins, over 200 words
Spontaneous
Speech Assessment
Basic
$ 0.0210/request
up to 2 mins
Includes basic scores
Pro
$ 0.0245/request
up to 2 mins
Includes scores and details like
grammar corrections
Start for free
Premium
Prepay $5,000+ credits for the year.
For our valued premium customers, we can devise a tailored pricing strategy that will bring added benefits to them.
Contact Sales
Top Questions
What distinguishes pronunciation assessment API from spontaneous speech Assessment API?
The Pronunciation Assessment API requires a reference text, meaning that users are expected to speak pre-scripted phrases. On the other hand, Spontaneous Speech Assessment API can evaluate speech in two different scenarios: with a topic prompt, such as in IELTS speaking tests, or without a prompt, such as in natural conversations.
What is the pricing model for the pronunciation assessment API based on?
The pricing for the Pronunciation Assessment API is based on the number of requests made, rather than the duration of use. It's worth noting that there are duration limits for different types of assessments: word assessments have a maximum limit of 20 seconds, sentence assessments have a limit of 90 seconds, and paragraph assessments have a limit of 3 minutes.
What happens if all of my prepaid credits are used up?
If all prepaid credits have been used up, the service will still be provided, but any additional usage will result in overage charges. Customers will be notified of these charges via email.
What happens if I don't use up all of my prepaid credits?
Any unused prepaid credits will be carried over to the next billing period.
What is the difference between Spontaneous Speech Assessment basic and pro?
The basic version provides an overall score along with individual scores for grammar, vocabulary, fluency, and pronunciation. The professional version offers additional features such as pausing indicators, CEFR-based vocabulary labels, grammatical error identification with suggested corrections, and word-level pronunciation scores, etc.
Introducing SpeechSuper's English Spontaneous Speech Assessment API. Transcription, Fluency, Grammar, Vocabulary & Pronunciation Feedback all in 1 API.