com.microsoft.cognitiveservices.speech
Classes
AudioDataStream |
Represents audio data stream used for operating audio data as a stream. |
AutoDetectSourceLanguageConfig |
Represents auto detect source language configuration used for specifying the possible source language candidates. Note: close() must be called in order to release underlying resources held by the object. |
AutoDetectSourceLanguageResult |
Represents the result of auto detecting source languages. Added in version 1.8.0. |
CancellationDetails |
Contains detailed information about why a result was canceled. |
ClassLanguageModel |
Represents a Class |
Connection |
Connection is a proxy class for managing connection to the speech service of the specified Recognizer. |
ConnectionEventArgs |
Defines payload for connection events like Connected/Disconnected. |
ConnectionMessage |
Connection |
ConnectionMessageEventArgs |
Defines payload for Connection's Message |
ContentAssessmentResult |
Represents the result of pronunciation assessment. |
Diagnostics |
Native logging and other diagnostics |
EmbeddedSpeechConfig |
Class that defines embedded (offline) speech configuration. |
Grammar |
Represents a generic grammar used to assist in improving speech recognition accuracy. |
GrammarList |
Allows adding multiple grammars to a Speech |
HybridSpeechConfig |
Class that defines hybrid (cloud and embedded) configurations for speech recognition and speech synthesis. |
KeywordRecognitionEventArgs |
Defines the content of keyword recognizing/recognized events. |
KeywordRecognitionModel |
Represents a keyword recognition model for recognizing when the user says a keyword to initiate further speech recognition. |
KeywordRecognitionResult |
Defines result of keyword recognition. |
KeywordRecognizer |
Performs keyword recognition on the speech input. |
NoMatchDetails |
Contains detailed information for No |
PhraseListGrammar |
Allows additions of new phrases to improve speech recognition. |
PronunciationAssessmentConfig |
Represents pronunciation assessment configuration. |
PronunciationAssessmentResult |
Represents the result of pronunciation assessment. |
PropertyCollection |
Represents collection of properties and their values. |
RecognitionEventArgs |
Defines payload for recognition events like Speech Start/End Detected. |
RecognitionResult |
Contains detailed information about result of a recognition operation. |
Recognizer |
Defines the base class Recognizer which mainly contains common event handlers. |
SessionEventArgs |
Defines payload for Session |
SourceLanguageConfig |
Represents source language configuration used for specifying recognition source language. |
SpeechConfig |
Speech configuration. |
SpeechRecognitionCanceledEventArgs |
Defines payload of speech recognition canceled events. |
SpeechRecognitionEventArgs |
Defines contents of speech recognizing/recognized event. |
SpeechRecognitionModel |
Contains detailed speech recognition model information. |
SpeechRecognitionResult |
Defines result of speech recognition. |
SpeechRecognizer |
Performs speech recognition from microphone, file, or other audio input streams, and gets transcribed text as result. |
SpeechSynthesisBookmarkEventArgs |
Defines contents of speech synthesis bookmark event. |
SpeechSynthesisCancellationDetails |
Contains detailed information about why a speech synthesis was canceled. |
SpeechSynthesisEventArgs |
Defines contents of speech synthesis related event. |
SpeechSynthesisResult |
Contains detailed information about result of a speech synthesis operation. |
SpeechSynthesisVisemeEventArgs |
Defines contents of speech synthesis viseme event. |
SpeechSynthesisWordBoundaryEventArgs |
Defines contents of speech synthesis word boundary event. |
SpeechSynthesizer |
Performs speech synthesis to speaker, file, or other audio output streams, and gets synthesized audio as result. |
SpeechTranslationModel |
Contains detailed speech translation model information. |
SynthesisVoicesResult |
Contains detailed information about the retrieved synthesis voices list. |
VoiceInfo |
Contains detailed information about a synthesis voice. |
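The AutoDetectSourceLanguageConfig entry above notes that close() must be called to release underlying resources. A minimal sketch of one way to guarantee that call is Java's try-with-resources statement. The sketch assumes the SDK object implements `AutoCloseable` (check the class's own reference page). `FakeLanguageConfig`, `useAndClose`, and `isClosed` are hypothetical stand-ins introduced for illustration so the example runs without the Speech SDK; they are not part of this package.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CloseSketch {

    // Hypothetical stand-in for an SDK object that holds native resources.
    // It is NOT a Speech SDK class; it only mimics the close() contract.
    static final class FakeLanguageConfig implements AutoCloseable {
        private final List<String> languages;
        private boolean closed = false;

        FakeLanguageConfig(List<String> languages) {
            this.languages = new ArrayList<>(languages);
        }

        List<String> getLanguages() {
            if (closed) {
                throw new IllegalStateException("config already closed");
            }
            return languages;
        }

        boolean isClosed() {
            return closed;
        }

        @Override
        public void close() {
            // A real SDK object would release its native handle here.
            closed = true;
        }
    }

    // Uses the config inside try-with-resources, so close() runs even if
    // the body throws. Returns the (now closed) object for inspection.
    static FakeLanguageConfig useAndClose(List<String> candidates) {
        FakeLanguageConfig ref;
        try (FakeLanguageConfig config = new FakeLanguageConfig(candidates)) {
            ref = config;
            System.out.println("Candidates: " + config.getLanguages());
        }
        return ref;
    }

    public static void main(String[] args) {
        FakeLanguageConfig config = useAndClose(Arrays.asList("en-US", "de-DE"));
        System.out.println("Closed after use: " + config.isClosed());
    }
}
```

With a real SDK object the same shape would apply: construct it in the resource clause of the `try` and keep all uses inside the block, so the object is never touched after it has been closed.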
Enums
CancellationErrorCode |
Defines error code in case that Cancellation |
CancellationReason |
Defines the possible reasons a recognition result might be canceled. |
NoMatchReason |
Defines the possible reasons a recognition result might not be recognized. |
OutputFormat |
Defines Speech Recognizer output formats. |
ProfanityOption |
Defines the profanity option for the response result. |
PronunciationAssessmentGradingSystem |
Defines the point system for pronunciation score calibration; default value is Five |
PronunciationAssessmentGranularity |
Defines the pronunciation evaluation granularity; default value is Phoneme. |
PropertyId |
Defines property ids. |
ResultReason |
Defines the possible reasons a recognition result might be generated. |
ServicePropertyChannel |
Defines channels used to send service properties. |
SpeechSynthesisBoundaryType |
Defines the boundary type of speech synthesis boundary event. |
SpeechSynthesisOutputFormat |
Defines the possible speech synthesis output audio format. |
StreamStatus |
Defines the possible status of audio data stream. |
SynthesisVoiceGender |
Defines synthesis voice gender. |
SynthesisVoiceType |
Defines synthesis voice type. |