Ozeki VoIP SIP SDK
High performance VoIP SDK for .NET developers

How to use text to speech and speech to text conversion with Microsoft Speech Platform 11?

Download: msp-11.zip
Download: ms-speech-platform.zip

This article explains how to use text-to-speech and speech-to-text conversion with Microsoft Speech Platform (Version 11) together with Ozeki VoIP SIP SDK. After reading this page, you will be able to have text read aloud in different languages and to recognize incoming speech. The components you need for your own solution are listed below:

Figure 1 - Text to speech conversion

Download: Microsoft Speech Platform - Software Development Kit (SDK) (Version 11)
Download: Microsoft Speech Platform - Runtime (Version 11)

What is Microsoft Speech Platform 11 used for?

Text to speech conversion means that a program reads aloud the text you have typed in. This can be useful, for example, when a mute person wants to communicate in voice calls. Text to speech conversion can also be used in interactive voice response (IVR) systems if you want the computer to read out the IVR tree navigation information.
You can learn more about this conversion from the How to use TextToSpeech article.

Speech recognition can be used for many things. It relies on standard algorithms that recognize spoken words. One of the most important uses of this technology is communicating with a deaf person in a voice call: you talk into the microphone, and at the other end the deaf user can read what you said in written form.
You can learn more about this feature from the How to implement Voice Recognition article.

Figure 2 - Speech to text conversion

How to use Microsoft Speech Platform 11 in C#?

There are a few important steps to complete before you can start developing your softphone (or other application):

1. Download and install the Microsoft products listed above. Please note that at the language selection the languages are tagged with "SR" (Speech Recognition) or "TTS" (Text-To-Speech) according to their purpose.
2. Download the msp-11.zip file, which contains two classes:
- MSSpeechPlatformSTT: a class for voice recognition
- MSSpeechPlatformTTS: a class for Text-To-Speech
You will have to add these files to your project.
3. Create a new Visual Studio project, and:
- Add a reference to Ozeki VoIP SIP SDK.
- Add a reference to Microsoft.Speech.dll in the same way as you did with ozeki.dll.
- Add the classes downloaded above to the project.
- If you are using the 64-bit edition, make sure that the "Prefer 32-bit" checkbox on the "Build" tab of the project properties is not checked, otherwise it may cause errors.
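To confirm how the "Prefer 32-bit" setting took effect, the running process can report its own bitness. The following is a minimal sketch, not part of the SDK: it uses only the standard .NET properties Environment.Is64BitOperatingSystem and Environment.Is64BitProcess, and the names BitnessCheck and Describe are hypothetical. The article does not say which component's edition has to match, so the message only flags the case that the checkbox typically produces.

```csharp
using System;

static class BitnessCheck
{
    // Pure helper (hypothetical name) so the decision can be checked
    // without depending on the machine the code runs on.
    public static string Describe(bool is64BitOs, bool is64BitProcess)
    {
        if (is64BitOs && !is64BitProcess)
            return "32-bit process on a 64-bit OS: check the \"Prefer 32-bit\" setting";
        return is64BitProcess ? "64-bit process" : "32-bit process";
    }

    static void Main()
    {
        // Both properties are part of System.Environment (.NET Framework 4.0 and later).
        Console.WriteLine(Describe(Environment.Is64BitOperatingSystem,
                                   Environment.Is64BitProcess));
    }
}
```

Running this once at startup, before the speech engines are created, makes a bitness mismatch visible before it can surface as an engine error.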

After these steps, you can begin developing your softphone. Since you are already familiar with the Text-To-Speech and Speech-To-Text implementations, only the new steps and tasks are introduced here.

Text-To-Speech: for this conversion, pass a new instance of MSSpeechPlatformTTS to the TextToSpeech object's AddTTSEngine() method. After that, you can list the available voices by calling the object's GetAvailableVoices() method, and you can select one of them with the ChangeLanguage() method.

Speech-To-Text: set a new instance of MSSpeechPlatformSTT as the engine of the SpeechToText object with its ChangeSTTEngine() method. After that, you can list all the available recognizers with the object's GetRecognizers() method, and you can select one of them with the object's ChangeRecognizer() method.
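Both selection steps pick an entry by its language tag, and the usage example in the next section does nothing if the wanted language (en-GB) is not installed. A possible way to handle that case is sketched here in plain C#, independent of the SDK: the helper LanguagePicker.Pick is a hypothetical name, and it works on language tags as strings rather than on the SDK's own voice or recognizer objects.

```csharp
using System;
using System.Collections.Generic;

static class LanguagePicker
{
    // Returns the first available tag equal to 'preferred' (ignoring case),
    // otherwise the first tag equal to 'fallback', otherwise null.
    public static string Pick(IEnumerable<string> available, string preferred, string fallback)
    {
        string fallbackMatch = null;
        foreach (var tag in available)
        {
            if (string.Equals(tag, preferred, StringComparison.OrdinalIgnoreCase))
                return tag;
            if (fallbackMatch == null &&
                string.Equals(tag, fallback, StringComparison.OrdinalIgnoreCase))
                fallbackMatch = tag;
        }
        return fallbackMatch;
    }

    static void Main()
    {
        // Sample data only; in a real project the tags would come from the installed languages.
        var installed = new[] { "en-US", "de-DE" };
        Console.WriteLine(Pick(installed, "en-GB", "en-US") ?? "no suitable language installed");
    }
}
```

In a project that uses the SDK, the tags would be collected from the Language property of each voice or the Culture.Name property of each recognizer, as the usage example does, and the chosen tag would then decide which entry is passed to ChangeLanguage() or ChangeRecognizer().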

Usage example of Microsoft Speech Platform 11 in C#

using System;
using System.Threading;
using Ozeki.Media;

namespace Microsoft_Speech_Platform
{
    class Program
    {
        static Speaker _speaker;
        static Microphone _microphone;
        static MediaConnector _connector;
        static TextToSpeech _tts;
        static SpeechToText _stt;

        static void Main(string[] args)
        {
            _microphone = Microphone.GetDefaultDevice();
            _speaker = Speaker.GetDefaultDevice();
            _connector = new MediaConnector();

            SetupTextToSpeech();

            SetupSpeechToText();

            // Keep the process alive so playback and recognition can continue.
            while (true) Thread.Sleep(10);
        }

        static void SetupTextToSpeech()
        {
            _tts = new TextToSpeech();
            _tts.AddTTSEngine(new MSSpeechPlatformTTS());

            // List the voices offered by the engine and select the en-GB one.
            var voices = _tts.GetAvailableVoices();
            foreach (var voice in voices)
            {
                if (voice.Language.Equals("en-GB"))
                    _tts.ChangeLanguage(voice.Language, voice.Name);
            }

            _speaker.Start();
            _connector.Connect(_tts, _speaker);
            _tts.AddAndStartText("Hello World!");
        }


        static void SetupSpeechToText()
        {
            string[] words = {"Hello", "Welcome"};
            _stt = SpeechToText.CreateInstance(words);
            _stt.WordRecognized += stt_WordRecognized;
            _stt.ChangeSTTEngine(new MSSpeechPlatformSTT());

            // List the recognizers offered by the engine and select the en-GB one.
            var recognizers = _stt.GetRecognizers();
            foreach (var recognizer in recognizers)
            {
                if (recognizer.Culture.Name == "en-GB")
                    _stt.ChangeRecognizer(recognizer.ID);
            }

            _connector.Connect(_microphone, _stt);
            _microphone.Start();
        }

        static void stt_WordRecognized(object sender, SpeechDetectionEventArgs e)
        {
            Console.WriteLine("Word recognized: {0}", e.Word);
        }
    }
}

Download Text to Speech languages:

Download Speech to Text languages:
