VoiceText™ Text-To-Speech (TTS) Server

VT Server

Overview

VoiceText™ Text-To-Speech Server is an add-on software component for the VoiceText™ TTS Engine that manages multi-threaded, multi-voice TTS services in a multi-access, multi-user environment. With the provided SDK, it is typically used to build custom server applications such as telephony systems and alert systems.

Features

In addition to the features provided by the VoiceText™ TTS Engine(s), VoiceText™ TTS Server provides the following.

Web-based Configuration Panel

VoiceText™ Text-To-Speech Server can be managed through a web-based configuration panel. From the panel you can enable or disable TTS Engines and set defaults such as speed, volume, and sentence pause duration.

Thread-Usage Monitoring

Thread usage can be monitored to see how many TTS synthesis requests are running concurrently at a given time. Graphs of both real-time and historical usage are available.

Reporting

VoiceText™ Text-To-Speech Server’s reporting feature allows you to generate custom reports regarding the number of TTS requests, total input text size, response time, etc.

Multiple Custom Dictionary Utilization and Management

Multiple custom dictionaries can be managed, and one or more can be selected for a given TTS request. This allows you to group custom pronunciations according to need.
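The idea of grouping pronunciations and choosing a group per request can be sketched in a few lines of Python. This is only an illustration of the concept, not the VoiceText™ SDK or its dictionary format: the dictionary names, the entries, and the apply_dictionaries helper are all hypothetical, and the sketch assumes a simple whole-word text substitution before synthesis.

```python
import re

# Hypothetical dictionaries: each maps a written form to a spoken respelling.
DICTIONARIES = {
    "medical": {"mg": "milligrams", "IV": "eye vee"},
    "finance": {"IV": "implied volatility", "YoY": "year over year"},
}


def apply_dictionaries(text, selected):
    """Rewrite text using only the dictionaries selected for this request."""
    merged = {}
    for name in selected:
        # Later dictionaries override earlier ones on conflicting entries.
        merged.update(DICTIONARIES[name])
    if not merged:
        return text
    # Match whole words only, so "mg" does not fire inside a longer word.
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(key) for key in merged) + r")\b"
    )
    return pattern.sub(lambda match: merged[match.group(1)], text)


print(apply_dictionaries("Start the IV at 5 mg", ["medical"]))
print(apply_dictionaries("IV rose YoY", ["finance"]))
```

The same abbreviation ("IV") is read differently depending on which dictionary the request selects, which is the kind of grouping described above.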

Speech Synthesis Markup Language (SSML) Support

VoiceText™ Text-To-Speech Server supports SSML, a rich XML-based markup language that assists the generation of synthetic speech. SSML gives authors of synthesizable content a standard way to control aspects of speech such as pronunciation, volume, pitch, and rate across different synthesis-capable platforms.
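To give a feel for this kind of markup, the sketch below builds a small SSML 1.0 document as a Python string and checks that it is well-formed XML. The elements used (speak, prosody, break, say-as) come from the W3C SSML specification; which elements and attribute values VoiceText™ Text-To-Speech Server actually honors is not stated here, so the selection, and in particular the "characters" value for interpret-as, should be treated as an assumption.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SSML 1.0 recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"

# Example input text with markup for rate, volume, a pause, and spelling out.
ssml_document = f"""<?xml version="1.0"?>
<speak version="1.0" xmlns="{SSML_NS}" xml:lang="en-US">
  Your flight departs from
  <prosody rate="slow" volume="loud">gate twelve</prosody>.
  <break time="500ms"/>
  The confirmation code is
  <say-as interpret-as="characters">AB12</say-as>.
</speak>"""

# Parsing fails with ParseError if the markup is not well-formed XML.
root = ET.fromstring(ssml_document)

# List the markup elements in document order, without the namespace prefix.
element_names = [child.tag.split("}")[1] for child in root]
print(element_names)
```

Checking well-formedness before submitting text is a cheap way to catch unclosed tags, although it does not validate the document against the SSML schema.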

Flexible Data Output Formats

VoiceText™ Text-To-Speech Server currently supports the following formats at 8 kHz, 11 kHz, and 16 kHz sampling rates:

16bit linear PCM
8bit A-law PCM
8bit Mu-law PCM
4bit Dialogic ADPCM
16bit linear PCM Wave
8bit unsigned linear PCM Wave
8bit A-law PCM Wave
8bit Mu-law PCM Wave
ASF (Windows only)
OGG
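For the raw (headerless) formats above, the data rate follows directly from the sampling rate and the bits per sample. The sketch below works through that arithmetic under two assumptions that the text does not confirm: the audio is mono, and the nominal "11 kHz" rate is taken as 11000 Hz (many systems use 11025 Hz). Wave headers, ASF, and OGG are left out because their sizes depend on the container and encoder.

```python
# Bits per sample for the raw formats listed above.
BITS_PER_SAMPLE = {
    "16bit linear PCM": 16,
    "8bit A-law PCM": 8,
    "8bit Mu-law PCM": 8,
    "4bit Dialogic ADPCM": 4,
}

# Nominal sampling rates in Hz (11000 is an assumption for "11 kHz").
SAMPLE_RATES_HZ = [8000, 11000, 16000]


def bytes_per_second(bits_per_sample: int, sample_rate_hz: int) -> int:
    """Data rate of mono audio: samples per second times bytes per sample."""
    return sample_rate_hz * bits_per_sample // 8


for name, bits in BITS_PER_SAMPLE.items():
    rates = ", ".join(
        f"{rate} Hz: {bytes_per_second(bits, rate)} B/s"
        for rate in SAMPLE_RATES_HZ
    )
    print(f"{name}: {rates}")
```

As a worked example, one minute of 8 kHz Mu-law audio occupies 8000 × 60 = 480,000 bytes, while the same minute in 16-bit linear PCM occupies twice that.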

Support of APIs

VoiceText™ supports C, COM, Java, and .NET-based Application Programming Interfaces (APIs).

System Requirements

Operating System: Windows Server 2000 or 2003, or Linux RHEL 4 or higher (known to work under other Red Hat-based distributions, such as Fedora)
CPU: Pentium IV 1.7 GHz
RAM: 1 GB (depending on the number of channels)
Database space: 35-900 MB or more, depending on the number of VoiceText™ TTS Engines installed
