MS 텍스트음성전환(Microsoft Text-to-Speech)


Microsoft 텍스트 음성 변환 플랫폼은 Microsoft Text-to-Speech engine을 사용하여 자연스러운 소리가 나는 텍스트를 읽습니다. 이 통합구성요소는 Cognitive Services 오퍼링의 일부이며 Bing Speech API로 알려진 API를 사용합니다. 무료 API 키가 필요합니다. Azure subscription을 사용하거나 Cognitive Services site에서 API 키를 얻을 수 있습니다.

설정

Microsoft로 텍스트 음성 변환을 활성화하려면 configuration.yaml에 다음 줄을 추가하십시오.

# Example configuration.yaml entry
tts:
  - platform: microsoft
    api_key: YOUR_API_KEY

Configuration Variables

api_key

(string)(Required)

Your API key.

language

(string)(Optional)

The language to use. Note that if you set the language to anything other than the default, you will need to specify a matching voice type as well. For the supported languages check the list of available languages.

Default value:

en-us

gender

(string)(Optional)

The gender you would like to use for the voice. Accepted values are Female and Male.

Default value:

Female

type

(string)(Optional)

The voice type you want to use. Accepted values are listed as the service name mapping in the documentation.

Default value:

ZiraRUS

rate

(integer)(Optional)

Change the rate of speaking in percentage. Example values: 25, 50.

Default value:

0

volume

(integer)(Optional)

Change the volume of the output in percentage. Example values: -20, 70.

Default value:

0

pitch

(string)(Optional)

Change the pitch of the output. Example values: high.

Default value:

default

contour

(string)(Optional)

Change the contour of the output in percentages. This overrides the pitch setting. See the W3 SSML specification for what it does. Example value: (0,0) (100,100).

region

(string)(Optional)

The region of your API endpoint. See documentation.

Default value:

eastus

전체 설정 사례

선택적 변수를 포함한 전체 설정 샘플 :

# Example configuration.yaml entry
tts:
  - platform: microsoft
    api_key: YOUR_API_KEY
    language: en-gb
    gender: Male
    type: George, Apollo
    rate: 20
    volume: -50
    pitch: high
    contour: (0, 0) (100, 100)
    region: eastus