Updates


Model with tunings for reducing high frequency noise

by Krishna Sankar M on 1st May 2023

Updated model with hooks in the api for controlling the high frequency noise is made available. Use the input parameter noise_scale along with modelver : 'v4.0' .


Updated Website

by Krishna Sankar M on 18th February 2023

Happy to announce that we have improved bare-bones website to a more colourful website, with improved readability :)


Dynamic Pitch & Pace Transform

by Krishna Sankar M on 21st December 2022

The naturalness of the sysnthesized audio is improved by having randomness in the synthesis of breaks due to punctuations. The support for dynamic pitch_scale, pitch_offset, pace_scale and new parameter punctuation_breaks was added from model v3.3 onwards. Please see API documention for more details.


Improving Efficiency in Synthesis of Sentence Breaks

by Krishna Sankar M on 4th September 2022

The breaks in sentences due to punctuations can be efficiently synthesized by insertion of dummy durations. There was efficiency improvements under-the-hood which reduced the sythensis time for long articles. This is available from model v3.2 onwards. Please see API documention for more details.


Adding Read Along Feature

by Krishna Sankar M on 12th August 2022

We have added support for read along i.e. the audio and corresponding text will be paired. In the output, we will be adding a new key "durations" which captures this information. This is available from model v3.1 onwards. Please see API documention for more details.


Supporting Punctuation

by Krishna Sankar M on 10th July 2022

We have added support for punctuation with model from v3.0. Please see API documention for more details.


More Naturally sounding Model

by Krishna Sankar M on 19th May 2022

We have made available more naturally sounding model as v2.0. Please see API documention for more details.


Availability of APIs

by Krishna Sankar M on 5th May 2022

We have made available API access to the text to speech service. Please see API documention.

keywords : api, cloud text to speech synthesis


Inital Release

by Krishna Sankar M on 20th February 2022

മലയാളം അക്ഷരത്തിൽ നിന്നും ശബ്ദം സാധ്യമാകുന്ന ഒരു സോഫ്റ്റ്‌വെയർ അവതരിപ്പിക്കുന്നതിൽ സന്തോഷം ഉണ്ട്. ഇത് നിലവിലുള്ള ന്യൂറൽ നെറ്റ്‌വർക്ക് സാങ്കേതിക വിദ്യയും, അതോടൊപ്പമുള്ള ഓപ്പൺ സോഴ്സ് സോഫ്റ്റ്‌വെയറും ഉപയോഗിച്ച് നിർമിച്ചതാണ്. ഇന്നത്തെ നിലയ്ക്ക് ഈ ശബ്ദം വൈകാരികമായ ഉള്ളടക്കം ഇല്ലാത്ത വാർത്തകൾ വായിക്കാൻ ഉചിതമാണ് . ഇത് ഒരു തുടക്കം മാത്രം. ഒരോ വ്യക്തിയുടെയും വൈകാരികമായ ശബ്ദം ഉത്‌പാദിപ്പിക്കുന്ന സോഫ്റ്റ്‌വെയർ ആണ് ലക്‌ഷ്യം.

Happy to announce the availability of malayalam text to speech synthesis. This is built using some of the latest neural network architectures and open source code around it. In its current shape, the generated speech is good for reading out news or articles which does not have emotional content. This is just a start, and the goal is to reach personalized emotional text to speech synthesis.

keywords : cloud text to speech synthesis, malayalam, bi-lingual, english, multiple speakers


Select your color
Theme Options