Voices Enables Fast Text-to-Speech for Java Applications
Briefly

Voices Enables Fast Text-to-Speech for Java Applications
"Voices, an open-source text-to-speech project, was designed for applications running on Java 17 or newer. The library requires no external APIs or manually installed software. Audio files can be generated for various languages based on dictionaries or OpenVoice. Henry Coles, creator of Voices and Pitest and head of mutation testing at Arcmutate, introduced Voices on Bluesky in September 2025 and the latest version, released in late October 2025, is 0.0.8."
"Voices uses ONNX Runtime, a cross-platform AI engine that speeds up training and inference, supporting models from various deep learning frameworks such as TensorFlow and PyTorch. The runtime leverages hardware accelerators whenever possible and supports various hardware and operating system configurations. Several libraries are required for the examples demonstrated here. The following POM file configuration can be used with Maven: <!-- The main dependency --> <dependency> <groupId>org.pitest.voices</groupId> <artifactId>chorus</artifactId> <version>0.0.8</version> </dependency> <!-- A prepackaged model --> <dependency> <groupId>org.pitest.voices</groupId> <artifactId>alba</artifactId> <version>0.0.8</version> </dependency>"
"<!-- A dictionary of pronunciations --> <dependency> <groupId>org.pitest.voices</groupId> <artifactId>en_uk</artifactId> <version>0.0.8</version> </dependency> <!-- The runtime for ONNX models --> <dependency> <groupId>com.microsoft.onnxruntime</groupId> <artifactId>onnxruntime</artifactId> <version>1.22.0</version> </dependency> Alternatively, other build tools, such as Gradle, may be used for all examples demonstrated here. An en_us dictionary may be used instead of the en_uk dictionary by replacing the above dependency. The onnxruntime may be replaced by onnxruntime_gpu for GPU"
Voices is an open-source text-to-speech library for applications running on Java 17 or newer. The library produces audio files for multiple languages locally without external APIs or manually installed software, using pronunciation dictionaries or OpenVoice models. Voices executes models through ONNX Runtime, a cross-platform AI engine that speeds training and inference and leverages hardware accelerators across frameworks such as TensorFlow and PyTorch. The project supplies packaged artifacts (chorus, alba, en_uk) and depends on onnxruntime 1.22.0 by default with an option for onnxruntime_gpu for GPU acceleration. Maven or Gradle can manage the dependencies. Henry Coles released version 0.0.8 in late October 2025.
Read at InfoQ
Unable to calculate read time
[
|
]