KikiYomu

KikiYomu is a lightweight, real-time Text-to-Speech (TTS) application that monitors your clipboard and instantly uses AI voice models to have anime-style characters narrate Japanese text from games or visual novels.

Features

Real-Time TTS: Automatically reads aloud Japanese text copied to your clipboard.
Speaker Tag Handling: Option to remove speaker tags like 【Name】 commonly found in RPGMaker and WolfRPG games.
Game Compatibility: Designed to work well with most visual novels and games that use stylized dialogue formatting.
User-Friendly GUI: A simple interface for selecting a voice model and adjusting settings.
Manual Control Over TTS: Lets you force-read text even when it has been filtered out.
Image OCR Support: Extracts Japanese text from images in your clipboard using OCR. Use it with a snipping tool for best results.
GPU Acceleration: Optional. Uses the GPU, if available, for faster OCR and voice-over.

Installation

Prerequisites

Python 3.8 or later
PyTorch (with CUDA if using GPU)
SoundDevice (for audio playback)
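
A quick check like the one below can confirm the prerequisites before you install anything else. It is a minimal sketch and not part of KikiYomu: `check_prerequisites` is a hypothetical helper, and the import names `torch` and `sounddevice` are assumed to correspond to the packages listed above.

```python
import importlib.util
import sys


def check_prerequisites(min_version=(3, 8), modules=("torch", "sounddevice")):
    """Return a dict describing which prerequisites are present."""
    report = {"python_ok": sys.version_info[:2] >= min_version}
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        report[name] = importlib.util.find_spec(name) is not None
    # Only ask about CUDA when PyTorch is actually importable.
    report["cuda"] = False
    if report.get("torch"):
        try:
            import torch
            report["cuda"] = bool(torch.cuda.is_available())
        except Exception:
            # A broken PyTorch install is treated as "no GPU".
            report["cuda"] = False
    return report
```

Calling `check_prerequisites()` returns a dictionary with one entry per check; a `False` value for `cuda` only means the CPU will be used.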

Setup Instructions

Clone the Repository

    git clone https://github.com/yourusername/KikiYomu.git
    cd KikiYomu

Install Python Dependencies

    pip install -r requirements.txt

Download Pretrained Voice Models

Visit the following Hugging Face repository to download the pretrained AI voice models:

AI-Voice Models

Place the .pth model files into the models/ directory.
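
To confirm that the model files ended up in the right place, you could list them with a few lines of Python. This is an illustrative sketch only: `list_voice_models` is a hypothetical helper, not a function from KikiYomu, and it assumes the models sit directly inside `models/` rather than in subfolders.

```python
from pathlib import Path


def list_voice_models(models_dir="models"):
    """Return the .pth files found directly inside models_dir, sorted by name."""
    folder = Path(models_dir)
    if not folder.is_dir():
        return []
    return sorted(p for p in folder.glob("*.pth") if p.is_file())
```

An empty list means either the directory is missing or no `.pth` file was copied into it.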

Usage

Start the App

    python gui.py

Alternatively, you can run KikiYomu.py directly from the command line; it still offers most of the same features.

Load a Model

In the "Models" panel, select a .pth model and click "Select Model".

Configure Settings (if needed)

Set the opening/closing signs used for spoken text (e.g., 「 and 」).
If you are playing an RPGMaker or WolfRPG game, enable the checkbox to remove speaker tags (【Name】) at the start of lines.
Adjust playback speed with the slider.
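
The effect of the sign and speaker-tag settings can be sketched as below. This is an assumption about how such a filter might behave, not KikiYomu's actual code: `prepare_line`, its parameters, and the choice to return only the text between the signs are all hypothetical.

```python
import re

# Matches one leading speaker tag such as 【Name】 plus any whitespace after it.
SPEAKER_TAG = re.compile(r"^\s*【[^】]*】\s*")


def prepare_line(text, open_sign="「", close_sign="」", strip_tags=True):
    """Return the text to speak, or None if the line does not pass the filter."""
    line = text.strip()
    if strip_tags:
        line = SPEAKER_TAG.sub("", line, count=1)
    # Assumed filter: only lines wrapped in the configured signs are spoken.
    if not (line.startswith(open_sign) and line.endswith(close_sign)):
        return None
    inner = line[len(open_sign):len(line) - len(close_sign)].strip()
    return inner or None
```

With tag removal enabled, a line such as `【アリス】「こんにちは」` passes the filter; with it disabled, the same line is rejected because it no longer starts with the opening sign.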

Copy Text to Speak

Copy any Japanese line of text to the clipboard. If it passes the filters, KikiYomu will automatically speak it aloud using the selected AI voice.
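
Clipboard monitoring of this kind is commonly done with a polling loop. The sketch below shows the general idea under stated assumptions: `watch_clipboard` is a hypothetical function, the clipboard reader and the callback are passed in by the caller, and nothing here is taken from KikiYomu's source.

```python
import time


def watch_clipboard(read_clipboard, on_new_text, polls=None, interval=0.2):
    """Poll read_clipboard() and call on_new_text() whenever the text changes.

    polls=None loops forever; a number limits how many times we poll.
    """
    last = read_clipboard()  # ignore whatever was copied before we started
    count = 0
    while polls is None or count < polls:
        time.sleep(interval)
        current = read_clipboard()
        if current != last:
            last = current
            if current:  # skip empty clipboard contents
                on_new_text(current)
        count += 1
```

In practice the reader could be something like `pyperclip.paste` from the third-party pyperclip package (whether KikiYomu uses it is not stated here), and the callback would hand the text to the filter and the selected voice model.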

Credits

Voice Models: zomehwh's VITS Models on Hugging Face
