Skip to content

thinkingmachines/ratchada-utils

Repository files navigation

Ratchada_Utils

License: MIT PyPI version Python Versions

A Python library for text processing and utilities related to the Ratchada Whisper model.

Installation

You can install ratchada_utils using pip:

pip install ratchada_utils

To install from source, clone the repository and run:

git clone https://github.com/yourusername/ratchada_utils.git
cd ratchada_utils
pip install .

Usage

Tokenizing Text

from ratchada_utils.processor import tokenize_text

text = "Your input text here."
tokenized_text = tokenize_text(text, pred=True)
print("Tokenized Text:", tokenized_text)
# Tokenized Text: ['your', 'input', 'text', 'here']

Evaluate Performance

from ratchada_utils.evaluator import simple_evaluator

result = pd.read_csv("./output/result-whisper-ratchada.csv")

summary = simple_evaluator(result["pred_text"], result["true_text"])

Requirements

  1. Python 3.10 or higher
  2. The Requirements are located in requirements.txt

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

Please made contact on the official repository of this project.

About

Utility for project ratchada speech to text

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages