Code for collecting and generating text for TTS dataset creation.
Developers:
- Anders Jess Pedersen ([email protected])
- Dan Saattrup Nielsen ([email protected])
The quickest way to build the dataset is using Docker. With Docker installed, simply run `make docker`, and the final dataset will be built in the `data/processed` directory, with the individual datasets in `data/raw`.
To install the project for further development, run the following steps:

- Run `make install`, which installs Poetry (if it isn't already installed) and sets up a virtual environment with all Python dependencies therein.
- Run `source .venv/bin/activate` to activate the virtual environment.
With the project installed, you can build the dataset by running:

`python src/scripts/build_tts_dataset.py`
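The build script's exact behavior isn't documented here, but conceptually it combines the per-category text generators in `src/tts_text` (dates, times, bus stops and stations) into one dataset. A minimal sketch of that pattern, with hypothetical generator functions and file names standing in for the real modules:

```python
from pathlib import Path

# Hypothetical stand-ins for the generators in src/tts_text
# (the real modules are dates.py, times.py and bus_stops_and_stations.py).
def build_date_texts() -> list[str]:
    return ["den 1. januar 2024", "den 17. maj 2024"]

def build_time_texts() -> list[str]:
    return ["klokken 12.30", "klokken 8.15"]

def main() -> None:
    raw_dir = Path("data/raw")
    processed_dir = Path("data/processed")
    raw_dir.mkdir(parents=True, exist_ok=True)
    processed_dir.mkdir(parents=True, exist_ok=True)

    # Write each category to its own raw file, then merge everything into
    # one processed file, mirroring the data/raw vs. data/processed split.
    categories = {"dates": build_date_texts(), "times": build_time_texts()}
    all_texts: list[str] = []
    for name, texts in categories.items():
        (raw_dir / f"{name}.txt").write_text("\n".join(texts), encoding="utf-8")
        all_texts.extend(texts)
    (processed_dir / "tts_text.txt").write_text("\n".join(all_texts), encoding="utf-8")

if __name__ == "__main__":
    main()
```

This is only a sketch of the raw-then-processed layout described above, not the actual implementation of `build_tts_dataset.py`.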
NB: Running the above script on a machine running macOS may result in an `urllib.error.URLError` exception being thrown, in which case one should follow the steps described here.
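The macOS error typically stems from Python not finding the system's SSL root certificates. A common workaround (assuming a python.org build of Python; the version directory below is an assumption, adjust it to your installed version) is to run the certificate installer bundled with Python:

```shell
# Locate and run the certificate installer that ships with python.org
# builds of Python on macOS. The version directory (3.11 here) is an
# assumption; change it to match your installed Python version.
CERT_SCRIPT="/Applications/Python 3.11/Install Certificates.command"
if [ -e "$CERT_SCRIPT" ]; then
    # Installs the certifi certificate bundle for this Python install.
    /bin/bash "$CERT_SCRIPT"
else
    echo "Certificate installer not found at: $CERT_SCRIPT"
fi
```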
```
.
├── .devcontainer
│   └── devcontainer.json
├── .github
│   └── workflows
│       ├── ci.yaml
│       └── docs.yaml
├── .gitignore
├── .pre-commit-config.yaml
├── CODE_OF_CONDUCT.md
├── CONTRIBUTING.md
├── Dockerfile
├── LICENSE
├── README.md
├── config
│   ├── __init__.py
│   ├── config.yaml
│   └── hydra
│       └── job_logging
│           └── custom.yaml
├── data
│   ├── final
│   │   └── .gitkeep
│   ├── processed
│   │   └── .gitkeep
│   └── raw
│       └── .gitkeep
├── docs
│   └── .gitkeep
├── gfx
│   ├── .gitkeep
│   └── alexandra_logo.png
├── makefile
├── models
│   └── .gitkeep
├── notebooks
│   └── .gitkeep
├── poetry.lock
├── poetry.toml
├── pyproject.toml
├── src
│   ├── scripts
│   │   ├── build_tts_dataset.py
│   │   └── fix_dot_env_file.py
│   └── tts_text
│       ├── __init__.py
│       ├── __pycache__
│       ├── bus_stops_and_stations.py
│       ├── dates.py
│       ├── times.py
│       └── utils.py
└── tests
    ├── __init__.py
    ├── __pycache__
    └── test_dummy.py
```
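As an illustration of the kind of text the `dates.py` and `times.py` modules generate, here is a small, self-contained sketch that renders a date in written-out Danish. The function name and output format are hypothetical illustrations, not the modules' actual API:

```python
import datetime

# Danish month names, indexed from 1 (an assumption: the real module
# may use a different representation).
DANISH_MONTHS = [
    None, "januar", "februar", "marts", "april", "maj", "juni",
    "juli", "august", "september", "oktober", "november", "december",
]

def date_to_danish_text(date: datetime.date) -> str:
    """Render a date as written-out Danish, e.g. 'den 3. maj 2024'.

    A hypothetical helper, not the API of src/tts_text/dates.py.
    """
    return f"den {date.day}. {DANISH_MONTHS[date.month]} {date.year}"

print(date_to_danish_text(datetime.date(2024, 5, 3)))  # den 3. maj 2024
```

Spelling out dates and times like this gives a TTS model unambiguous, pronounceable text rather than numeric formats such as `03-05-2024`.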