Determined: Deep Learning Training Platform

Determined helps deep learning teams train models more quickly, easily share GPU resources, and effectively collaborate. Determined allows deep learning engineers to focus on building and training models at scale, without needing to worry about DevOps or writing custom code for common tasks like fault tolerance or experiment tracking.

You can think of Determined as a platform that bridges the gap between tools like TensorFlow and PyTorch --- which work great for a single researcher with a single GPU --- to the challenges that arise when doing deep learning at scale, as teams, clusters, and data sets all increase in size.

Key Features

high-performance distributed training without any additional changes to your model code
intelligent hyperparameter optimization based on cutting-edge research
flexible GPU scheduling, including dynamically resizing training jobs on-the-fly and automatic management of cloud resources on AWS and GCP
built-in experiment tracking, metrics storage, and visualization
automatic fault tolerance for DL training jobs
integrated support for TensorBoard and GPU-powered Jupyter notebooks

To use Determined, you can continue using popular DL frameworks such as TensorFlow and PyTorch; you just need to modify your model code to implement the Determined API.

Installation

Installation Guide

Try Now on AWS

Next Steps

For a brief introduction to using Determined, start with the Quick Start Guide.

To port an existing deep learning model to Determined, follow the tutorial for your preferred deep learning framework:

Documentation

The documentation for the latest version of Determined can always be found here.

Community

If you need help, want to file a bug report, or just want to keep up-to-date with the latest news about Determined, please join the Determined community!

Slack is the best place to ask questions about Determined and get support. Click here to join our Slack.
You can also join the community mailing list to ask questions about the project and receive announcements.
To report a bug, file an issue on GitHub.
To report a security issue, email [email protected].

Contributing

Contributor's Guide

License

Apache V2

Name		Name	Last commit message	Last commit date
Latest commit History 333 Commits
.circleci		.circleci
.github		.github
CI		CI
agent		agent
cli		cli
common		common
deploy		deploy
docs		docs
examples		examples
harness		harness
master		master
packaging		packaging
scripts		scripts
tests		tests
webui		webui
.bumpversion.cfg		.bumpversion.cfg
.conform.yaml		.conform.yaml
.dockerignore		.dockerignore
.flake8		.flake8
.gitignore		.gitignore
.goreleaser.yml		.goreleaser.yml
.mailmap		.mailmap
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
RELEASE.md		RELEASE.md
VERSION		VERSION
combined-reqs.in		combined-reqs.in
combined-reqs.txt		combined-reqs.txt
determined-logo.png		determined-logo.png
dev-requirements.txt		dev-requirements.txt
docs-requirements.txt		docs-requirements.txt
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
release-requirements.txt		release-requirements.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Determined: Deep Learning Training Platform

Key Features

Installation

Try Now on AWS

Next Steps

Documentation

Community

Contributing

License

About

Releases

Packages

Languages

License

atalwalkar/determined-1

Folders and files

Latest commit

History

Repository files navigation

Determined: Deep Learning Training Platform

Key Features

Installation

Try Now on AWS

Next Steps

Documentation

Community

Contributing

License

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages