Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[<Ray component: Core|RLlib|etc...>] ray train #49707

Open
Hxinyue opened this issue Jan 8, 2025 · 0 comments
Open

[<Ray component: Core|RLlib|etc...>] ray train #49707

Hxinyue opened this issue Jan 8, 2025 · 0 comments
Labels
bug Something that is supposed to be working; but isn't train Ray Train Related Issue triage Needs triage (eg: priority, bug/not-bug, and owning component)

Comments

@Hxinyue
Copy link

Hxinyue commented Jan 8, 2025

What happened + What you expected to happen

When ray train is performed, the train_loop_per_worker function has a return value. How can I get the return value of the train_loop_per_worker function after running the trainer.fit()

Versions / Dependencies

ray==2.11.0
python==3.10

Reproduction script

def main(config):
alg_id = config["alg_id"]
return {"success": 1}

trainer = TorchTrainer(
main,
scaling_config=ScalingConfig(use_gpu=use_gpu, num_workers=num_workers,
resources_per_worker={"CPU": cpu, "GPU": gpu}),
train_loop_config=config,
...
)
trainer.fit()

Issue Severity

High: It blocks me from completing my task.

@Hxinyue Hxinyue added bug Something that is supposed to be working; but isn't triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Jan 8, 2025
@jcotant1 jcotant1 added the train Ray Train Related Issue label Jan 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something that is supposed to be working; but isn't train Ray Train Related Issue triage Needs triage (eg: priority, bug/not-bug, and owning component)
Projects
None yet
Development

No branches or pull requests

2 participants