Issues: ml-energy/zeus
Subclass torch.distributed.pipelining.PipelineStage for PFO
enhancement
New feature or request
#131
opened Oct 13, 2024 by
jaywonchung
Add CPU support for the PowerMonitor
enhancement
New feature or request
#128
opened Sep 23, 2024 by
sharonsyh
[Testing] A simple mock device implementation for testing
enhancement
New feature or request
#127
opened Sep 20, 2024 by
jaywonchung
[RFC] Integration of Prometheus Push Gateway and Energy Metrics Collection in Zeus
#125
opened Sep 15, 2024 by
sharonsyh
Keep compatibility with amdsmi API change
integration
good first issue
Good for newcomers
#123
opened Sep 10, 2024 by
jaywonchung
Lazily initialize RAPL wraparound monitor processes
enhancement
New feature or request
good first issue
Good for newcomers
#121
opened Sep 10, 2024 by
jaywonchung
Detect anomalies in getTotalEnergyConsumption return value and fall back to power polling for AMD GPUs
#113
opened Aug 26, 2024 by
parthraut
Integration with IPMI metrics
enhancement
New feature or request
#112
opened Aug 25, 2024 by
jaywonchung
Add Intel RAPL support to zeusd
enhancement
New feature or request
#110
opened Aug 16, 2024 by
jaywonchung
Support for NVIDIA Jetson platforms
enhancement
New feature or request
#103
opened Jul 26, 2024 by
jaywonchung
6 tasks
[Zeusd] Better failure handling and testing
enhancement
New feature or request
#88
opened May 30, 2024 by
jaywonchung
Training framework integration opportunities
integration
roadmap
#77
opened May 16, 2024 by
jaywonchung
Test and verify nvmlDeviceSetAPIRestriction
enhancement
New feature or request
#59
opened May 3, 2024 by
jaywonchung
Carbon-aware Zeus (Chase) as an optimizer
enhancement
New feature or request
#53
opened Apr 28, 2024 by
jaywonchung
GlobalPowerLimitOptimizer for distributed data parallel training
enhancement
#43
opened Mar 13, 2024 by
jaywonchung
3 tasks
Cluster-wide energy metric aggregation
enhancement
New feature or request
#30
opened Oct 27, 2023 by
jaywonchung
OperationProfiler and PerseusOptimizer server and client
enhancement
#21
opened Oct 8, 2023 by
jaywonchung