-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Shrink size of terra-jupyter-base #333
Comments
I agree with this sentiment. Even if pulling the image does not eat into the time limit (somehow), the image is just too large. It's hard to develop on top of this as a base image when it is so huge. Maybe the solution is to have a "developer" base image that is totally minimal? |
thanks for reporting...The image is big becuz we're using https://cloud.google.com/deep-learning-containers/docs/choosing-container and these images are around 10G themselves. We do want to somehow make things better, but it's not currently prioritized yet.. |
To follow up, I just ran an experiment using #!/bin/bash
set -eu
echo "test" It timed out. Surely this is just one data point. But it's been consistent with my previous experiences with other startup scripts as well, where the startup script was just an installation of samtools and bcftools, and it took close to 30 minutes to finish. We could build a custom image based on Can you please help on this? |
I see. It's imaginable some of us not needing the DL functionalities, but do need some of the Jupyter-related stuff, i.e. what @sjfleming has suggested. So I think his suggestion makes sense, i.e. to truly make this a "base" image, as from IMHO my base case doesn't need GPU/DL. That being said, I understand it's not a priority for you. So we'd appreciate it if there are instructions for a PR to make that happen. |
Just to add another observation, I see that |
And @Qi77Qi , as @SHuang-Broad mentioned, I would also be happy to contribute to a PR to make this happen, if we know what would be acceptable in terms of requirements |
@sjfleming it is sth we want to address at some point but really haven't had bandwidth to address....if you don't mind taking a stab at it, would you mind write a design doc before you attempt implementation? so that we can review the proposal and give feedback before you spend too much time on this? |
Hi @Qi77Qi , thanks for your response. I can write a design doc; however, having zero experience with what a design doc is, I wonder if you could point me to an example! :) |
Currently, it's 8.4GB
https://console.cloud.google.com/gcr/images/broad-dsp-gcr-public/US/terra-jupyter-base@sha256:8b8da2a3ac90e04015694b0fae20518eb38db4a5f7bc18144d3fbf81e8d27066/details?tag=latest
Pulling that takes a long time (on my config, it took over 25 minutes, though I expect it to be faster on GCP's network). This eats into the 30 minutes limit of creating one's custom environment on Terra.
Thank you.
The text was updated successfully, but these errors were encountered: