Release v0.13.0 #456

Merged
merged 14 commits into from
Aug 8, 2024

Conversation

Contributor

@TRNWWZ TRNWWZ commented Jul 17, 2024

Issue #, if available:
release v0.13.0

Description of changes:

  • Upgrade Python to 3.11.9
  • Enforce dependencies are all installed from conda-forge

Staleness Report: 0.13.0(gpu)

| Package | Current Version in the Distribution image | Latest Relevant Version in Upstream |
|---|---|---|
| ipython | 8.26.0 | 8.26.0 |
| jinja2 | 3.1.4 | 3.1.4 |
| ipywidgets | 7.8.3 | 7.8.3 |
| numpy | 1.26.4 | 1.26.4 |
| pandas | 2.2.2 | 2.2.2 |
| boto3 | 1.34.151 | 1.34.151 |
| aws-glue-sessions | 1.0.6 | 1.0.6 |
| conda | 23.11.0 | 23.11.0 |
| jupyterlab | 3.6.7 | 3.6.7 |
| ${\color{red}keras}$ | 2.14.0 | 2.15.0 |
| matplotlib | 3.9.1 | 3.9.1 |
| pip | 23.3.2 | 23.3.2 |
| scipy | 1.14.0 | 1.14.0 |
| scikit-learn | 1.5.1 | 1.5.1 |
| py-xgboost-gpu | 1.7.6 | 1.7.6 |
| thrift_sasl | 0.4.3 | 0.4.3 |
| pyhive | 0.7.0 | 0.7.0 |
| python-gssapi | 1.8.3 | 1.8.3 |
| ${\color{red}pytorch-gpu}$ | 2.0.0 | 2.3.1 |
| sagemaker-headless-execution-driver | 0.0.13 | 0.0.13 |
| sagemaker-kernel-wrapper | 0.0.2 | 0.0.2 |
| sagemaker-python-sdk | 2.227.0 | 2.227.0 |
| sagemaker-studio-analytics-extension | 0.1.2 | 0.1.2 |
| ${\color{red}tensorflow}$ | 2.14.0 | 2.17.0 |
| ${\color{red}torchvision}$ | 0.15.2 | 0.18.1 |

Staleness Report: 0.13.0(cpu)

| Package | Current Version in the Distribution image | Latest Relevant Version in Upstream |
|---|---|---|
| ipython | 8.26.0 | 8.26.0 |
| jinja2 | 3.1.4 | 3.1.4 |
| ipywidgets | 7.8.3 | 7.8.3 |
| numpy | 1.26.4 | 1.26.4 |
| pandas | 2.2.2 | 2.2.2 |
| boto3 | 1.34.151 | 1.34.151 |
| aws-glue-sessions | 1.0.6 | 1.0.6 |
| conda | 23.11.0 | 23.11.0 |
| jupyterlab | 3.6.7 | 3.6.7 |
| keras | 2.15.0 | 2.15.0 |
| matplotlib | 3.9.1 | 3.9.1 |
| pip | 23.3.2 | 23.3.2 |
| scipy | 1.14.0 | 1.14.0 |
| scikit-learn | 1.5.1 | 1.5.1 |
| py-xgboost-cpu | 1.7.6 | 1.7.6 |
| thrift_sasl | 0.4.3 | 0.4.3 |
| pyhive | 0.7.0 | 0.7.0 |
| python-gssapi | 1.8.3 | 1.8.3 |
| ${\color{red}pytorch}$ | 2.1.2 | 2.3.1 |
| sagemaker-headless-execution-driver | 0.0.13 | 0.0.13 |
| sagemaker-kernel-wrapper | 0.0.2 | 0.0.2 |
| sagemaker-python-sdk | 2.227.0 | 2.227.0 |
| sagemaker-studio-analytics-extension | 0.1.2 | 0.1.2 |
| ${\color{red}tensorflow}$ | 2.15.0 | 2.17.0 |
| ${\color{red}torchvision}$ | 0.16.1 | 0.18.1 |
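
The red entries in the two staleness reports are the packages whose version in the image is behind the latest relevant upstream version. A minimal sketch of that comparison, using a handful of rows copied from the GPU report; the function names are illustrative and this is not the project's actual staleness tooling:

```python
# A few rows copied from the GPU staleness report:
# (package, current version in the image, latest relevant upstream version).
ROWS = [
    ("keras", "2.14.0", "2.15.0"),
    ("numpy", "1.26.4", "1.26.4"),
    ("pytorch-gpu", "2.0.0", "2.3.1"),
    ("tensorflow", "2.14.0", "2.17.0"),
    ("torchvision", "0.15.2", "0.18.1"),
]


def parse_version(text):
    """Turn '2.14.0' into (2, 14, 0) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def stale_packages(rows):
    """Return the names whose current version is older than the latest one."""
    return [
        name
        for name, current, latest in rows
        if parse_version(current) < parse_version(latest)
    ]


print(stale_packages(ROWS))
# ['keras', 'pytorch-gpu', 'tensorflow', 'torchvision']
```

The simple integer split only works for purely numeric dotted versions like the ones in these reports; pre-release or local version tags would need a real version parser.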

Unit test report:

CPU unit test failure:

FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_cpu[keras.test.Dockerfile-required_packages0] 
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_cpu[pandas.test.Dockerfile-required_packages7] 
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_cpu[sm-python-sdk.test.Dockerfile-required_packages8] 

keras - failed because of an issue in the test case itself; that code base is not owned by us

pandas - all failed tests raise an error like ValueError: Could not find file..., which should be an issue in the test cases themselves

sm-python-sdk - known issue: #316

GPU unit test failure:

FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[scipy.test.Dockerfile-required_packages4] 
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[keras.test.Dockerfile-required_packages0] 
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[pandas.test.Dockerfile-required_packages7] 
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[sm-python-sdk.test.Dockerfile-required_packages8] 

scipy - known issue: #30

keras - known issue: #413

pandas - all failed tests raise an error like ValueError: Could not find file..., which should be an issue in the test cases themselves

sm-python-sdk - known issue: #316
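
For triage, the FAILED lines can be matched against the known issues listed above. A rough sketch, assuming the pytest ID format shown in this report; the regular expression and helper names are hypothetical, and the issue numbers are the ones quoted for the GPU run:

```python
import re

# FAILED lines copied from the GPU unit test report above.
FAILED_LINES = """\
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[scipy.test.Dockerfile-required_packages4]
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[keras.test.Dockerfile-required_packages0]
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[pandas.test.Dockerfile-required_packages7]
FAILED test/test_dockerfile_based_harness.py::test_dockerfiles_for_gpu[sm-python-sdk.test.Dockerfile-required_packages8]
"""

# Known issues quoted in the GPU section of this report.
KNOWN_ISSUES = {"scipy": "#30", "keras": "#413", "sm-python-sdk": "#316"}

# Captures the image type (cpu/gpu) and the name before ".test.Dockerfile".
PATTERN = re.compile(
    r"^FAILED \S+::test_dockerfiles_for_(cpu|gpu)\[(.+?)\.test\.Dockerfile-"
)


def failed_packages(text):
    """Group the failing test Dockerfile names by image type."""
    result = {}
    for line in text.splitlines():
        match = PATTERN.match(line.strip())
        if match:
            result.setdefault(match.group(1), []).append(match.group(2))
    return result


failures = failed_packages(FAILED_LINES)
untracked = [name for name in failures["gpu"] if name not in KNOWN_ISSUES]
print(failures)   # {'gpu': ['scipy', 'keras', 'pandas', 'sm-python-sdk']}
print(untracked)  # ['pandas']
```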

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@TRNWWZ TRNWWZ changed the base branch from release-0.13.0 to main July 17, 2024 17:52
claytonparnell previously approved these changes Jul 22, 2024
@TRNWWZ TRNWWZ requested a review from JunLyu July 22, 2024 17:05
Comment on lines 5 to 6
# Sagemaker Python SDK's unit tests requires AWS_DEFAULT_REGION to be set. So, using an arbitrary value of us-east-1
export AWS_DEFAULT_REGION=us-west-2

nit: update region in the comment

Comment on lines -8 to -9
# Sagemaker Python SDK's unit tests requires AWS_DEFAULT_REGION to be set. So, using an arbitrary value of us-east-1
ENV AWS_DEFAULT_REGION=us-east-1

Why is this removed?

Contributor Author

It was designed to work in the same way as the change I made above:


# Sagemaker Python SDK's unit tests requires AWS_DEFAULT_REGION to be set. So, using an arbitrary value of us-east-1
export AWS_DEFAULT_REGION=us-west-2

But it had no effect when tested locally, so I am removing it and relying on the commands in the shell file directly.


# Enforce dependencies are all installed from conda-forge
RUN conda config --remove channels defaults
RUN micromamba config append channels conda-forge --env

https://mamba.readthedocs.io/en/latest/installation/micromamba-installation.html

Why not use the exact command?

micromamba config append channels conda-forge
micromamba config set channel_priority strict

Contributor Author

Without --env, the image build job fails:

Step 28/45 : RUN micromamba config append channels conda-forge

---> Running in 43cce551b89b
Traceback (most recent call last):
  File "/local/home/ruinon/sagemaker-distribution-dev/src/main.py", line 441, in <module>
    parse_args(get_arg_parser())
  File "/local/home/ruinon/sagemaker-distribution-dev/src/main.py", line 437, in parse_args
    args.func(args)
  File "/local/home/ruinon/sagemaker-distribution-dev/src/main.py", line 183, in build_images
    image_ids, image_versions = _build_local_images(target_version, args.target_ecr_repo, args.force, args.skip_tests)
  File "/local/home/ruinon/sagemaker-distribution-dev/src/main.py", line 250, in _build_local_images
    image, log_gen = _docker_client.images.build(
  File "/home/ruinon/anaconda3/envs/sagemaker-distribution/lib/python3.10/site-packages/docker/models/images.py", line 304, in build
    raise BuildError(chunk['error'], result_stream)
docker.errors.BuildError: The command '/usr/local/bin/_dockerfile_shell.sh micromamba config append channels conda-forge' returned a non-zero code: 139

I tried specifying USER root before executing the command but got the same error, so I am adding --env here.
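
A note on the non-zero code 139 in the log above: by shell convention, an exit status above 128 usually means the process was killed by signal number (status - 128). A small sketch that decodes it; the helper name is illustrative, and the root cause of the micromamba failure is not confirmed in this thread:

```python
import signal


def describe_exit_code(code):
    """Describe a shell-style exit status.

    Values above 128 conventionally mean "terminated by signal (code - 128)".
    """
    if code > 128:
        try:
            return f"terminated by {signal.Signals(code - 128).name}"
        except ValueError:
            return f"terminated by unknown signal {code - 128}"
    return f"exited with status {code}"


print(describe_exit_code(139))  # terminated by SIGSEGV (signal 11 on Linux)
```

Read this way, 139 suggests the command crashed with a segmentation fault rather than rejecting the arguments, which would be consistent with USER root making no difference.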

JunLyu previously approved these changes Jul 23, 2024
@@ -0,0 +1,32 @@
# Release notes: 0.13.0

Contributor

Can you manually leave a note that we upgraded Python from 3.8 to 3.11?

Contributor Author

Updated

@@ -62,6 +62,11 @@ RUN micromamba install -y --name base --file /tmp/$ENV_IN_FILENAME && \
ARG MAMBA_DOCKERFILE_ACTIVATE=1
RUN sudo ln -s $(which python3) /usr/bin/python

# Enforce dependencies are all installed from conda-forge
RUN conda config --remove channels defaults
Contributor

Does it have conda-forge now?

Contributor Author

Yes:

dev-dsk-ruinon-2a-ba2bf164 % docker run -it b0934903332f
(base) sagemaker-user@f19adfc37c06:~$ conda config --show channels
channels:
  - conda-forge
(base) sagemaker-user@f19adfc37c06:~$ exit
exit
(base) 
(24-07-24 21:33:41) <0> [~/sagemaker-distribution-dev]  
dev-dsk-ruinon-2a-ba2bf164 % docker run -it 4325ba0a6498
(base) sagemaker-user@5e786705f97c:~$ conda config --show channels
channels:
  - conda-forge
(base) sagemaker-user@5e786705f97c:~$ 
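
The manual check above could also be scripted. A minimal sketch that parses text in the shape printed by `conda config --show channels` and confirms that conda-forge is the only channel; the parser and its name are illustrative and not part of the repository's test harness:

```python
def parse_channels(output):
    """Extract channel names from `conda config --show channels` output."""
    channels = []
    in_block = False
    for line in output.splitlines():
        stripped = line.strip()
        if stripped == "channels:":
            in_block = True
        elif in_block and stripped.startswith("- "):
            channels.append(stripped[2:].strip())
        elif in_block and stripped:
            # Any other non-empty line ends the channels list.
            break
    return channels


# Output copied from the terminal session above.
SAMPLE_OUTPUT = "channels:\n  - conda-forge\n"

channels = parse_channels(SAMPLE_OUTPUT)
print(channels)  # ['conda-forge']
assert channels == ["conda-forge"], "expected conda-forge to be the only channel"
```

Inside the image, the same function could be fed the captured stdout of `conda config --show channels` instead of the sample string.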

balajisankar15 previously approved these changes Jul 25, 2024
template/v0/Dockerfile (outdated, resolved)
Contributor Author

TRNWWZ commented Aug 2, 2024

Python Package Size Report (GPU)

Target Image Version: 0.13.0 | Base Image Version: 0.12.0

Python Packages Total Size Summary

| Target Version Total Size | Base Version Total Size | Size Change (abs) | Size Change (%) |
|---|---|---|---|
| 3.17GB | 2.99GB | 179.69MB | 5.87 |
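
The percentage column appears to be the absolute change divided by the base size. A short sketch that reproduces it from the rounded figures in the summary, assuming 1024-based units (an assumption; the report does not state which convention it uses, but 1000-based units would not give 5.87):

```python
# Assumed binary units; the report does not say which convention it uses.
UNITS = {"B": 1, "KB": 1024, "MB": 1024 ** 2, "GB": 1024 ** 3}


def to_bytes(text):
    """Convert a size string such as '179.69MB' or '-14.00B' to bytes."""
    # Check the two-letter suffixes first, since every suffix ends in "B".
    for suffix in ("KB", "MB", "GB", "B"):
        if text.endswith(suffix):
            return float(text[: -len(suffix)]) * UNITS[suffix]
    raise ValueError(f"unrecognized size: {text!r}")


def percent_change(delta, base):
    """Size change relative to the base image, in percent."""
    return 100.0 * to_bytes(delta) / to_bytes(base)


# GPU summary: 179.69MB on top of a 2.99GB base gives roughly 5.87%.
print(round(percent_change("179.69MB", "2.99GB"), 2))
```

Because the inputs are already rounded, the result can differ slightly in the last digit from a value computed on exact byte counts.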

Top-20 Largest Python Packages

| Package | Version in the Target Image | Size |
|---|---|---|
| cudatoolkit | 11.8.0 | 682.45MB |
| tensorflow-base | 2.14.0 | 517.65MB |
| cudnn | 8.9.7.29 | 443.90MB |
| pytorch | 2.0.0 | 399.35MB |
| mkl | 2022.2.1 | 201.46MB |
| libmagma | 2.7.1 | 201.22MB |
| nccl | 2.22.3.1 | 124.94MB |
| libxgboost | 1.7.6 | 99.28MB |
| qt-main | 5.15.8 | 57.61MB |
| llvm-openmp | 18.1.8 | 55.76MB |
| pillow | 10.4.0 | 40.16MB |
| libllvm18 | 18.1.8 | 36.46MB |
| libllvm15 | 15.0.7 | 31.78MB |
| python | 3.11.9 | 29.45MB |
| pandoc | 3.3 | 19.92MB |
| scipy | 1.14.0 | 16.89MB |
| libclang-cpp15 | 15.0.7 | 16.41MB |
| pandas | 2.2.2 | 14.96MB |
| icu | 73.2 | 11.53MB |
| torchvision | 0.15.2 | 11.09MB |

Python Package Size Delta

The total size of newly introduced Python packages is 57.56MB, which accounts for 1.77% of the total package size.

Package Version in the Target Image Version in the Base Image Size Change (abs) Size Change (%)
tensorflow-base 2.14.0 2.13.1 56.79MB 12.32
libllvm18 18.1.8 - 36.46MB -
libclang-cpp15 15.0.7 - 16.41MB -
llvm-openmp 18.1.8 17.0.6 16.40MB 41.68
nccl 2.22.3.1 2.19.4.1 12.18MB 10.8
python 3.11.9 3.8.18 6.52MB 28.42
pandas 2.2.2 2.0.3 3.15MB 26.64
gettext-tools 0.22.5 - 2.60MB -
scipy 1.14.0 1.10.1 2.32MB 15.91
scikit-learn 1.5.1 1.3.2 2.01MB 25.01
libclang13 18.1.8 15.0.7 1.37MB 14.98
numpy 1.26.4 1.24.4 1.32MB 20.72
plotly 5.23.0 5.18.0 1.20MB 24.39
libglib 2.80.3 2.78.3 1.13MB 44.13
matplotlib-base 3.9.1 3.7.3 1.11MB 17.3
setuptools 71.0.4 69.0.3 969.44KB 210.97
sqlalchemy 2.0.31 2.0.24 723.57KB 26.9
ml_dtypes 0.2.0 - 686.10KB -
fonttools 4.53.1 4.48.1 592.85KB 26.4
qhull 2020.2 - 539.98KB -
pykrb5 0.5.1 - 458.74KB -
pillow 10.4.0 10.2.0 394.69KB 0.97
sympy 1.13.0 1.12 297.29KB 7.15
botocore 1.34.150 1.34.36 276.48KB 4.15
tornado 6.4.1 6.3.3 221.50KB 36.05
tensorflow-estimator 2.14.0 2.13.1 167.72KB 31.87
libgettextpo 0.22.5 - 166.58KB -
psutil 6.0.0 5.9.8 142.55KB 40.21
aiohttp 3.9.5 3.9.1 119.56KB 17.78
tzdata 2024a - 117.01KB -
glib 2.80.3 2.78.3 107.10KB 22.4
gst-plugins-base 1.24.6 1.22.9 82.29KB 3.11
h5py 3.11.0 3.10.0 78.69KB 6.86
libgcc-ng 14.1.0 13.2.0 69.92KB 9.29
sagemaker-python-sdk 2.227.0 2.206.0 63.55KB 9.4
pyspnego 0.11.1 0.9.1 63.12KB 31.9
libgcrypt 1.11.0 1.10.3 48.26KB 7.78
libstdcxx-ng 14.1.0 13.2.0 46.06KB 1.23
libasprintf 0.22.5 - 42.21KB -
harfbuzz 9.0.0 8.3.0 42.04KB 2.78
gstreamer 1.24.6 1.22.9 38.34KB 1.98
libwebp-base 1.4.0 1.3.2 36.25KB 9.24
contourpy 1.2.1 1.1.1 36.25KB 16.74
libgettextpo-devel 0.22.5 - 35.90KB -
thrift 0.20.0 0.19.0 33.43KB 25.54
libasprintf-devel 0.22.5 - 33.42KB -
menuinst 2.1.1 2.0.2 32.18KB 24.55
openssl 3.3.1 3.2.1 31.39KB 1.12
libpq 16.3 16.1 30.12KB 1.25
libcurl 8.9.1 8.5.0 26.26KB 6.91
mysql-common 8.3.0 8.0.33 26.05KB 3.54
libgfortran-ng 14.1.0 13.2.0 25.45KB 109.38
truststore 0.8.0 - 20.18KB -
libsqlite 3.46.0 3.44.2 19.06KB 2.31
pygments 2.18.0 2.17.2 18.43KB 2.19
libxkbcommon 1.7.0 1.6.0 18.04KB 3.21
xorg-libxxf86vm 1.1.5 - 18.01KB -
libopenblas 0.3.27 0.3.25 17.46KB 0.32
c-ares 1.32.3 1.26.0 16.61KB 10.45
ipython 8.26.0 8.12.2 16.41KB 2.88
jupyter_core 5.7.2 5.7.1 15.57KB 20.12
libmamba 1.5.8 1.5.6 15.23KB 0.94
libgfortran5 14.1.0 13.2.0 14.45KB 1.03
ansicolors 1.1.8 - 13.62KB -
zstandard 0.23.0 0.22.0 13.07KB 3.32
libgpg-error 1.50 1.47 12.68KB 4.98
libxcb 1.16 1.15 10.44KB 2.78
zstd 1.5.6 1.5.5 9.42KB 1.77
typing_extensions 4.12.2 4.5.0 8.36KB 27.33
libmambapy 1.5.8 1.5.6 7.34KB 2.48
archspec 0.2.3 0.2.2 6.32KB 15.28
google-auth 2.32.0 2.27.0 5.56KB 5.42
libarchive 3.7.4 3.7.2 5.55KB 0.66
wrapt 1.14.1 1.16.0 5.18KB 9.6
libdeflate 1.20 1.19 4.32KB 6.59
parso 0.8.4 0.8.3 4.05KB 5.83
jupyter_server 2.14.2 2.12.5 3.71KB 1.19
requests-oauthlib 2.0.0 1.3.1 3.46KB 15.97
xorg-libx11 1.8.9 1.8.7 3.42KB 0.42
referencing 0.35.1 0.33.0 3.08KB 8.08
importlib_resources 6.4.0 6.1.1 3.03KB 10.37
keras 2.14.0 2.13.1 2.98KB 0.34
ncurses 6.5 6.4 2.96KB 0.34
json5 0.9.25 0.9.14 2.92KB 11.97
pulseaudio-client 17.0 16.1 2.72KB 0.37
glib-tools 2.80.3 2.78.3 2.61KB 2.41
anyio 4.4.0 4.2.0 2.60KB 2.62
idna 3.7 3.6 2.53KB 5.18
threadpoolctl 3.5.0 3.2.0 2.51KB 12.23
pycparser 2.22 2.21 2.30KB 2.29
matplotlib-inline 0.1.7 0.1.6 2.27KB 18.95
tinycss2 1.3.0 1.2.1 2.12KB 9.34
requests 2.32.3 2.31.0 2.07KB 3.74
mysql-libs 8.3.0 8.0.33 1.96KB 0.13
pure_eval 0.2.3 0.2.2 1.95KB 13.74
filelock 3.15.4 3.13.1 1.94KB 12.73
notebook 6.5.7 6.5.6 1.94KB 0.65
zipp 3.19.2 3.17.0 1.92KB 10.36
tenacity 9.0.0 8.2.3 1.84KB 8.25
jsonpointer 3.0.0 2.4 1.76KB 11.16
jsonschema 4.23.0 4.21.1 1.47KB 2.07
libsolv 0.7.30 0.7.28 1.38KB 0.3
werkzeug 3.0.3 3.0.1 1.19KB 0.5
markdown 3.6 3.5.2 1.13KB 1.49
conda-package-handling 2.3.0 2.2.0 1.07KB 0.43
tensorflow 2.14.0 2.13.1 1.03KB 2.99
ipykernel 6.29.5 6.29.1 1.02KB 0.89
sagemaker-studio-analytics-extension 0.0.22 0.0.21 1.02KB 2.81
schema 0.7.7 0.7.5 1.02KB 4.63
alsa-lib 1.2.12 1.2.10 930.00B 0.17
python-fastjsonschema 2.20.0 2.19.1 915.00B 0.41
nbconvert-pandoc 7.16.4 7.15.0 891.00B 11.91
libxml2 2.12.7 2.12.5 872.00B 0.12
expat 2.6.2 2.5.0 849.00B 0.62
packaging 24.1 23.2 838.00B 1.69
docker-py 7.1.0 7.0.0 802.00B 0.78
nbformat 5.10.4 5.9.2 786.00B 0.78
mpg123 1.32.6 1.32.4 750.00B 0.15
sniffio 1.3.1 1.3.0 706.00B 4.92
nbconvert-core 7.16.4 7.15.0 560.00B 0.3
pyopenssl 24.2.1 24.0.0 518.00B 0.41
papermill 2.6.0 2.4.0 498.00B 1.34
jupyterlab_server 2.27.3 2.25.2 495.00B 1.01
prometheus_client 0.20.0 0.19.0 450.00B 0.93
pyasn1 0.6.0 0.5.1 447.00B 0.7
websocket-client 1.8.0 1.7.0 440.00B 0.94
gast 0.5.5 0.5.4 431.00B 1.83
pluggy 1.5.0 1.4.0 431.00B 1.84
typing-extensions 4.12.2 4.5.0 431.00B 4.46
wheel 0.43.0 0.42.0 410.00B 0.71
platformdirs 4.2.2 4.2.0 362.00B 1.79
prompt-toolkit 3.0.47 3.0.42 312.00B 0.12
rich 13.7.1 13.7.0 276.00B 0.15
xcb-util 0.4.1 0.4.0 237.00B 1.2
python-flatbuffers 24.3.25 23.5.26 226.00B 0.66
jupyter_server_fileid 0.9.2 0.9.1 226.00B 1.11
libaec 1.1.3 1.1.2 218.00B 0.62
conda-package-streaming 0.10.0 0.9.0 218.00B 1.14
zlib 1.3.1 1.2.13 179.00B 0.19
matplotlib 3.9.1 3.7.3 157.00B 1.85
xcb-util-keysyms 0.4.1 0.4.0 128.00B 0.9
jupyter_events 0.10.0 0.9.0 121.00B 0.57
webcolors 24.6.0 1.13 105.00B 0.58
notebook-shim 0.2.4 0.2.3 97.00B 0.58
terminado 0.18.1 0.18.0 83.00B 0.37
sagemaker-headless-execution-driver 0.0.13 0.0.12 82.00B 0.43
ipywidgets 7.8.3 7.8.0 75.00B 0.07
qtconsole-base 5.5.2 5.5.1 73.00B 0.07
pyasn1-modules 0.4.0 0.3.0 66.00B 0.07
send2trash 1.8.3 1.8.2 47.00B 0.21
xcb-util-renderutil 0.3.10 0.3.9 23.00B 0.14
s3transfer 0.10.2 0.10.0 17.00B 0.03
python_abi 3.11 3.8 14.00B 0.22
tqdm 4.66.4 4.66.1 3.00B 0.0
libzlib 1.3.1 1.2.13 -14.00B -0.02
cachetools 5.4.0 5.3.2 -17.00B -0.12
jinja2 3.1.4 3.1.3 -24.00B -0.02
urllib3 1.26.19 1.26.18 -50.00B -0.04
jupyter_server_terminals 0.5.3 0.5.2 -58.00B -0.29
prompt_toolkit 3.0.47 3.0.42 -62.00B -0.91
comm 0.2.2 0.2.1 -64.00B -0.52
pyparsing 3.1.2 3.1.1 -66.00B -0.07
nbconvert 7.16.4 7.15.0 -113.00B -1.34
types-python-dateutil 2.9.0.20240316 2.8.19.20240106 -127.00B -0.58
exceptiongroup 1.2.2 1.2.0 -133.00B -0.65
giflib 5.2.2 5.2.1 -137.00B -0.18
traitlets 5.14.3 5.14.1 -142.00B -0.13
aiosqlite 0.20.0 0.19.0 -245.00B -1.28
conda-libmamba-solver 24.7.0 24.1.0 -267.00B -0.65
jsonschema-with-format-nongpl 4.23.0 4.21.1 -309.00B -4.15
xcb-util-wm 0.4.2 0.4.1 -425.00B -0.82
ca-certificates 2024.7.4 2024.2.2 -579.00B -0.37
boto3 1.34.150 1.34.36 -605.00B -0.75
libpng 1.6.43 1.6.42 -879.00B -0.3
krb5 1.21.3 1.21.2 -1.13KB -0.08
certifi 2024.7.4 2024.2.2 -1.22KB -0.78
aws-glue-sessions 1.0.6 1.0.4 -1.28KB -1.87
future 1.0.0 0.18.3 -1.41KB -0.39
joblib 1.4.2 1.3.2 -1.43KB -0.66
python-tzdata 2024.1 2023.4 -1.94KB -1.36
tbb 2021.12.0 2021.11.0 -2.11KB -1.1
jupyterlab_widgets 1.1.9 1.1.7 -2.48KB -2.23
blinker 1.8.2 1.7.0 -3.10KB -17.77
cryptography 42.0.8 42.0.2 -3.20KB -0.16
libexpat 2.6.2 2.5.0 -4.15KB -5.45
libogg 1.3.5 1.3.4 -4.53KB -2.2
boltons 24.0.0 23.1.1 -6.13KB -2.06
widgetsnbextension 3.6.8 3.6.6 -12.26KB -1.96
openjpeg 2.5.2 2.5.0 -14.75KB -4.23
pyzmq 26.0.3 24.0.1 -18.05KB -3.74
python-dateutil 2.9.0 2.8.2 -22.70KB -9.45
gmpy2 2.1.5 2.1.2 -22.83KB -10.24
nbclient 0.10.0 0.8.0 -36.13KB -57.05
nss 3.102 3.97 -38.78KB -1.97
tensorboard 2.14.1 2.13.0 -56.68KB -1.11
pcre2 10.44 10.42 -60.02KB -6.04
debugpy 1.8.2 1.8.0 -136.55KB -5.72
libhwloc 2.11.1 2.9.3 -193.07KB -7.56
networkx 3.3 3.1 -267.89KB -18.79
bcrypt 4.2.0 4.1.2 -320.15KB -56.58
nbclassic 1.1.0 1.0.0 -435.42KB -7.51
xkeyboard-config 2.42 2.41 -497.12KB -56.68
sleef 3.6.1 3.5.1 -545.17KB -35.87
rpds-py 0.19.1 0.17.1 -670.92KB -67.5
gettext 0.22.5 0.21.1 -3.67MB -89.0
pandoc 3.3 3.1.3 -8.06MB -28.81
cudnn 8.9.7.29 8.8.0.121 -12.56MB -2.75

Python Package Size Report (CPU)

Target Image Version: 0.13.0 | Base Image Version: 0.12.0

Python Packages Total Size Summary

| Target Version Total Size | Base Version Total Size | Size Change (abs) | Size Change (%) |
|---|---|---|---|
| 938.80MB | 832.29MB | 106.52MB | 12.8 |

Top-20 Largest Python Packages

| Package | Version in the Target Image | Size |
|---|---|---|
| mkl | 2023.2.0 | 156.82MB |
| tensorflow-base | 2.15.0 | 140.09MB |
| qt-main | 5.15.8 | 57.61MB |
| llvm-openmp | 18.1.8 | 55.76MB |
| libtorch | 2.1.2 | 45.25MB |
| pillow | 10.4.0 | 40.16MB |
| libllvm18 | 18.1.8 | 36.46MB |
| libllvm15 | 15.0.7 | 31.78MB |
| python | 3.11.9 | 29.45MB |
| pytorch | 2.1.2 | 27.88MB |
| pandoc | 3.3 | 19.92MB |
| scipy | 1.14.0 | 16.89MB |
| libclang-cpp15 | 15.0.7 | 16.41MB |
| pandas | 2.2.2 | 14.96MB |
| icu | 73.2 | 11.53MB |
| libclang13 | 18.1.8 | 10.51MB |
| scikit-learn | 1.5.1 | 10.06MB |
| torchvision | 0.16.1 | 9.97MB |
| numpy | 1.26.4 | 7.69MB |
| matplotlib-base | 3.9.1 | 7.52MB |

Python Package Size Delta

The total size of newly introduced Python packages is 103.15MB, which accounts for ${\color{red}10.99}$% of the total package size.

Package Version in the Target Image Version in the Base Image Size Change (abs) Size Change (%)
libtorch 2.1.2 - 45.25MB -
libllvm18 18.1.8 - 36.46MB -
tensorflow-base 2.15.0 2.13.1 16.68MB 13.52
libclang-cpp15 15.0.7 - 16.41MB -
llvm-openmp 18.1.8 17.0.6 16.40MB 41.68
python 3.11.9 3.8.18 6.52MB 28.42
pandas 2.2.2 2.0.3 3.15MB 26.64
gettext-tools 0.22.5 - 2.60MB -
scipy 1.14.0 1.10.1 2.32MB 15.91
scikit-learn 1.5.1 1.3.2 2.01MB 25.01
libclang13 18.1.8 15.0.7 1.37MB 14.98
numpy 1.26.4 1.24.4 1.32MB 20.72
plotly 5.23.0 5.18.0 1.20MB 24.39
libglib 2.80.3 2.78.3 1.13MB 44.13
matplotlib-base 3.9.1 3.7.3 1.11MB 17.3
setuptools 71.0.4 69.0.3 969.44KB 210.97
sqlalchemy 2.0.31 2.0.24 723.57KB 26.9
ml_dtypes 0.2.0 - 686.10KB -
fonttools 4.53.1 4.48.1 592.85KB 26.4
libgrpc 1.59.3 1.54.3 582.88KB 9.94
qhull 2020.2 - 539.98KB -
pykrb5 0.5.1 - 458.74KB -
pillow 10.4.0 10.2.0 394.69KB 0.97
libprotobuf 4.24.4 3.21.12 329.41KB 15.12
torchvision 0.16.1 0.15.2 311.37KB 3.15
sympy 1.13.0 1.12 297.29KB 7.15
botocore 1.34.150 1.34.36 276.48KB 4.15
grpcio 1.59.3 1.54.3 246.68KB 32.33
libre2-11 2023.09.01 - 227.33KB -
tornado 6.4.1 6.3.3 221.50KB 36.05
tensorflow-estimator 2.15.0 2.13.1 173.94KB 33.15
libgettextpo 0.22.5 - 166.58KB -
psutil 6.0.0 5.9.8 142.55KB 40.21
fsspec 2024.6.1 - 130.02KB -
aiohttp 3.9.5 3.9.1 119.56KB 17.78
tzdata 2024a - 117.01KB -
glib 2.80.3 2.78.3 107.10KB 22.4
gst-plugins-base 1.24.6 1.22.9 82.29KB 3.11
h5py 3.11.0 3.10.0 78.69KB 6.86
protobuf 4.24.4 4.21.12 71.18KB 22.56
libgcc-ng 14.1.0 13.2.0 69.92KB 9.29
sagemaker-python-sdk 2.227.0 2.206.0 63.55KB 9.4
pyspnego 0.11.1 0.9.1 63.12KB 31.9
libgcrypt 1.11.0 1.10.3 48.26KB 7.78
libstdcxx-ng 14.1.0 13.2.0 46.06KB 1.23
libasprintf 0.22.5 - 42.21KB -
harfbuzz 9.0.0 8.3.0 42.04KB 2.78
gstreamer 1.24.6 1.22.9 38.34KB 1.98
libwebp-base 1.4.0 1.3.2 36.25KB 9.24
contourpy 1.2.1 1.1.1 36.25KB 16.74
libgettextpo-devel 0.22.5 - 35.90KB -
thrift 0.20.0 0.19.0 33.43KB 25.54
libasprintf-devel 0.22.5 - 33.42KB -
menuinst 2.1.1 2.0.2 32.18KB 24.55
openssl 3.3.1 3.2.1 31.39KB 1.12
libpq 16.3 16.1 30.12KB 1.25
libcurl 8.9.1 8.5.0 26.26KB 6.91
mysql-common 8.3.0 8.0.33 26.05KB 3.54
libgfortran-ng 14.1.0 13.2.0 25.45KB 109.38
libabseil 20230802.1 20230125.3 22.48KB 1.86
truststore 0.8.0 - 20.18KB -
libsqlite 3.46.0 3.44.2 19.06KB 2.31
pygments 2.18.0 2.17.2 18.43KB 2.19
libxkbcommon 1.7.0 1.6.0 18.04KB 3.21
xorg-libxxf86vm 1.1.5 - 18.01KB -
libopenblas 0.3.27 0.3.25 17.46KB 0.32
c-ares 1.32.3 1.26.0 16.61KB 10.45
ipython 8.26.0 8.12.2 16.41KB 2.88
jupyter_core 5.7.2 5.7.1 15.57KB 20.12
libmamba 1.5.8 1.5.6 15.23KB 0.94
libgfortran5 14.1.0 13.2.0 14.45KB 1.03
ansicolors 1.1.8 - 13.62KB -
zstandard 0.23.0 0.22.0 13.07KB 3.32
libgpg-error 1.50 1.47 12.68KB 4.98
libxcb 1.16 1.15 10.44KB 2.78
zstd 1.5.6 1.5.5 9.42KB 1.77
typing_extensions 4.12.2 4.5.0 8.36KB 27.33
libmambapy 1.5.8 1.5.6 7.34KB 2.48
libuv 1.48.0 1.46.0 6.48KB 0.74
archspec 0.2.3 0.2.2 6.32KB 15.28
google-auth 2.32.0 2.27.0 5.56KB 5.42
libarchive 3.7.4 3.7.2 5.55KB 0.66
wrapt 1.14.1 1.16.0 5.18KB 9.6
libdeflate 1.20 1.19 4.32KB 6.59
parso 0.8.4 0.8.3 4.05KB 5.83
tensorflow 2.15.0 2.13.1 3.98KB 11.58
google-auth-oauthlib 1.2.1 1.0.0 3.96KB 19.0
jupyter_server 2.14.2 2.12.5 3.71KB 1.19
requests-oauthlib 2.0.0 1.3.1 3.46KB 15.97
xorg-libx11 1.8.9 1.8.7 3.42KB 0.42
referencing 0.35.1 0.33.0 3.08KB 8.08
importlib_resources 6.4.0 6.1.1 3.03KB 10.37
ncurses 6.5 6.4 2.96KB 0.34
json5 0.9.25 0.9.14 2.92KB 11.97
pulseaudio-client 17.0 16.1 2.72KB 0.37
glib-tools 2.80.3 2.78.3 2.61KB 2.41
anyio 4.4.0 4.2.0 2.60KB 2.62
keras 2.15.0 2.13.1 2.54KB 0.29
idna 3.7 3.6 2.53KB 5.18
threadpoolctl 3.5.0 3.2.0 2.51KB 12.23
pycparser 2.22 2.21 2.30KB 2.29
matplotlib-inline 0.1.7 0.1.6 2.27KB 18.95
tinycss2 1.3.0 1.2.1 2.12KB 9.34
requests 2.32.3 2.31.0 2.07KB 3.74
mysql-libs 8.3.0 8.0.33 1.96KB 0.13
pure_eval 0.2.3 0.2.2 1.95KB 13.74
filelock 3.15.4 3.13.1 1.94KB 12.73
notebook 6.5.7 6.5.6 1.94KB 0.65
zipp 3.19.2 3.17.0 1.92KB 10.36
tenacity 9.0.0 8.2.3 1.84KB 8.25
jsonpointer 3.0.0 2.4 1.76KB 11.16
jsonschema 4.23.0 4.21.1 1.47KB 2.07
libsolv 0.7.30 0.7.28 1.38KB 0.3
werkzeug 3.0.3 3.0.1 1.19KB 0.5
markdown 3.6 3.5.2 1.13KB 1.49
conda-package-handling 2.3.0 2.2.0 1.07KB 0.43
ipykernel 6.29.5 6.29.1 1.02KB 0.89
sagemaker-studio-analytics-extension 0.0.22 0.0.21 1.02KB 2.81
schema 0.7.7 0.7.5 1.02KB 4.63
alsa-lib 1.2.12 1.2.10 930.00B 0.17
python-fastjsonschema 2.20.0 2.19.1 915.00B 0.41
nbconvert-pandoc 7.16.4 7.15.0 891.00B 11.91
libxml2 2.12.7 2.12.5 872.00B 0.12
expat 2.6.2 2.5.0 849.00B 0.62
packaging 24.1 23.2 838.00B 1.69
docker-py 7.1.0 7.0.0 802.00B 0.78
nbformat 5.10.4 5.9.2 786.00B 0.78
mpg123 1.32.6 1.32.4 750.00B 0.15
sniffio 1.3.1 1.3.0 706.00B 4.92
nbconvert-core 7.16.4 7.15.0 560.00B 0.3
pyopenssl 24.2.1 24.0.0 518.00B 0.41
papermill 2.6.0 2.4.0 498.00B 1.34
jupyterlab_server 2.27.3 2.25.2 495.00B 1.01
prometheus_client 0.20.0 0.19.0 450.00B 0.93
pyasn1 0.6.0 0.5.1 447.00B 0.7
websocket-client 1.8.0 1.7.0 440.00B 0.94
gast 0.5.5 0.5.4 431.00B 1.83
pluggy 1.5.0 1.4.0 431.00B 1.84
typing-extensions 4.12.2 4.5.0 431.00B 4.46
wheel 0.43.0 0.42.0 410.00B 0.71
platformdirs 4.2.2 4.2.0 362.00B 1.79
prompt-toolkit 3.0.47 3.0.42 312.00B 0.12
rich 13.7.1 13.7.0 276.00B 0.15
xcb-util 0.4.1 0.4.0 237.00B 1.2
python-flatbuffers 24.3.25 23.5.26 226.00B 0.66
jupyter_server_fileid 0.9.2 0.9.1 226.00B 1.11
libaec 1.1.3 1.1.2 218.00B 0.62
conda-package-streaming 0.10.0 0.9.0 218.00B 1.14
zlib 1.3.1 1.2.13 179.00B 0.19
matplotlib 3.9.1 3.7.3 157.00B 1.85
xcb-util-keysyms 0.4.1 0.4.0 128.00B 0.9
jupyter_events 0.10.0 0.9.0 121.00B 0.57
webcolors 24.6.0 1.13 105.00B 0.58
notebook-shim 0.2.4 0.2.3 97.00B 0.58
terminado 0.18.1 0.18.0 83.00B 0.37
sagemaker-headless-execution-driver 0.0.13 0.0.12 82.00B 0.43
ipywidgets 7.8.3 7.8.0 75.00B 0.07
qtconsole-base 5.5.2 5.5.1 73.00B 0.07
pyasn1-modules 0.4.0 0.3.0 66.00B 0.07
send2trash 1.8.3 1.8.2 47.00B 0.21
xcb-util-renderutil 0.3.10 0.3.9 23.00B 0.14
s3transfer 0.10.2 0.10.0 17.00B 0.03
python_abi 3.11 3.8 14.00B 0.22
tqdm 4.66.4 4.66.1 3.00B 0.0
libzlib 1.3.1 1.2.13 -14.00B -0.02
cachetools 5.4.0 5.3.2 -17.00B -0.12
jinja2 3.1.4 3.1.3 -24.00B -0.02
urllib3 1.26.19 1.26.18 -50.00B -0.04
jupyter_server_terminals 0.5.3 0.5.2 -58.00B -0.29
prompt_toolkit 3.0.47 3.0.42 -62.00B -0.91
comm 0.2.2 0.2.1 -64.00B -0.52
pyparsing 3.1.2 3.1.1 -66.00B -0.07
nbconvert 7.16.4 7.15.0 -113.00B -1.34
types-python-dateutil 2.9.0.20240316 2.8.19.20240106 -127.00B -0.58
exceptiongroup 1.2.2 1.2.0 -133.00B -0.65
giflib 5.2.2 5.2.1 -137.00B -0.18
traitlets 5.14.3 5.14.1 -142.00B -0.13
aiosqlite 0.20.0 0.19.0 -245.00B -1.28
conda-libmamba-solver 24.7.0 24.1.0 -267.00B -0.65
jsonschema-with-format-nongpl 4.23.0 4.21.1 -309.00B -4.15
xcb-util-wm 0.4.2 0.4.1 -425.00B -0.82
ca-certificates 2024.7.4 2024.2.2 -579.00B -0.37
boto3 1.34.150 1.34.36 -605.00B -0.75
libpng 1.6.43 1.6.42 -879.00B -0.3
krb5 1.21.3 1.21.2 -1.13KB -0.08
certifi 2024.7.4 2024.2.2 -1.22KB -0.78
aws-glue-sessions 1.0.6 1.0.4 -1.28KB -1.87
future 1.0.0 0.18.3 -1.41KB -0.39
joblib 1.4.2 1.3.2 -1.43KB -0.66
python-tzdata 2024.1 2023.4 -1.94KB -1.36
tbb 2021.12.0 2021.11.0 -2.11KB -1.1
jupyterlab_widgets 1.1.9 1.1.7 -2.48KB -2.23
blinker 1.8.2 1.7.0 -3.10KB -17.77
cryptography 42.0.8 42.0.2 -3.20KB -0.16
libexpat 2.6.2 2.5.0 -4.15KB -5.45
libogg 1.3.5 1.3.4 -4.53KB -2.2
boltons 24.0.0 23.1.1 -6.13KB -2.06
widgetsnbextension 3.6.8 3.6.6 -12.26KB -1.96
openjpeg 2.5.2 2.5.0 -14.75KB -4.23
pyzmq 26.0.3 24.0.1 -18.05KB -3.74
tensorboard 2.15.2 2.13.0 -21.04KB -0.41
python-dateutil 2.9.0 2.8.2 -22.70KB -9.45
gmpy2 2.1.5 2.1.2 -22.83KB -10.24
nbclient 0.10.0 0.8.0 -36.13KB -57.05
nss 3.102 3.97 -38.78KB -1.97
pcre2 10.44 10.42 -60.02KB -6.04
debugpy 1.8.2 1.8.0 -136.55KB -5.72
re2 2023.09.01 2023.03.02 -170.48KB -86.76
libhwloc 2.11.1 2.9.3 -193.07KB -7.56
networkx 3.3 3.1 -267.89KB -18.79
bcrypt 4.2.0 4.1.2 -320.15KB -56.58
nbclassic 1.1.0 1.0.0 -435.42KB -7.51
mkl 2023.2.0 2022.2.1 -456.49KB -0.28
xkeyboard-config 2.42 2.41 -497.12KB -56.68
sleef 3.6.1 3.5.1 -545.17KB -35.87
rpds-py 0.19.1 0.17.1 -670.92KB -67.5
gettext 0.22.5 0.21.1 -3.67MB -89.0
pandoc 3.3 3.1.3 -8.06MB -28.81
pytorch 2.1.2 2.0.0 -39.37MB -58.54

Contributor Author

TRNWWZ commented Aug 2, 2024

Source of new installed packages:
libllvm18 is required by matplotlib
libclang-cpp15 is required by matplotlib
gettext-tools is required by matplotlib

@TRNWWZ TRNWWZ merged commit ceac022 into aws:main Aug 8, 2024