The large size of foundation models raises several resource and cost questions around deploying them in production. This EPIC will focus on creating experiments and showing results around some of the following questions:
What is the relationship between a model's parameter count and its memory consumption? Create a "rosetta stone" document of the GPU memory required by models of different parameter sizes. Create a notebook that captures the GPU memory footprint.
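A possible starting point for the footprint notebook, as a minimal sketch: it assumes a CUDA-capable host with PyTorch and transformers installed, and uses `"gpt2"` as a placeholder model id for whichever model is being profiled.

```python
# Sketch: measure GPU memory used by a loaded model (model id is a placeholder).
import torch
from transformers import AutoModelForCausalLM

model_id = "gpt2"  # swap in the foundation model under test

torch.cuda.reset_peak_memory_stats()
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16).to("cuda")

allocated_gb = torch.cuda.memory_allocated() / 1024**3   # weights currently resident
peak_gb = torch.cuda.max_memory_allocated() / 1024**3     # peak during the load
n_params = sum(p.numel() for p in model.parameters())

print(f"{n_params / 1e9:.2f}B parameters -> {allocated_gb:.2f} GB allocated, {peak_gb:.2f} GB peak")
```

Repeating this across several parameter sizes would produce the raw numbers for the "rosetta stone" table.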
How are the models loaded into GPU memory? Do they stream directly from S3, or do we also require significant host RAM? If so, capture the RAM requirements in a notebook. What about CPU usage? Update the cost document with RAM and CPU information. Are there ways to optimize this?
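One way to capture the host-RAM side of this in a notebook is to compare process RSS before and after materializing the checkpoint. This is only a sketch and assumes psutil and transformers are installed; the model id is again a placeholder.

```python
# Sketch: host RAM consumed while loading a checkpoint (default path goes through CPU RAM).
import os
import psutil
from transformers import AutoModelForCausalLM

proc = psutil.Process(os.getpid())
rss_before = proc.memory_info().rss

model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder model id

rss_after = proc.memory_info().rss
print(f"Host RAM delta during load: {(rss_after - rss_before) / 1024**3:.2f} GB")
```

One optimization worth measuring here is transformers' `low_cpu_mem_usage=True` option, which is meant to avoid holding a second full copy of the weights in RAM during loading.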
What happens when we load the models in a lower-precision format like INT8? How are accuracy, CPU, and memory performance affected? Explain this theoretically and show results in a notebook. Touch on the challenges of using frameworks like bitsandbytes in production.
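For the INT8 experiments, a minimal sketch of 8-bit loading through bitsandbytes via transformers' `BitsAndBytesConfig` could look like the following. It assumes transformers, accelerate, and bitsandbytes are installed and a CUDA GPU is available; the model id is a placeholder.

```python
# Sketch: load a model in 8-bit with bitsandbytes and report its memory footprint.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(load_in_8bit=True)

model_int8 = AutoModelForCausalLM.from_pretrained(
    "gpt2",                      # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",           # requires accelerate
)

print(f"Footprint: {model_int8.get_memory_footprint() / 1024**3:.2f} GB")
```

Comparing this footprint and the downstream accuracy against the FP16 baseline from the earlier notebook would cover the memory and quality sides of the question.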
Is distributed training and inference across many cheap instances more efficient per dollar than a single instance with a large GPU? If we have just one GPU with 16 GB of memory, how much can be done with it in the LLM space? Design experiments and share the results in a notebook.
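Before running experiments, a back-of-envelope calculation can bound what a single 16 GB GPU can hold at different precisions. The sketch below counts weights only and ignores activations, KV cache, and framework overhead, which shrink the usable budget further in practice.

```python
# Sketch: rough maximum parameter count that fits in 16 GB of GPU memory, weights only.
BYTES_PER_PARAM = {"fp32": 4, "fp16/bf16": 2, "int8": 1, "int4": 0.5}
GPU_MEMORY_GB = 16

for precision, nbytes in BYTES_PER_PARAM.items():
    max_params_b = GPU_MEMORY_GB * 1024**3 / nbytes / 1e9
    print(f"{precision:>9}: ~{max_params_b:.1f}B parameters (weights only)")
```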
What are the options for running these models on CPU only? Are there ways to optimize beyond the rule of thumb of roughly 1 GB of GPU memory per 1B parameters at INT8 precision?
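As one baseline for the CPU-only question, the weights can simply be kept on the host and generation run there. This is a minimal sketch assuming only transformers and PyTorch are installed; no GPU is required, the model id is a placeholder, and FP32 is used as the safe default since CPU bfloat16 support varies.

```python
# Sketch: CPU-only inference baseline (model id is a placeholder).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gpt2"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)  # stays on CPU

inputs = tokenizer("Foundation models on CPU:", return_tensors="pt")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```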