lesser powerful gpu #140

xprabhudayal · 2024-10-16T14:40:19Z

ive been running this model on a high end instance(h100) for a while & i have some doubts..

can we use less powerful gpu to run this ai model? like v100.
is this runnable in COLAB PRO having a100?

NathanaelTamirat · 2024-11-02T09:14:23Z

how long will it take on h100? and have you tried on other cheaper GPUs?
and can this project generate papers beside IT related ?

BradKML · 2024-12-21T13:20:06Z

@xprabhudayal You can ask the agent to do more small scale experiments targeting fine-grained issues (e.g. activation functions, classical ML with HPO, easy NLP optimization) rather than reaching all the way into DL territory. When it comes to idea generation others have said it takes "a few hours" (2-4 hours) for more compute-heavy concepts. #145
A small thought would be to create a diverse benchmark covering a wider range of common targets.

@NathanaelTamirat Check is (currently it is made for data-centric experiments not physical research) #145 (comment)

xprabhudayal · 2024-12-21T13:36:26Z

I have run this agent on Google Colab for free by using the T4 GPU and during the experiment part it has took me around 2 to 3 hours for Nano GPT lite(I had reverse the engineer the Colab and opened up a terminal).
I have generated a paper by using the Qwen 2.5 72B LLM via API.

Like the only part which utilizers the GPU is the experiment agent (when using LLM API) which has been seen in the diagram (in the middle section)

BradKML · 2024-12-21T15:39:23Z

@xprabhudayal so only the GPU is used for both writing code and experimentation? Could you share the setup and the time needed for each process (paper fetching, ideation, code-gen, experiment, drafting, reviewing)?

xprabhudayal closed this as completed Dec 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lesser powerful gpu #140

lesser powerful gpu #140

xprabhudayal commented Oct 16, 2024

NathanaelTamirat commented Nov 2, 2024

BradKML commented Dec 21, 2024

xprabhudayal commented Dec 21, 2024

BradKML commented Dec 21, 2024

lesser powerful gpu #140

lesser powerful gpu #140

Comments

xprabhudayal commented Oct 16, 2024

NathanaelTamirat commented Nov 2, 2024

BradKML commented Dec 21, 2024

xprabhudayal commented Dec 21, 2024

BradKML commented Dec 21, 2024