Add intermediate_size
to GPT-NeoX models (#1212)
#1413
Job | Run time |
---|---|
4m 36s | |
4m 36s |
intermediate_size
to GPT-NeoX models (#1212)
#1413
Job | Run time |
---|---|
4m 36s | |
4m 36s |