Skip to content

GemmaMLP uses 'tanh` approximation for GeLU activation (#1004) #1

GemmaMLP uses 'tanh` approximation for GeLU activation (#1004)

GemmaMLP uses 'tanh` approximation for GeLU activation (#1004) #1

Annotations

1 warning

cpu-tests (macOS-12, 3.10)

succeeded Mar 10, 2024 in 9m 18s