A question about instancing brevitas quantizer to act layer #875

Answered by Giuseppe5
RyougiKukoc asked this question in Q&A
This works, and it would be basically equivalent to having nn.GELU followed by a QuantIdentity(act_quant=Int8ActPerTensorFloat).

The thing I would point out is that passthrough_act in this case should be False.

Replies: 1 comment 5 replies

Answer selected by RyougiKukoc
2 participants