Skip to content

Official code for the paper titled Augmenting Transformers with Recursively Composed Multi-grained Representations

Latest
Compare
Choose a tag to compare
@imhuim982 imhuim982 released this 20 May 14:08
· 13 commits to master since this release

In this work, we successfully combine a composition model with bi-directional Transformers and make them jointly pre-trainable.