Xipeng Li 75502be814 Adding FasterTransformer: A faster transformer layer inference implementation for BERT and other transformer based models. 6 lat temu
..
Modules 75502be814 Adding FasterTransformer: A faster transformer layer inference implementation for BERT and other transformer based models. 6 lat temu