1 result found Sort:
Wonderful Matrices to Build Small Language Models
nlp
python
pytorch
transformer
deep-learning
language-model
machine-learning
foundation-models
mixture-of-experts
attention-mechanism
pytorch-transformers
small-language-model
small-language-models
dynamic-mask-attention
attention-is-all-you-need
feedforward-neural-network
natural-language-processing
cross-domain-mixture-of-experts
Created
2024-11-03
346 commits to main branch, last one 16 days ago