2 results found Sort:
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
Created
2023-06-24
27 commits to master branch, last one 10 months ago
[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
Created
2024-03-08
16 commits to hsb branch, last one 6 months ago