Search Results - RepositoryStats

Awesome-LLM-Strawberry hijkzzz

365

6.6k

apache-2.0

107

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

llm mcts coding openai-o1 strawberry mathematics chain-of-thought reinforcement-learning

Created 2024-09-15

245 commits to main branch, last one 2 days ago

OpenRLHF OpenRLHF

583

5.9k

apache-2.0

35

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

vllm raylib openai-o1 transformers large-language-models reinforcement-learning proximal-policy-optimization reinforcement-learning-from-human-feedback

Created 2023-07-30

1,198 commits to main branch, last one 18 hours ago

refly refly-ai

262

3.1k

other

23

🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifacts, AI knowledge base integration, chrome extension clip & sav...

Created 2024-02-19

4,150 commits to main branch, last one 22 hours ago

Awesome-LLM-Reasoning atfortes

162

2.9k

mit

48

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

cot gpt mllm gpt-4o papers prompt awesome chatgpt deepseek openai-o1 reasoning multimodal strawberry deepseek-r1 language-models chain-of-thought prompt-engineering in-context-learning

Created 2022-11-05

185 commits to main branch, last one 8 days ago

Awesome-MCoT yaotingwangofficial

5

294

unknown

7

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

cot mcts survey system-2 openai-o1 reasoning multimodal deepseek-r1 slow-thinking mllm-reasoning chain-of-thought instruction-tuning large-vision-language-model multimodal-chain-of-thought multimodal-large-language-models

Created 2025-02-15

53 commits to main branch, last one 17 hours ago

SmartAgent tsinghua-fib-lab

1

25

mit

2

The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".

lvlm llm-agent openai-o1 embodied-ai multi-modal llm-reasoning personalization chain-of-thought human-centric-ai large-language-model human-computer-interaction

Created 2024-12-10

20 commits to main branch, last one 6 days ago