Statistics for topic reinforcement-learning
RepositoryStats tracks 518,991 Github repositories, of these 1,151 are tagged with the reinforcement-learning topic. The most common primary language for repositories using this topic is Python (775). Other languages include: Jupyter Notebook (156), C++ (40), C# (15)
Stargazers over time for topic reinforcement-learning
Most starred repositories for topic reinforcement-learning (view more)
Trending repositories for topic reinforcement-learning (view more)
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan...
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
Prioritized Experience Replay implementation with proportional prioritization
PsyDI: A MBTI agent that helps you understand your personality type through a relaxed multi-modal interaction.
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan...
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
PsyDI: A MBTI agent that helps you understand your personality type through a relaxed multi-modal interaction.
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan...
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
PsyDI: A MBTI agent that helps you understand your personality type through a relaxed multi-modal interaction.
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Top paper collection for stock price prediction, quantitative trading. Covering top conferences and journals like KDD, TKDE, CIKM, AAAI, IJCAI, ACL, EMNLP.
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan...
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).
A deep reinforcement learning (DRL) based approach for spatial layout of land use and roads in urban communities. (Nature Computational Science)
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.