Trending repositories for topic deep-learning
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
An Open Source Machine Learning Framework for Everyone
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Simple but efficient kernel regression and anomaly detection algorithms
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
Mithril: A Modular Machine Learning Library for Model Composability
[ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes
Codebase for "Generating multivariate time series with COmmon Source CoordInated GAN (COSCI-GAN)"
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website...
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
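During image acquisition and transmission, images are often corrupted in various ways, which degrades image quality and hinders accurate interpretation of image information; some old photos also become stained, damaged, or incomplete because of improper storage. Image restoration techniques are mainly used to repair damaged images that have been polluted by noise or deliberately damaged in everyday life, and can also be applied to replace small regions or blemishes in an image. At present, image restoration is still carried out by experienced restoration specialists, and automating it with deep learning algorithms is increasingly becoming the direction of the field. Based on deep learning algorithms and image processing techniques, this project ...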
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
S + Autograd + XLA :: S-parameter based frequency domain circuit simulations and optimizations using JAX.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
An Open Source Machine Learning Framework for Everyone
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website...
Simple but efficient kernel regression and anomaly detection algorithms
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
[NeurIPS 2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
an inference lib for image/video restoration with VapourSynth support
Mithril: A Modular Machine Learning Library for Model Composability
The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
Face recognition SDK .NET MAUI (formerly Xamarin) and CSharp with 3D passive liveness detection (Face Detection, Face Landmarks, Face Recognition, Face Liveness, Face Pose, Face Expression, Face attri...
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Implementation of the proposed Spline-Based Transformer from Disney Research
Implementation of LVSM, SOTA Large View Synthesis with Minimal 3d Inductive Bias, from Adobe Research
Deploying Android application for object detection
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Implementation of LVSM, SOTA Large View Synthesis with Minimal 3d Inductive Bias, from Adobe Research
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
Training a YOLOv5 model with custom data
Simple but efficient kernel regression and anomaly detection algorithms
Deploying Android application for object detection
Deploying Android application for image classification
MobileNet for Image Classification
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
Implementation of the proposed Spline-Based Transformer from Disney Research
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
An Open Source Machine Learning Framework for Everyone
Streamlit — A faster way to build and share data apps.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
SDG is a specialized framework designed to generate high-quality structured tabular data.
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
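[Three Years of Interviews, Five Years of Mock Exams] An algorithm engineer's handbook. Covers interview and written-test experience and practical knowledge for the AI industry, including AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied AI, the metaverse, and AGI.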
Face recognition SDK Android with 3D passive liveness detection (Face Detection, Face Landmarks, Face Recognition, Face Liveness, Face Pose, Face Expression, Face attributes)
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
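AIGC-interview/CV-interview/LLMs-interview: a repository collecting interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research.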
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
The official implementation of Self-Play Preference Optimization (SPPO)
A curated list of data science & AI guided projects to start building your portfolio
An SDK/Python library for AUTOMATIC1111 to run state-of-the-art diffusion models
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data