Trending repositories for topic deep-learning
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AW...
📝 An awesome Data Science repository to learn and apply for real world problems.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
Code for our paper "VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters".
[ECCV 2024] This is the official implementation of HRMapNet, maintaining and utilizing a low-cost global rasterized map to enhance online vectorized map perception.
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing & accessible OCR
[MedIA 2023/MICCAI 2022 Grand Challenge]: Airway Tree Modeling (ATM'22) Related Work Collections, also includes the state-of-the-art works on pulmonary airway segmentation and related works.
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
Official PyTorch implementation of AdaDiff described in the paper (https://arxiv.org/abs/2207.05876).
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
🔬 A curated list of awesome LLMs & deep learning strategies & tools in financial market.
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.
A Comprehensive Framework for Building End-to-End Recommendation Systems with State-of-the-Art Models
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
This project explores the impact of Multi-Scale CNNs on the classification of EEG signals in Brain-Computer Interface (BCI) systems. By comparing the performance of two models, EEGNet and MSTANN, the ...
[RSE 2024] 🍿POPCORN: High-resolution Population Maps Derived from Sentinel-1 and Sentinel-2 🌍🛰️
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
📺 Discover the latest machine learning / AI courses on YouTube.
An Open Source Machine Learning Framework for Everyone
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification
Minimal code and examples for inferencing Sapiens foundation human models in Pytorch
Implementation of a Light Recurrent Unit in Pytorch
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
CARLA: A self-supervised contrastive learning model for time series anomaly detection. Enhances anomaly detection by learning robust representations of time series data.
A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning (Awesome & Benchmark)
[ECCV 2024] UMERegRobust - Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration
This project focuses on implementing a CNN model based on the EEGNet architecture with the PyTorch library for classifying motor imagery tasks using EEG data.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Streamlit — A faster way to build and share data apps.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Web UI for AutoGen (A Framework for Multi-Agent LLM Applications)
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
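AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, along with new ideas, new questions, new resources, and new projects from work and research.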
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
The official implementation of Self-Play Preference Optimization (SPPO)
A Jax-based library for designing and training transformer models from scratch.
The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
Foundational model for human-like, expressive TTS