Trending repositories for language Python
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabil...
Script to backup and restore your joined subredits, multireddits, followed users, saved, hidden, upvoted, downvoted posts.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabil...
A framework for Claude Opus to intelligently orchestrate subagents.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
R2R is a RAG (Retrieval-Augmented Generation) engine with a RESTful API and prod features. Including hybrid search, knowledge graphs, and more.
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
CISO Assistant is a one-stop-shop for GRC, covering Risk, AppSec and Audit Management and supporting +46 frameworks worldwide: NIST CSF, ISO 27001, SOC2, CIS, PCI DSS, NIS2, CMMC, PSPF, GDPR, HIPAA, E...
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may ...
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
An opinionated list of awesome Python frameworks, libraries, software and resources.
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
The official implementation of Self-Play Preference Optimization (SPPO)
The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabil...
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
Easyest and lazyest way for building multi-agent LLMs applications.
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabil...
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
Script to backup and restore your joined subredits, multireddits, followed users, saved, hidden, upvoted, downvoted posts.
[ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"
Wrapper nodes for ComfyUI to use some ofthe DiffSynthStudio features
Upgraded repo includes more capabilities, converted the cmd .py scripts to function more intuitively, added 147 different depth output colour map methods, introduced batch image as well as video proce...
Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).
This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage and features a user-friendly Gradio interface.
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A framework for Claude Opus to intelligently orchestrate subagents.
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
Open-Sora: Democratizing Efficient Video Production for All
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
An opinionated list of awesome Python frameworks, libraries, software and resources.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabil...
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan...
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
GroqNotes: Generate organized notes from audio using Groq, Whisper, and Llama3
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
Wrapper nodes for ComfyUI to use some ofthe DiffSynthStudio features
为小黑子ikun们打造的专属键盘--鸡音键盘,非ikun不可用!用上它,你就是千万ikun粉中最靓的仔,无人可敌无人可挡。
Official code repository of CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph
The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
The official implementation of Self-Play Preference Optimization (SPPO)
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
This tool extracts and displays data from the Recall feature in Windows 11, providing an easy way to access information about your PC's activity snapshots.
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memor...
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabil...
Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
a research paper for generative cartoon interpolation
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
The tiniest PaaS you've ever seen. Piku allows you to do git push deployments to your own servers.
An opinionated list of awesome Python frameworks, libraries, software and resources.
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memor...
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.
Bizin Gothic は、ユニバーサルデザインフォントの BIZ UDゴシック と英文フォント Inconsolata を合成したプログラミング向けフォントです。
Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
A natural language interface for computers
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
An opinionated list of awesome Python frameworks, libraries, software and resources.
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropi...
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Easily migrate your codebase from one framework or language to another.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Converts text input or URL into knowledge graph and displays