Statistics for language Python
RepositoryStats tracks 579,236 GitHub repositories, of which 115,001 are reported to use Python as their primary language.
Most starred repositories for language Python
Trending repositories for language Python
first base model for full-duplex conversational audio
An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.5 Sonnet API and Firecrawl. While currently configured for ente...
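openai-captcha-detection is a tool that uses OpenAI for CAPTCHA recognition. By calling OpenAI's API, the project can recognize the text in complex CAPTCHA images, helping developers automate CAPTCHA-handling workflows.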
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and di...
Diagram as Code for prototyping cloud system architectures
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https://huggingface.co/papers).
Official Repository for "Eurekaverse: Environment Curriculum Generation via Large Language Models" (CoRL 2024)
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Auto_Jobs_Applier_AIHawk is a tool that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way.
A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Finetune Llama 3.2, Mistral, Phi, Qwen & Gemma LLMs 2-5x faster with 80% less memory
Generate high-definition short videos with one click using large AI models.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.