Trending repositories for topic bioinformatics

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

48 (+15)

mit

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204 (+13)

mit

plotly/dash

Data Apps & Dashboards for Python. No JavaScript Required.

22,294 (+12)

mit

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

68,481 (+7)

biopython/biopython

Official git repository for Biopython (originally converted from CVS)

4,561 (+4)

steineggerlab/foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

954 (+3)

gpl-3.0

OSU-NLP-Group/ScienceAgentBench

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

78 (+3)

mit

MultiQC/MultiQC

Aggregate results from bioinformatics analyses across many samples into a single report.

1,288 (+2)

gpl-3.0

galaxyproject/galaxy

Data intensive science for everyone.

1,480 (+2)

seandavi/awesome-single-cell

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

3,350 (+2)

mit

danielecook/Awesome-Bioinformatics

A curated list of awesome Bioinformatics libraries and software.

3,407 (+2)

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

55 (+1)

apache-2.0

Adibvafa/CodonTransformer

CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.

136 (+1)

apache-2.0

MathCancer/PhysiCell

PhysiCell: Scientist end users should use latest release! Developers please fork the development branch and submit PRs to the dev branch. Thanks!

156 (+1)

snakemake/snakefmt

The uncompromising Snakemake code formatter

164 (+1)

mit

hillerlab/TOGA

TOGA (Tool to infer Orthologs from Genome Alignments): implements a novel paradigm to infer orthologous genes. TOGA integrates gene annotation, inferring orthologs and classifying genes as intact or l...

177 (+1)

mit

wheretrue/biobear

Work with bioinformatic files using Arrow, Polars, and/or DuckDB

181 (+1)

mit

lh3/biofast

Benchmarking programming languages/implementations for common tasks in Bioinformatics

185 (+1)

linsalrob/ComputationalGenomicsManual

Robs manual for the computational genomics and bioinformatics class.

246 (+1)

mit

bedops/bedops

:microscope: BEDOPS: high-performance genomic feature operations

328 (+1)

Last 3 days (relative gain)

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

48 (+45%)

mit

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204 (+7%)

mit

OSU-NLP-Group/ScienceAgentBench

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

78 (+4%)

mit

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

55 (+2%)

apache-2.0

Adibvafa/CodonTransformer

CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.

136 (+0.7%)

apache-2.0

MathCancer/PhysiCell

PhysiCell: Scientist end users should use latest release! Developers please fork the development branch and submit PRs to the dev branch. Thanks!

156 (+0.6%)

snakemake/snakefmt

The uncompromising Snakemake code formatter

164 (+0.6%)

mit

hillerlab/TOGA

177 (+0.6%)

mit

wheretrue/biobear

Work with bioinformatic files using Arrow, Polars, and/or DuckDB

181 (+0.6%)

mit

lh3/biofast

Benchmarking programming languages/implementations for common tasks in Bioinformatics

185 (+0.5%)

linsalrob/ComputationalGenomicsManual

Robs manual for the computational genomics and bioinformatics class.

246 (+0.4%)

mit

steineggerlab/foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

954 (+0.3%)

gpl-3.0

bedops/bedops

:microscope: BEDOPS: high-performance genomic feature operations

328 (+0.3%)

stuart-lab/signac

R toolkit for the analysis of single-cell chromatin data

363 (+0.3%)

MultiQC/MultiQC

Aggregate results from bioinformatics analyses across many samples into a single report.

1,288 (+0.2%)

gpl-3.0

galaxyproject/galaxy

Data intensive science for everyone.

1,480 (+0.1%)

pysam-developers/pysam

Pysam is a Python package for reading, manipulating, and writing genomics data such as SAM/BAM/CRAM and VCF/BCF files. It's a lightweight wrapper of the HTSlib API, the same one that powers samtools, ...

818 (+0.1%)

mit

kblin/ncbi-genome-download

Scripts to download genomes from the NCBI FTP servers

999 (+0.1%)

apache-2.0

mims-harvard/TDC

Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science

1,077 (+0.1%)

mit

lh3/bwa

Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

1,597 (+0.1%)

gpl-3.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204 (+58)

mit

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

48 (+35)

mit

plotly/dash

Data Apps & Dashboards for Python. No JavaScript Required.

22,294 (+24)

mit

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

68,481 (+22)

soedinglab/MMseqs2

MMseqs2: ultra fast and sensitive search and clustering suite

1,627 (+12)

mit

steineggerlab/foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

954 (+11)

gpl-3.0

danielecook/Awesome-Bioinformatics

A curated list of awesome Bioinformatics libraries and software.

3,407 (+8)

sokrypton/ColabFold

Making Protein folding accessible to all!

2,189 (+8)

mit

google/deepvariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

3,371 (+8)

bsd-3-clause

biopython/biopython

Official git repository for Biopython (originally converted from CVS)

4,561 (+8)

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

55 (+6)

apache-2.0

Adibvafa/CodonTransformer

CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.

136 (+5)

apache-2.0

mims-harvard/TDC

Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science

1,077 (+5)

mit

galaxyproject/galaxy

Data intensive science for everyone.

1,480 (+5)

seandavi/awesome-single-cell

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

3,350 (+5)

mit

BigDataBiology/argNorm

ARG normalization by mapping to the ARO ontology.

40 (+5)

mit

mims-harvard/PrimeKG

Precision Medicine Knowledge Graph (PrimeKG)

523 (+4)

mit

OpenGene/fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

2,052 (+4)

mit

allenai/scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

1,786 (+3)

apache-2.0

scverse/scanpy

Single-cell analysis in Python. Scales to >100M cells.

2,064 (+3)

bsd-3-clause

Last week (relative gain)

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

48 (+269%)

mit

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204 (+40%)

mit

BigDataBiology/argNorm

ARG normalization by mapping to the ARO ontology.

40 (+14%)

mit

ACEnglish/kanpig

Kmer Analysis of Pileups for Genotyping

27 (+13%)

mit

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

55 (+12%)

apache-2.0

OSU-NLP-Group/ScienceAgentBench

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

78 (+7%)

mit

ychuest/Awesome-LLMs-meet-genomes

Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in genomes.

46 (+5%)

mit

KennthShang/PhaBOX

Local version of the virus identification and analysis web server (tool set)

47 (+4%)

afl-3.0

Adibvafa/CodonTransformer

CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.

136 (+4%)

apache-2.0

biopragmatics/pyobo

📛 A Python package for using ontologies, terminologies, and biomedical nomenclatures

65 (+3%)

mit

HaojiaWu/CellScopes.jl

A Julia package for single cell and spatial data analysis

36 (+3%)

mit

epigen/enrichment_analysis

A Snakemake workflow and MrBiomics module for performing genomic region set and gene set enrichment analyses using LOLA, GREAT, GSEApy, pycisTarget and RcisTarget.

36 (+3%)

mit

mbhall88/nohuman

Remove human reads from a sequencing run

40 (+3%)

mit

stjude-rust-labs/wdl

Rust crates for working with Workflow Description Language (WDL) documents.

48 (+2%)

apache-2.0

rajewsky-lab/openst

Open-ST: profile and analyze tissue transcriptomes in 3D with high resolution in your lab

97 (+2%)

medema-group/BiG-SCAPE

Similarity networks of biosynthetic gene clusters

106 (+2%)

agpl-3.0

haddocking/haddock3

Official repo of the modular BioExcel version of HADDOCK

136 (+1%)

apache-2.0

Electrostatics/pdb2pqr

PDB2PQR - determining titration states, adding missing atoms, and assigning charges/radii to biomolecules.

142 (+1%)

yeeus/GCI

A program for assessing the T2T genome continuity

71 (+1%)

mit

BigDataBiology/macrel

Predict AMPs in (meta)genomes and peptides

75 (+1%)

Last month (new repositories)

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204

mit

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

mit

xzx0554/EVO2-Virus

A deep learning model (EVO2-500M) for predicting host specificity of eukaryote-infecting viruses CDNA sequence

Last month (absolute gain)

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204 (+202)

mit

plotly/dash

Data Apps & Dashboards for Python. No JavaScript Required.

22,294 (+149)

mit

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

68,481 (+144)

shenwei356/rush

A cross-platform command-line tool for executing jobs in parallel

1,002 (+82)

mit

danielecook/Awesome-Bioinformatics

A curated list of awesome Bioinformatics libraries and software.

3,407 (+54)

soedinglab/MMseqs2

MMseqs2: ultra fast and sensitive search and clustering suite

1,627 (+48)

mit

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

48 (+45)

mit

sokrypton/ColabFold

Making Protein folding accessible to all!

2,189 (+40)

mit

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

55 (+39)

apache-2.0

steineggerlab/foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

954 (+36)

gpl-3.0

biopython/biopython

Official git repository for Biopython (originally converted from CVS)

4,561 (+33)

BaranziniLab/KG_RAG

Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

819 (+30)

apache-2.0

google/deepvariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

3,371 (+28)

bsd-3-clause

seandavi/awesome-single-cell

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

3,350 (+27)

mit

nextflow-io/nextflow

A DSL for data-driven computational pipelines

2,905 (+27)

apache-2.0

scverse/scanpy

Single-cell analysis in Python. Scales to >100M cells.

2,064 (+25)

bsd-3-clause

xzx0554/EVO2-Virus

A deep learning model (EVO2-500M) for predicting host specificity of eukaryote-infecting viruses CDNA sequence

32 (+25)

lh3/minimap2

A versatile pairwise aligner for genomic and spliced nucleotide sequences

1,927 (+24)

OpenGene/fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

2,052 (+23)

mit

biotite-dev/biotite

A comprehensive library for computational molecular biology

784 (+22)

bsd-3-clause

Last month (relative gain)

xzx0554/EVO2-Virus

A deep learning model (EVO2-500M) for predicting host specificity of eukaryote-infecting viruses CDNA sequence

32 (+357%)

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

55 (+244%)

apache-2.0

cgoliver/rnaglib

Deep learning datasets for RNA 3D and 2.5D structures.

31 (+82%)

mit

stjude-rust-labs/sprocket

A bioinformatics workflow engine built on top of the Workflow Description Language (WDL).

67 (+40%)

apache-2.0

Future-House/BixBench

Benchmark for LLM-based Agents in Computational Biology

30 (+30%)

apache-2.0

BigDataBiology/argNorm

ARG normalization by mapping to the ARO ontology.

40 (+25%)

mit

ACEnglish/kanpig

Kmer Analysis of Pileups for Genotyping

27 (+23%)

mit

ychuest/Awesome-LLMs-meet-genomes

Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in genomes.

46 (+21%)

mit

stjude-rust-labs/wdl

Rust crates for working with Workflow Description Language (WDL) documents.

48 (+20%)

apache-2.0

mjendrusch/salad

protein structure generation with sparse all-atom denoising models

30 (+20%)

apache-2.0

Adibvafa/CodonTransformer

CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.

136 (+13%)

apache-2.0

OSU-NLP-Group/ScienceAgentBench

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

78 (+13%)

mit

stjude-rust-labs/crankshaft

A Rust-based, headless workflow execution framework supporting local, cloud, and HPC.

37 (+12%)

apache-2.0

scikit-fingerprints/scikit-fingerprints

Scikit-learn compatible library for molecular fingerprints

216 (+10%)

mit

BasedLabs/NoLabs

Open source biolab

127 (+9%)

apache-2.0

epigen/enrichment_analysis

A Snakemake workflow and MrBiomics module for performing genomic region set and gene set enrichment analyses using LOLA, GREAT, GSEApy, pycisTarget and RcisTarget.

36 (+9%)

mit

shenwei356/rush

A cross-platform command-line tool for executing jobs in parallel

1,002 (+9%)

mit

evotools/nf-LO

A Nextflow workflow to generate lift over files for any pair of genomes

64 (+8%)

mit

shenwei356/perfect-bioinformatic-tools

What should perfect bioinformatic tools be like?

117 (+8%)

cc0-1.0

Rbfinch/grepq

quickly filter fastq files by matching sequences to a set of regex patterns

52 (+8%)

mit

Last 12-months (new repositories)

zeqianli/tgv

Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

204

mit

BioOmics/iSeq

Download sequencing data and metadata from GSA, SRA, ENA, and DDBJ databases.

184

mit

Wainberg/ryp

R inside Python

173

mit

Adibvafa/CodonTransformer

CodonTransformer: The ultimate tool for codon optimization, optimizing DNA sequences for heterologous protein expression across 164 species.

136

apache-2.0

lh3/ropebwt3

BWT construction and search

111

helicalAI/helical

This repository contains the python package for Helical

107

agpl-3.0

Bindwell/APPT

Affinity Protein-Protein Transformers—State of the art protein-protein binding affinity in seconds!

OSU-NLP-Group/ScienceAgentBench

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

mit

rcedgar/reseek

Protein structure alignment and search algorithm

gpl-3.0

servierhub/top-pharma50

Top open source software from the top 50 pharmaceutical companies

biodatageeks/polars-bio

Blazing-Fast Bioinformatic Operations on Python DataFrames

apache-2.0

mbhall88/lrge

Genome size estimation from long read overlaps

mit

Rbfinch/grepq

quickly filter fastq files by matching sequences to a set of regex patterns

mit

genomoncology/biomcp

BioMCP: Biomedical Model Context Protocol

mit

ychuest/Awesome-LLMs-meet-genomes

Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in genomes.

mit

ayueme/R_beginners

医学生R语言零基础入门

mit

Juke34/awesome-awesomeness-bioinformatics

A list of awesome awesomeness related to bioinformatics and associated fields

cc-by-4.0

theislab/DRVI

Unsupervised Deep Disentangled Representation of Single-Cell Omics

bsd-3-clause

stjude-rust-labs/crankshaft

A Rust-based, headless workflow execution framework supporting local, cloud, and HPC.

apache-2.0

MIDIfactory/AlphaFastPPi

Fast AlphaFold-Multimer based pipeline for Protein-Protein Interaction (PPI) screening

gpl-3.0

Last 12-months (absolute gain)

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

68,481 (+3,755)

plotly/dash

Data Apps & Dashboards for Python. No JavaScript Required.

22,294 (+1,850)

mit

danielecook/Awesome-Bioinformatics

A curated list of awesome Bioinformatics libraries and software.

3,407 (+542)

BaranziniLab/KG_RAG

Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks

819 (+523)

apache-2.0

sokrypton/ColabFold

Making Protein folding accessible to all!

2,189 (+494)

mit

seandavi/awesome-single-cell

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

3,350 (+454)

mit

biopython/biopython

Official git repository for Biopython (originally converted from CVS)

4,561 (+409)

Starlitnightly/omicverse

A python library for multi omics included bulk, single cell and spatial RNA-seq analysis.

603 (+407)

gpl-3.0

soedinglab/MMseqs2

MMseqs2: ultra fast and sensitive search and clustering suite

1,627 (+385)

mit

nextflow-io/nextflow

A DSL for data-driven computational pipelines

2,905 (+367)

apache-2.0

steineggerlab/foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

954 (+339)

gpl-3.0

scverse/scanpy

Single-cell analysis in Python. Scales to >100M cells.

2,064 (+324)

bsd-3-clause

google/deepvariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

3,371 (+299)

bsd-3-clause

OpenGene/fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

2,052 (+294)

mit

moshi4/pyCirclize

Circular visualization in Python (Circos Plot, Chord Diagram, Radar Chart)

859 (+264)

mit

lh3/minimap2

A versatile pairwise aligner for genomic and spliced nucleotide sequences

1,927 (+250)

joybio/multiPrime

multiPrime is a mismatch-tolerant minimal primer set design tool for large and diverse sequences (e.g. Virus). Here is a web-based version (test: http://multiPrime.cn)

447 (+241)

mit

biotite-dev/biotite

A comprehensive library for computational molecular biology

784 (+231)

bsd-3-clause

Marsilea-viz/marsilea

Declarative creation of composable visualization for Python (Complex heatmap, Upset plot, Oncoprint and more~)

261 (+220)

mit

mims-harvard/PrimeKG

Precision Medicine Knowledge Graph (PrimeKG)

523 (+217)

mit

Last 12-months (relative gain)

Wainberg/ryp

R inside Python

173 (+4,225%)

mit

epigen/MrBiomics

MrBiomics: Modules & Recipes augment Bioinformatics for Multi-Omics Analyses

49 (+1,125%)

mit

steineggerlab/foldmason

Multiple Protein Structure Alignment at Scale with FoldMason

164 (+1,071%)

gpl-3.0

scikit-fingerprints/scikit-fingerprints

Scikit-learn compatible library for molecular fingerprints

216 (+800%)

mit

Bindwell/APPT

Affinity Protein-Protein Transformers—State of the art protein-protein binding affinity in seconds!

81 (+800%)

stjude-rust-labs/sprocket

A bioinformatics workflow engine built on top of the Workflow Description Language (WDL).

67 (+738%)

apache-2.0

mjendrusch/salad

protein structure generation with sparse all-atom denoising models

30 (+650%)

apache-2.0

mbhall88/lrge

Genome size estimation from long read overlaps

52 (+643%)

mit

MannLabs/alphadia

modular & open DIA search

67 (+570%)

apache-2.0

rcedgar/usearch12

Open-source usearch

109 (+541%)

gpl-3.0

Marsilea-viz/marsilea

Declarative creation of composable visualization for Python (Complex heatmap, Upset plot, Oncoprint and more~)

261 (+537%)

mit

paulsengroup/hictk

Blazing fast toolkit to work with .hic and .cool files

30 (+500%)

mit

lbcb-sci/RiNALMo

RiboNucleic Acid (RNA) Language Model

92 (+411%)

apache-2.0

hancockinformatics/pathlinkR

Analysis and visualization of RNA-Seq results

29 (+383%)

gpl-3.0

stjude-rust-labs/crankshaft

A Rust-based, headless workflow execution framework supporting local, cloud, and HPC.

37 (+363%)

apache-2.0

xzx0554/EVO2-Virus

A deep learning model (EVO2-500M) for predicting host specificity of eukaryote-infecting viruses CDNA sequence

32 (+357%)

mbhall88/nohuman

Remove human reads from a sequencing run

40 (+344%)

mit

rajewsky-lab/openst

Open-ST: profile and analyze tissue transcriptomes in 3D with high resolution in your lab

97 (+304%)

DessimozLab/FastOMA

FastOMA is a scalable software package to infer orthology relationship.

74 (+270%)

mpl-2.0

bionumpy/bionumpy

Python library for array programming on biological datasets. Documentation available at: https://bionumpy.github.io/bionumpy/

280 (+226%)

mit