Starred repositories
Official repository for the Boltz-1 biomolecular interaction model
KGWAS: novel genetics discovery enabled by massive functional genomics knowledge graph
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
BiomedParse: A Foundation Model for Joint Segmentation, Detection, and Recognition of Biomedical Objects Across Nine Modalities
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Official Implementation of NeurIPS 2024 paper "G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering"
Benchmarking DNA Language Models on Biologically Meaningful Tasks
ProtTrans provides state-of-the-art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit and hundreds of Google TPUs using Transformer models.
Multi-class signal peptide prediction and structure decoding model.
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
User-friendly and accurate binder design pipeline
Retrieve and summarize bioRxiv preprints with a local LLM using ollama
A modular graph-based Retrieval-Augmented Generation (RAG) system
Technology-invariant pipeline for spatial omics analysis that scales to millions of cells (Xenium / Visium HD / MERSCOPE / CosMx / PhenoCycler / MACSima / etc)
Open Targets Python framework for post-GWAS analysis
gReLU is a Python library to train, interpret, and apply deep learning models to DNA sequences.
Easily install and load packages from the tidyomics ecosystem
HEST: Bringing Spatial Transcriptomics and Histopathology together - NeurIPS 2024 (Spotlight)
[IJCAI 2023 survey track] A curated list of resources for chemical pre-trained models
Tool for plotting sequencing data along genomic coordinates.
GWAS analyses across the diverse UKBB superpopulations
Clinical Knowledge Graph (CKG) is a platform with twofold objective: 1) build a graph database with experimental data and data imported from diverse biomedical databases 2) automate knowledge disco…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
🧬 gget enables efficient querying of genomic reference databases
📋 A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
🦜🔗 Build context-aware reasoning applications