Models & datasets

Aura-tahoe-100M

Explore single-cell dataset Tahoe-100M for cancer research.

Tahoe-100M is a giga-scale single-cell perturbation atlas consisting of over 100 million transcriptomic profiles from 50 cancer cell lines exposed to 1,100 small-molecule perturbations. Generated using Vevo Therapeutics' Mosaic high-throughput platform, Tahoe-100M enables deep, context-aware exploration of gene function, cellular states, and drug responses at unprecedented scale and resolution. This dataset is designed to power the development of next-generation AI models of cell biology, offering broad applications across systems biology, drug discovery, and precision medicine.

Reference: https://huggingface.co/datasets/tahoebio/Tahoe-100M

Aura-X-assistant

For rapid structure analysis and function prediction

Model/Tool

Description

Alphafold 2

Protein folding prediction using deep learning

Colabfold

Lightweight, faster version of AlphaFold for structure prediction

ESM

Evolutionary scale modeling for protein structure/folding

Bolz-1, Boltz-2

Protein folding and interaction prediction with affinity

ProTrek

Protein sequence generation and optimization

ESM2

Next-generation ESM model for sequence design and property prediction

Denovo-Pinal

De novo protein sequence generation

Evolla + Llama3 8b

Hybrid model for property prediction and generation

SaProt

Functional annotation and sequence optimization

BiomedNLP

Biomedical natural language processing for contextual predictions

SignalP 6.0

Signal peptide detection

AMP-Net

Antimicrobial peptide prediction

DeepLoc

Protein subcellular localization prediction

Aura-Z-thinking

For advanced modeling and deeper exploration

Model/Tool

Description

OpenMM

Molecular dynamics simulations for protein-ligand systems

PDB Search

Protein Data Bank search for small molecules and proteins

UniProt

Protein sequence and functional information database

Aura-TX-gemma

Tailored for therapeutic discovery and clinical applications

Model/Tool

Description

Wikipedia, Web and PubMed Search

Literature search for translational science

BLASTp

Protein sequence alignment and homology search

Antibody affinity

Prediction of antibody-antigen binding affinity

Molecule Search

Chemical molecule and ligand database search

SMILES to Therapy

Converts SMILES strings into therapeutic candidates

SMILES Description

Generates properties and annotations from SMILES strings

Drug-target interaction

Predicts interactions between drugs and targets

Protein-protein interaction

Predicts interactions between proteins

Toxicity

Predicts potential toxic effects of molecules

Pharmacokinetics

Predicts ADME (absorption, distribution, metabolism, excretion) properties

Developability

Assesses therapeutic molecule manufacturability

CRISPR Repair

Predicts and designs CRISPR-based gene editing outcomes

Last updated

Was this helpful?