Overview

OptimusKG is a biomedical knowledge graph that unifies data from over 15 primary data sources into a single, ontology-grounded labeled property graph. It covers genes, diseases, drugs, phenotypes, exposures, anatomical structures, biological processes, molecular functions, cellular components, and pathways.

OptimusKG is built by the Optimus framework, a production-ready data pipeline for constructing, validating, and maintaining biomedical knowledge graphs.

At a Glance

Nodes: ~190K across 10 entity types
Edges: ~21M across 26 relationship types
Data Sources: 15+ direct sources, 40+ indirect sources
Relation Types: 30+ standardized relation types
Formats: CSV, Parquet, Neo4j
License: MIT

Every node and edge in the graph carries full data provenance, tracking the direct and indirect data sources that contributed to it.
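
To make the provenance idea concrete, here is a minimal sketch of a labeled property graph in which each node and edge records its direct and indirect sources. The class names, field names, identifiers, and source names (`Provenance`, `direct_sources`, `source_a`, and so on) are all hypothetical and chosen for illustration; they are not OptimusKG's actual schema or export columns.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


# Hypothetical schema: names are illustrative, not OptimusKG's real fields.
@dataclass(frozen=True)
class Provenance:
    direct_sources: Tuple[str, ...]          # sources ingested directly
    indirect_sources: Tuple[str, ...] = ()   # upstream sources behind them


@dataclass
class Node:
    id: str
    label: str                               # entity type, e.g. "gene"
    provenance: Provenance
    properties: Dict[str, str] = field(default_factory=dict)


@dataclass
class Edge:
    source: str                              # id of the source node
    target: str                              # id of the target node
    relation: str                            # relation type
    provenance: Provenance
    properties: Dict[str, str] = field(default_factory=dict)


def edges_from_source(edges: List[Edge], source_name: str) -> List[Edge]:
    """Return edges that list source_name as a direct or indirect source."""
    return [
        e for e in edges
        if source_name in e.provenance.direct_sources
        or source_name in e.provenance.indirect_sources
    ]


# Placeholder data; identifiers and source names are made up.
nodes = [
    Node("gene:1", "gene", Provenance(("source_a",))),
    Node("disease:1", "disease", Provenance(("source_a", "source_b"))),
]
edges = [
    Edge("gene:1", "disease:1", "associated_with",
         Provenance(("source_a",), ("source_c",))),
    Edge("disease:1", "gene:1", "has_marker",
         Provenance(("source_b",))),
]

matching = edges_from_source(edges, "source_c")
print([e.relation for e in matching])
```

Keeping provenance as a required field on every node and edge, rather than an optional property, mirrors the statement that every element in the graph carries it; a filter such as `edges_from_source` then works uniformly across the whole graph.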
