🧪 Beta — BiohacksAI is under active development. Data and features may change.
B
BiohacksAI
Evidence-Based Biohacking
BetaPatent Pending
B
BiohacksAI
Evidence-Based Biohacking Platform

How BiohacksAI works

BiohacksAI is an AI-driven compound research and discovery platform built on 1.12 million PubMed studies, a deterministic graph of 9,238 substances, and a cryptographic evidence chain called X-Vault.

The platform does two things: it lets researchers query the scientific literature with evidence-grade answers, and it systematically discovers novel compound–pathway combinations that may have drug-discovery value.

9,238
Substances
3,120
Targets
2,209
Pathways
1.12M+
PubMed studies
01 — Corpus

The evidence layer

Every claim in BiohacksAI is backed by a specific PubMed study. The corpus is a curated, versioned snapshot of PubMed abstracts indexed by substance, target, pathway, and study type.

The corpus is locked at a specific version — currently v20260307-01. When you query the system you always get an answer from the same dataset, and every answer includes a Merkle root hash that proves which version was used.

Determinism — why it matters
The same question asked today and in six months will produce the same answer from the same corpus version. The result is reproducible and cryptographically bound to the data it came from. This is not how standard AI chatbots work.
Root hash
85b6464f…c8efb
Corpus version
v20260307-01
Reproducible
Yes — always
02 — Discovery Engine

How discoveries are generated

The discovery engine scans the corpus for compounds that have strong biological target coverage but low literature footprint — meaning the scientific community has not yet fully characterised them.

These candidates are ranked by a two-layer scoring system and sealed cryptographically into the X-Vault evidence chain.

DISCOVERY_v1.1 — six-step pipeline
1
Filter candidates
9,238 compounds filtered: discovery_score > 40, studies < 20, targets ≥ 3, pathway diversity ≥ 2.
CACHE.underratedIndex → filterCandidates()
2
Deduplicate against vault
Each candidate gets a discovery_hash. If that exact compound × target × pathway × corpus_version combination is already sealed, it is skipped — discoveries are never duplicated.
sha256(compound + sorted_targets + sorted_pathways + corpus_version)
3
Enrich
Top molecular targets and biological pathways are resolved from the graph. Biotech score is computed. High-value target hits are identified from the HIGH_VALUE_TARGETS registry.
computeBiotechScore() · HIGH_VALUE_TARGETS
4
Seal into X-Vault
The discovery event is appended to the WORM chain. An event_hash and chain_hash are generated. This step is permanent and immutable.
discovery-seal.js → WORM JSONL
5
Generate PDF
A cryptographically sealed PDF report is generated: mechanism hypothesis, targets, pathways, X-Vault seal, validation roadmap.
6
Write ledger
A JSON ledger entry is written. This is the source of truth for the marketplace listing.
data/discoveries/disc_YYYYMMDD_NNN.json
Nightly automation
The pipeline runs automatically every night at 02:00 UTC, sealing up to 50 new discoveries per run. A compound will not appear twice for the same pathway context against the same corpus version.
03 — Scoring

Discovery score and biotech score

Every compound is evaluated by two complementary scores. The discovery score measures biological breadth. The biotech score measures drug-discovery relevance.

Discovery Score — biological breadth
target_count × pathway_diversity / log(studies + 2)

Compounds with many validated targets, broad pathway coverage, and low literature footprint score highest. The log-damping on studies ensures that well-studied compounds are de-prioritised in favour of understudied ones.

Biotech Score — drug-discovery relevance (DISCOVERY_v1.1)
(target_relevance × pathway_diversity) / log(studies + 2)

Target relevance is a weighted sum based on the HIGH_VALUE_TARGETS registry — 60+ gene symbols classified as Tier 3 (kinases, transcription factors, metabolic regulators: EGFR, NFE2L2, MTOR, SIRT1, TP53, BRD4, JAK1…) or Tier 2 (research-grade: MAPK1, SOD1, APP, HDAC2…). A compound that hits a Tier 3 target scores higher than one that hits an unstudied gene.

TierCriteriaShared licenseExclusive
Biohackerscore < 60 or studies > 10$99$990
Researchscore ≥ 60, studies ≤ 10$299$4,900
Patent-Gradescore ≥ 100, studies ≤ 5$999$24,900
04 — X-Vault

Cryptographic evidence sealing

Every discovery is sealed into the X-Vault — an append-only, SHA-256 hash-chained log that provides a cryptographically verifiable record of when a discovery was made and what corpus version it was based on.

Event hash
SHA-256 of the complete discovery event in canonical JSON. Proves the content has not been altered.
Chain hash
SHA-256 of (previous chain hash + event hash). Links every discovery to all prior discoveries — tampering breaks the chain.
Discovery hash
Deduplication key: sha256(compound + sorted targets + sorted pathways + corpus version). Prevents re-sealing identical discoveries.
Vault index
Sequential integer. #1 came before #2. Order is immutable and independently verifiable.

The X-Vault WORM log is stored server-side and is never rewritten — only appended to. Anyone who purchases a discovery license receives the event hash and chain hash, which can be independently verified at biohacksai.com/verify.

Why this matters for IP
The X-Vault seal provides a cryptographic timestamp for the discovery. When a patent examiner asks "when was this mechanism first documented?", the chain hash and vault index provide a tamper-evident answer. The seal is bound to the specific corpus version, making the evidence chain independently auditable.
05 — Mechanism Graph

Visualising how a compound works

Every discovery includes a mechanism discovery graph — a force-directed visualisation showing the relationship between the compound, its molecular targets, and the biological pathways those targets regulate.

Three-ring graph layout
Centre · Compound

Fixed at centre. The compound being analysed.

Ring 1 · Molecular targets

The gene/protein targets the compound interacts with. Colour-coded by drug-discovery relevance: gold = Tier 3 (patent-relevant kinases and TFs), indigo = Tier 2 (research targets), teal = standard targets. A pulsing dashed ring marks Tier 3 targets with ≤ 5 studies — the knowledge gap indicator.

Ring 2 · Biological pathways

The biological systems those targets belong to (Reactome). Shown as dashed rectangles connected by dotted lines from their associated targets.

06 — Marketplace

Discovery licenses

Discoveries can be licensed in two ways. Both include the full discovery report PDF, X-Vault seal reference, and a signed license agreement issued within 24 hours.

Shared Research License
  • ✓ Full discovery report PDF (20–30 pages)
  • ✓ X-Vault seal + public verification link
  • ✓ Research and publication rights
  • ✓ 10% royalty to buyer if patent arises
  • ✓ Multiple buyers permitted
BiohacksAI retains 90% IP
Exclusive License
  • ✓ Everything in Shared
  • ✓ Commercial + patent filing rights
  • ✓ Single buyer — exclusivity guaranteed
  • ✓ 90% IP transferred to buyer
  • ✓ Exclusivity recorded in X-Vault
BiohacksAI retains 10% royalty on commercialisation

Purchases are handled by email to hello@biohacksai.com. Include the Discovery ID in your subject line.

07 — Philosophy

AI proposes. Humans decide.

BiohacksAI is built on the EVE (Evidence & Verification Engine) architecture developed by Organiq Sweden AB. The core principle is witness mode: the AI generates hypotheses and surfaces evidence, but all decisions are made by humans and recorded with human-controlled cryptographic keys.

What witness mode means in practice
  • — The discovery engine never licenses anything autonomously
  • — Every discovery seal requires a verified admin key
  • — License agreements are signed by humans, not auto-generated
  • — The system logs what it proposed and when — the human logs what they decided
  • — No discovery can be unsealed or retroactively altered

All discoveries are labelled as computational hypothesis candidates — not confirmed mechanisms. Independent experimental validation is always required. Patent eligibility must be assessed by qualified patent counsel. IP assessments on this platform are computational only and do not constitute legal advice.

08 — Verification

Verify any discovery

Anyone can verify a discovery — no account required. Paste a Discovery ID (format: DISC-compound-YYYYMMDD-xxxxxxxx) or any part of the event hash into the verification page. The system walks the X-Vault chain and confirms whether the discovery exists and whether the chain integrity is intact.

09 — Data Attribution

Sources & licenses

BiohacksAI is built on publicly available scientific databases. We are grateful to the teams behind these resources.

SourceUsageLicense
PubMed / NCBILiterature corpus (1.12M abstracts)Public domain (NLM)
ChEMBLBioactivity data, targets, assaysCC BY-SA 3.0
BindingDBBinding affinity dataCC BY 3.0
ReactomePathway graph (2,209 pathways)CC BY 4.0
UniProtProtein / target metadataCC BY 4.0
PubChemCompound identifiers (CIDs)Public domain (NIH)

Data is used for non-commercial research and hypothesis generation. Bioactivity records from ChEMBL and BindingDB are aggregated and scored; individual assay records are not redistributed. If you believe any usage conflicts with a source license, contact hello@biohacksai.com.