This pipeline takes the 46-gene amyotrophic lateral sclerosis panel and pulls the evidence around the panel genes from six public databases: protein records from UniProt, interaction partners from STRING, classified variants from ClinVar, pathway membership from Reactome, literature from PubMed, and disease associations and known drugs from Open Targets. It links the records into one graph, scores the genes, and writes 589 candidate mechanisms tied to real citations. Rows trace back to cached API responses.
Built 2026-06-21 · Six public data sources · Reproducible · Code on GitHub
The 46 seed genes resolve to UniProt accessions, then query across five more sources. The responses are cached on disk and normalized into a small database: genes, interactions, variants, pathways, disease associations, drugs, and papers. The graph that results carries 405 interaction edges, 2,424 variants, 163 Reactome pathways, and 362 papers with 244 distinct PubMed ids.
Wiping the cache and running again rebuilds the database to the same shape, which is the test that no row was seeded by hand. The full provenance, with spot-checked records and their source URLs, ships with the project.
| Source | Records | Endpoint | What it provides |
|---|---|---|---|
| UniProt | 46 proteins | rest.uniprot.org | gene to accession, name |
| STRING | 405 edges | string-db.org | interactions, confidence ≥ 0.7 |
| NCBI ClinVar | 2,424 variants | eutils.ncbi.nlm.nih.gov | clinical classification |
| NCBI PubMed | 362 papers | eutils.ncbi.nlm.nih.gov | literature, citations |
| Reactome | 163 pathways | reactome.org | pathway membership |
| Open Targets | 92 ALS links, 110 drugs | platform.opentargets.org | disease association, known drugs |
Genes are scored on a weighted blend of Open Targets association, ClinVar pathogenicity, pathway and network centrality, literature volume and citation quality, and druggability. The ALS association column is the Open Targets score for the gene against amyotrophic lateral sclerosis. Click a row for a plain-language breakdown, including the published share of familial and sporadic ALS and the phenotype specificity (from GeneReviews where available; formal per-gene sensitivity and specificity are not standard).
| # | Gene | UniProt | Protein | ALS assoc. | Score |
|---|---|---|---|---|---|
| 1 | TUBA4A | P68366 | Tubulin alpha-4A chain | 0.777 | 0.725 |
| 2 | ERBB4 | Q15303 | Receptor tyrosine-protein kinase erbB-4 | 0.662 | 0.627 |
| 3 | SOD1 | P00441 | Superoxide dismutase [Cu-Zn] | 0.870 | 0.605 |
| 4 | TBK1 | Q9UHD2 | Serine/threonine-protein kinase TBK1 | 0.635 | 0.569 |
| 5 | VCP | P55072 | Transitional endoplasmic reticulum ATPase | 0.664 | 0.555 |
| 6 | SIGMAR1 | Q99720 | Sigma non-opioid intracellular receptor 1 | 0.705 | 0.554 |
| 7 | SQSTM1 | Q13501 | Sequestosome-1 | 0.809 | 0.533 |
| 8 | OPTN | Q96CV9 | Optineurin | 0.771 | 0.514 |
| 9 | PFN1 | P07737 | Profilin-1 | 0.577 | 0.502 |
| 10 | FUS | P35637 | RNA-binding protein FUS | 0.812 | 0.497 |
| 11 | HNRNPA1 | P09651 | Heterogeneous nuclear ribonucleoprotein A1 | 0.641 | 0.495 |
| 12 | ANG | P03950 | Angiogenin | 0.787 | 0.495 |
| 13 | CYLD | Q9NQC7 | Ubiquitin carboxyl-terminal hydrolase CYLD | 0.000 | 0.488 |
| 14 | UBQLN2 | Q9UHD9 | Ubiquilin-2 | 0.609 | 0.487 |
| 15 | FIG4 | Q92562 | Polyphosphoinositide phosphatase | 0.537 | 0.486 |
| 16 | KIF5A | Q12840 | Kinesin heavy chain isoform 5A | 0.708 | 0.482 |
| 17 | CCNF | P41002 | Cyclin-F | 0.418 | 0.482 |
| 18 | SPG11 | Q96JI7 | Spatacsin | 0.198 | 0.477 |
| 19 | ANXA11 | P50995 | Annexin A11 | 0.774 | 0.467 |
| 20 | TARDBP | Q13148 | TAR DNA-binding protein 43 | 0.820 | 0.464 |
| 21 | NEK1 | Q96PY6 | Serine/threonine-protein kinase Nek1 | 0.581 | 0.464 |
| 22 | ALS2 | Q96Q42 | Alsin | 0.585 | 0.459 |
| 23 | C9orf72 | Q96LT7 | Guanine nucleotide exchange factor C9orf72 | 0.610 | 0.444 |
| 24 | SPTLC1 | O15269 | Serine palmitoyltransferase 1 | 0.318 | 0.438 |
| 25 | CHMP2B | Q9UQN3 | Charged multivesicular body protein 2b | 0.773 | 0.427 |
| 26 | DCTN1 | Q14203 | Dynactin subunit 1 | 0.729 | 0.416 |
| 27 | PRPH | P41219 | Peripherin | 0.467 | 0.408 |
| 28 | SETX | Q7Z333 | Helicase senataxin | 0.378 | 0.384 |
| 29 | ATXN2 | Q99700 | Ataxin-2 | 0.444 | 0.354 |
| 30 | UNC13A | Q9UPW8 | Protein unc-13 homolog A | 0.469 | 0.330 |
| 31 | UBQLN4 | Q9NRR5 | Ubiquilin-4 | 0.463 | 0.307 |
| 32 | VAPB | O95292 | Vesicle-associated membrane protein-associated protein B/C | 0.402 | 0.300 |
| 33 | TIA1 | P31483 | Cytotoxic granule associated RNA binding protein TIA1 | 0.078 | 0.292 |
| 34 | MATR3 | P43243 | Matrin-3 | 0.747 | 0.289 |
| 35 | LRP12 | Q9Y561 | Low-density lipoprotein receptor-related protein 12 | 0.377 | 0.287 |
| 36 | CFAP410 | O43822 | Cilia- and flagella-associated protein 410 | 0.568 | 0.271 |
| 37 | MOBP | Q13875 | Myelin-associated oligodendrocyte basic protein | 0.444 | 0.251 |
| 38 | DAO | P14920 | D-amino-acid oxidase | 0.472 | 0.250 |
| 39 | FGGY | Q96C11 | FGGY carbohydrate kinase domain-containing protein | 0.372 | 0.238 |
| 40 | SCFD1 | Q8WVM8 | Sec1 family domain-containing protein 1 | 0.285 | 0.236 |
| 41 | TMEM106B | Q9NUM4 | Transmembrane protein 106B | 0.117 | 0.227 |
| 42 | NEFH | P12036 | Neurofilament heavy polypeptide | 0.492 | 0.226 |
| 43 | TRPM7 | Q96QT4 | Transient receptor potential cation channel subfamily M member 7 | 0.021 | 0.224 |
| 44 | CHCHD10 | Q8WYQ3 | Coiled-coil-helix-coiled-coil-helix domain-containing protein 10, mitochondrial | 0.409 | 0.204 |
| 45 | ELP3 | Q9H9T3 | Elongator complex protein 3 | 0.216 | 0.195 |
| 46 | TAF15 | Q92804 | TATA-binding protein-associated factor 2N | 0.036 | 0.157 |
ALS assoc. is the Open Targets disease-association score (0 to 1). Score is the pipeline's combined gene rank.
Open Targets returns 110 drugs and clinical candidates whose molecular target is one of the panel genes, with real ChEMBL ids and development stages. TOFERSEN, the approved SOD1 antisense therapy, is among them. Most-advanced shown first. Click a row for the drug detail.
| Drug | ChEMBL | Stage | Target | Mechanism |
|---|---|---|---|---|
| Afatinib | CHEMBL1173655 | Approved | ERBB4 | Receptor protein-tyrosine kinase erbB-4 inhibitor |
| Afatinib Dimaleate | CHEMBL2105712 | Approved | ERBB4 | Receptor protein-tyrosine kinase erbB-4 inhibitor |
| Belantamab Mafodotin | CHEMBL4298209 | Approved | TUBA4A | Tumor necrosis factor receptor superfamily member 17 binding agent |
| Brentuximab Vedotin | CHEMBL1742994 | Approved | TUBA4A | Tubulin inhibitor |
| Cabazitaxel | CHEMBL1201748 | Approved | TUBA4A | Tubulin inhibitor |
| Carbetapentane | CHEMBL73234 | Approved | SIGMAR1 | Sigma opioid receptor agonist |
| Carbetapentane Citrate | CHEMBL1256696 | Approved | SIGMAR1 | Sigma opioid receptor agonist |
| Colchicine | CHEMBL107 | Approved | TUBA4A | Tubulin inhibitor |
| Dacomitinib | CHEMBL2105719 | Approved | ERBB4 | Epidermal growth factor receptor erbB1 inhibitor |
| Dextromethorphan | CHEMBL52440 | Approved | SIGMAR1 | Sigma opioid receptor agonist |
| Dextromethorphan Hydrobromide | CHEMBL1256818 | Approved | SIGMAR1 | Sigma opioid receptor agonist |
| Dextromethorphan Polistirex | CHEMBL1201461 | Approved | SIGMAR1 | Sigma opioid receptor agonist |
| Docetaxel | CHEMBL3545252 | Approved | TUBA4A | Tubulin inhibitor |
| Docetaxel Anhydrous | CHEMBL92 | Approved | TUBA4A | Tubulin stabiliser |
| Enfortumab Vedotin | CHEMBL3301589 | Approved | TUBA4A | Nectin-4 binding agent |
| Eribulin | CHEMBL1683590 | Approved | TUBA4A | Tubulin inhibitor |
| Eribulin Mesylate | CHEMBL1683544 | Approved | TUBA4A | Tubulin inhibitor |
| Fenfluramine | CHEMBL87493 | Approved | SIGMAR1 | Serotonin 2c (5-HT2c) receptor agonist |
Showing 18 of 110 mapped drugs.
Click a row to see which panel genes share the pathway.
| Pathway | Name | Panel genes |
|---|---|---|
| R-HSA-2132295 | MHC class II antigen presentation | 3 |
| R-HSA-8951664 | Neddylation | 3 |
| R-HSA-5205685 | PINK1-PRKN Mediated Mitophagy | 3 |
| R-HSA-114608 | Platelet degranulation | 3 |
| R-HSA-2565942 | Regulation of PLK1 Activity at G2/M Transition | 3 |
| R-HSA-5357905 | Regulation of TNFR1 signaling | 3 |
| R-HSA-5357786 | TNFR1-induced proapoptotic signaling | 3 |
| R-HSA-8854518 | AURKA Activation by TPX2 | 2 |
| R-HSA-936964 | Activation of IRF3, IRF7 mediated by TBK1, IKKε (IKBKE) | 2 |
| R-HSA-9646399 | Aggrephagy | 2 |
| R-HSA-5620912 | Anchoring of the basal body to the plasma membrane | 2 |
| R-HSA-6811434 | COPI-dependent Golgi-to-ER retrograde traffic | 2 |
| R-HSA-6811436 | COPI-independent Golgi-to-ER retrograde traffic | 2 |
| R-HSA-6807878 | COPI-mediated anterograde transport | 2 |
163 distinct Reactome pathways total; the ones touched by the most panel genes are shown.
The pipeline writes one hypothesis per pattern it finds in the graph: two ALS genes sharing a cell process, two proteins that work together, or a gene with disease-causing variants that links to a known ALS gene. Many read alike, so open one to see what sets it apart and to walk through how the pipeline got there, step by step. Search the text, or filter by mechanism type and confidence.
Nodes are panel genes; lines are STRING interactions between two panel genes at confidence 0.7 or higher. A node grows with its betweenness, a measure of how often it sits on the shortest path between two other panel genes, so the bridge genes read largest. Rust marks a gene with an Open Targets ALS association of 0.5 or higher; pacific marks the rest. Pick a layout to redraw the network, drag a node to move it, or rotate the whole view. Hover a node to trace its links, or click one to open its row in the gene table above.
37 STRING interactions run between the 46 panel genes. The full graph, with partners, variants, pathways, papers, and hypotheses, downloads as als_knowledge_graph.graphml for Gephi or Cytoscape.
Data comes from UniProt, STRING, NCBI ClinVar and PubMed, Reactome, and Open Targets. The public APIs are queried live and cached for reproducibility. Variant classifications use ClinVar germline classifications; drug stages come from Open Targets. This page is a worked example of an evidence-integration pipeline. It is informational and is not medical advice.