Prioritization Scores
OMICs
Omics score values represent the probability of a given evidence group indicating the association of a given gene to a given disease. All scores range from 0 to 1, with 0 indicating no evidence, and 1 - the highest degree of evidence
Heterogeneous Graph Walk (HeroWalk) is a guided random walk-based approach that is applied to a heterogeneous graph. The model learns node representations and then finds gene nodes close to the reference disease node. First, the "walks" are sampled with a predefined meta-path, i.e. fixed sequence of node types in a walk, e.g. 'gene'-'disease'-'gene.' The node degree controls the probability of transition between the nodes while sampling. Following that, the SkipGram model learns the representation of each node based on the resulting corpus of walks. The cosine similarity between the specific disease and all genes produces a ranked list of genes. The top genes from this list are predicted to be promising target hypotheses.
Text-based (NLP)
This group of scores is based on the analysis of text sources, including scientific publications, grants, patents, clinical trials. Insilico monitors the biomedical text data and uses advanced NLP and AI-based technology, to analyze the contextualized mentions of entities such as genes, diseases, drugs and KOLs in a variety of data sources. The scores are calculated for each gene in the disease-agnostic manner (total mentions of the gene), and in the context of disease areas of interest. Attention spikes are calculated for target — disease associations only (no disease-agnostic scores).
Attention score measures the overall attention to the target at all times. Attention score is based on the total mentions of a gene in the text data, including scientific publications, grants, patents, clinical trials, across all time periods (both disease-agnostic and disease-specific).
Financial scores
Grant funding at all times. Skewed distribution, average entry with non-zero funding has $ 8 900 000 funding, but median funding is $ 1 700 000. 5% of entries have funding above $ 30 000 000.
Key Opinion Leaders (KOL)
Average Hirsch index is calculated over publications, mentioning given gene-disease association.