Raphael Sandaltzopoulos, PhD, MBA Lab. of Gene Expression, Molecular Diagnosis and Modern Therapeutics, MBG, DUTH BIOMARKERS DISCOVERY: An introduction Raphael Sandaltzopoulos, PhD, MBA Professor at MBG (Molecular Biology) rmsandal@mbg.duth.gr
Phycocyanin (PC) is a natural fluorescent protein PC is a pigment-protein complex involved in light-harvesting, along with Allophycocyanin [AP] and Phycoerythrin [PE]. PC absorbs light at 620 nm and emits at 652 nm. PC is one of the oldest proteins known to have almost completed its evolution about 3.7 billion yrs ago.
Phycocyanin consists of two subunits (α and β) PC subunits are decorated with tetrapyrrole chromophores covalently attached. Both subunits are required for efficient photon transport. In cells, PC dimers are organized in a hexameric structure. PC is water soluble, hypo-allergic and possesses potent anti-oxidative properties. PC is used in molecular diagnostics for labeling biomolecules [e.g. for confocal microscopy, FACS, in vivo staining, etc.]. PC is used as a natural coloring in cosmetics and food [e.g. ice-cream, pastry].
Purification of analytical-grade phycocyanin from Sp. platensis B) Yield: 1 gr/Kg of wet biomass Price of 1 Kg of wet biomass: 20 € Price of 1 gr of analytical grade PC: >1,000 € Added-value factor: x50
Differential scanning fluorimetry (DSF) allows monitoring the stability of a protein as a function of temperature 100000.0 200000.0 300000.0 400000.0 35 55 75 95 0.0 Fluorescence Temperature (oC) 2 1 GST-Elk-330 Sypro orange binds to exposed hydrophobic patches resulting in fluorescence emission. High Tm High structural stability Tm values may be determined by data analysis with the DSF Analysis Version 3.0 software package (ftp://ftp.sgc.ox.ac.uk/pub/biophyscs) and GraphPadPrism.
What is a Biomarker? Biomarker is a measureable characteristic which may be used as an indicator of a certain biological state or condition. Measureable: ease of sampling, reliability, reproducibility and accuracy of detection method. Indicator: robust correlation with biological condition. Biomarker is a measured characteristic which may be used as an indicator of a certain biological state or condition. Biomarker is a measured characteristic which may be used as an indicator of a certain biological state or condition. Biomarker is a measured characteristic which may be used as an indicator of a certain biological state or condition.
Difference of average: Pearson correlation: A good Biomarker shows a robust correlation with a certain biological condition. Difference of average: Is a value of a parameter significantly different among two groups which are defined by the value of a second parameter? Pearson correlation: How tightly does the value of one parameter correlate with the value of another parameter?
Example of difference of average (one or two-tailed t-test) Expression of T-lymphocyte markers (mRNA) is a reliable indicator of CD8+ T-lymphocyte infiltration in ovarian tumors CD3 staining CD8 staining
TTF1 is overexpressed in TIL+ Epithelial Ovarian Cancer ttf1 mRNA was quantified by qPCR in 45 ovarian tumors (classified as TIL+ or TIL-). mRNA levels were higher in TIL+ compared to TIL- samples (p=0.039).
TTF1 expression correlates with T-cell infiltration Example of Pearson correlation analysis qPCR analysis of the relative expression of ttf1 and cd8a mRNA in 96 ovarian cancer samples, verified a significant correlation (Pearson correlation=0.329, p<0.01).
TMEM132D is over-expressed in TIL+ Epithelial Ovarian Cancer Example of Pearson correlation analysis TMEM132D: Single-pass type I transmembrane protein. May serve as a cell-surface marker for oligodendrocyte differentiation. Implicated in anxiety disorders. 95% Confidence Interval: If we were to repeat the sampling 100 times and calculate a population of 100 r values, the population mean would probably lie between 0.747 and 0.923 95% of the time, the confidence intervals contain the true mean.
Common pitfalls of correlation analysis Correlation and causality: a. Correlation does not necessarily imply causality.
Common pitfalls of correlation analysis Correlation and causality: b. Normal distribution and lack of correlation does not necessarily imply independence. X and Y are uncorrelated; Both have the same normal distribution; X and Y are not independent.
Common pitfalls of correlation analysis Correlation and linearity: c. Correlation does not necessarily imply linearity Four sets of data with the same correlation of 0.816
The nature of a Biomarker
Adapted from Kaiser J, 2010. Science 330:576.
The nature of a Biomarker
Technology platforms commonly used in Biomarkers’ Discovery
In silico Biomarkers Biomarkers discovered by data analysis and mining workflows and algorithms capable of working with disparate data resources in the areas of gene expression, copy number, pathway networks, miRNA, and metabolomics data, next generation sequencing (NGS) data including RNA-seq, ChIP-seq, whole genome, exome, and others, using advanced bioinformatics modeling and development of analytics. http://icbi.georgetown.edu/in-silico/#sthash.E7RqMyt6.dpuf
The biomarker evolution pathway in translational medicine
An example of Risk Score Assessment of microRNA Biomarkers in NSCLC 0,969 x miR486 + 0,973 x miR30d - 0,650 x miR1 - 0,815 miR499 Adapted from Zander et al, 2011, Clin. Cancer Res 11:3360.
A “good” Biomarker has a great combination of sensitivity and specificity Sensitivity: The fraction of people with the disease that the test correctly identifies as positive. Specificity: The fraction of people without the disease that the test correctly identifies as negative.
Receiver Operating Characteristic (ROC) curve A receiver operating curve (ROC) is a plot of sensitivity versus 1-specificity. The name is derived from its original use in radar technology. The dotted line represents a useless test that has no discriminatory power. The size of the area between the dotted line and the solid line in the ROC curve reflects the ability of a test to discriminate between diseased and non-diseased individuals across the range of potential cut-offs.
Area Under Curve (A.U.C.) as a measure of a Biomarker’s usefulness Large distribution of measured values between the diseased and the non-diseased populations make it easy to choose a cut off value. Small difference in distribution of measured values between the diseased and the non-diseased populations make it very difficult to choose a cut off value. Test value
Serum concentration of endothelial markers Σε μεγαλύτερο αριθμό δειγμάτων μπορέσαμε να ελέγξουμε τα επίπεδα αυτών των 3 μορίων για τα οποία είχαμε τα απαραίτητα εργαλεία να αναπτύξουμε τεχνικές luminex. Και για τα 3 αυτά μόρια οι συγκεντρώσεις του βρέθηκαν αυξημένες σημαντικά σε ορούς καρκινοπαθών σε σχέση με αυτούς υγιών γυναικών. Και εδώ βλέπουμε τα επίπεδα του πιο γνωστού μορ δείκτη του CA125 στα ίδια ακριβώς δείγματα. 81 patients, 46 benign 27 healthy women * - p<0,05; ** - p<0,01; *** - p<0,001.
Validation in an independent group of patients Το πείραμα αυτό επαναλήφθηκε και σε άλλη ομάδα ασθενών από άλλη χώρα και είχαμε παρόμοια αποτελέσματα. 117 patients, 59 benign * - p<0,05; ** - p<0,01; *** - p<0,001.
ROC analysis: better Diagnostics by combination of biomarkers Κι εδώ βλέπουμε ανάλυση ROC των προηγούμενων αποτελεσμάτων. Σύμφωνα με την ανάλυση αυτή ένας τέλειος δείκτηςθα πρέπει να έχει μια καμπύλη εεεετσι, δηλαδή η επιφάνεια κάτω από την καμπύλη να είναι όσο γίνεται πιο κοντά στο 100%. Τότε έχουμε ένα δείκτη με μεγάλη ευαισθησία και ειδικότητα. Όπως βλέπετε οι καμπύλες και των 3 δεικτών είναι συγκρίσιμες αλλά όχι καλύτερες από αυτή του CA125. όμως αν συνδυάσουμε καθένα από τους δείκτες αυτού με το CA125 τότε έχουμε διαγνωστικά τεστ καλύτερα από το CA125 μόνο του, ειδικά για την περίπτωση του DR6.
Receiver Operating Characteristic (ROC) curve The cutoff value is chosen to minimize missing true positives, thus including more false positives. (for example: when consequences of missing a case are serious, e.g. testing potential blood donors for HIV). Cutoff value chosen at statistical mean. Equal number of true positive missed as the number of people diagnosed as false positive. Cutoff value chosen to minimize inclusion of false positives. (for example when treatment is expensive or invasive). Setting the criterion value is a compromise between sensitivity and specificity. Test value
The Kaplan-Meier estimator for the survival function from lifetime data. Used to measure the fraction of patients living for a certain amount of time after treatment. A Kaplan–Meier plot of the survival function is a series of horizontal steps of declining magnitude which, when a large enough sample is taken, approaches the true survival function for the population.
Overview of Multi-omic Approaches Applied in Biomarker Discovery
Overview of Multi-omic Approaches Applied in Biomarker Discovery
Sectors of Biomarkers applications in medicine
Biomarkers in personalized treatment
Qualities of a good Biomarkers in diagnosis and therapy Discrimination of healthy/deseased speciments (sensitivity and specificity). Easily detectable and quantifiable. Simple (not a combination of many factors). Extracellular (secreted) or cell surface. Presence/expression in a restricted tissue, cell type as a result of a certain condition. Minimal influence by irrelevant factors. Consistent behavior within a population.