Biomarker Description
TGx-HDACi and TGx-DDI Biomarker Descriptions
The toxicogenomics (TGx) DNA Damage Induction (DDI) biomarker and development is described in the TGx-DDI Classification tool.
The TGx Histone Deacetylase inhibition (HDACi) biomarker consists of 81 signature genes that identify chemicals with or without the potential to induce HDACi in TK6 cells. The signature was derived using the nearest shrunken centroid (NSC) method and expression profiles from TempO-Seq ( https://www.biospyder.com/ ) human whole transcriptome data from TK6 cells exposed for 4 hours to 20 reference compounds, 10 HDACi and 10 non-HDACi (NHDACi). The signature chemical set was limited to inhibitors of the classical human HDACs (Zn2+ dependent HDACs in classes I, II and IV). The probability that the data for each chemical, at the concentration(s) tested, in a test dataset will match the HDACi or the NHDACi signature is predicted using NSC analysis.Test data were prepared as tab-delimited text files containing log2 ratio expression data, normalized to solvent controls, from TempO-Seq Human Whole Transcriptomes (see `Example Data Files` link under `Classification Tool` option on the menu bar). Each column lists data for the different chemicals and/or concentrations tested for all genes detected by TempO-Seq. In addition to the classification p value, six other data outputs are generated for each column of data. The output data include Heat maps, Dendograms (cluster analysis), Principal Component Analysis (PCA), Fold change data, and Gene Cluster and Chemical Cluster distance data. An online Results table is displayed that has an option to view and download the data (see `Example Data Files` link under `Classification Tool` option on the menu bar).
Classification Process
The development of the TGx-HDACi Biomarker and processes underlying the data analysis are described in Cho et al., 2021 (see Publication page). Briefly, the probability that test data fit the profile of HDACi or NHDACi compounds is determined based on similarity of the test chemical expression profile to either of the reference chemical profiles. The analysis uses the NSC probability analysis (90% cut-off for the probability of HDACi class membership, PCA, and hierarchical clustering). Chemicals that cluster with the HDACi reference compounds in the PCA and hierarchical clustering are classified as HDACi. PCA is performed using the prcomp in R and hierarchical clustering is performed using the hclust function in R using average linkage and Euclidean distance. The NSC method along with the statistical and bioinformatics tools used are described in detail by Tibshirani et al., 2002.
- prcomp
- principal components function in R
- hclust
- hierarchical clustering function in R