# Signals Article: Evaluating the Putative Target Q9Y238 for Non-Small Cell Lung Cancer
Background
The protein associated with the putative target Q9Y238, also known as "Uncharacterized protein," has emerged as a noteworthy candidate for further investigation in the context of non-small cell lung cancer (NSCLC). Preliminary expression profiling studies suggest a potential role in tumor biology, yet the absence of any registered Phase 1 or higher clinical programs indicates a significant gap in therapeutic exploration. This article aims to elucidate the rationale for considering Q9Y238 as a target for further research in NSCLC.Data-mining rationale
The impetus for investigating Q9Y238 arises from a systematic cross-referencing of UniProt's reviewed human entries related to "non-small cell lung cancer" against a comprehensive collection of 494 microarray datasets available in the NCBI Gene Expression Omnibus (GEO). During this analysis, Q9Y238 was identified as a candidate due to its presence in several expression-profiling studies. Notably, our scan revealed that no clinical programs beyond Phase 1 have been registered for this candidate, highlighting an underexplored area in therapeutic development.Why prior analyses may have missed this
Many of the GEO datasets that included Q9Y238 were generated prior to the implementation of modern empirical-Bayes statistical methods, such as the limma package. These earlier analyses often lacked appropriate multiple-testing corrections, which may have led to the underestimation of the significance of Q9Y238 expression changes in NSCLC. Consequently, the potential relevance of this protein in tumor biology may have been overlooked, underscoring the need for a re-evaluation of existing datasets using contemporary analytical techniques.Reasoning for further validation
To substantiate the potential role of Q9Y238 in non-small cell lung cancer, several experimental approaches are suggested: 1. **Re-analyze matched GEO datasets** using the limma package with a Benjamini-Hochberg false discovery rate (FDR) threshold of < 0.05 to identify differentially expressed genes with enhanced statistical rigor. 2. **Validate the top differentially-expressed genes** identified in the re-analysis through quantitative PCR (qPCR) in an independent cohort of NSCLC samples to confirm expression patterns. 3. **Assess tissue specificity** of Q9Y238 expression utilizing resources such as the Genotype-Tissue Expression (GTEx) project and the Human Protein Atlas to determine its relevance in lung tissue versus other tissues. 4. **Explore pathway context** by employing tools like STRING and OmniPath to elucidate potential interactions and biological pathways involving Q9Y238. 5. **If validated**, evaluate the druggability of Q9Y238 through databases such as DGIdb and ChEMBL to assess its potential as a therapeutic target.References
- UniProt. Q9Y238. [UniProt](https://www.uniprot.org/uniprot/Q9Y238)
- NCBI GEO. [GEO Accession](https://www.ncbi.nlm.nih.gov/geo/)
- Limma: Linear Models for Microarray Data. [PMID: 19536218](https://pubmed.ncbi.nlm.nih.gov/19536218/)
- GTEx Project. [GTEx Portal](https://gtexportal.org/home/)
- Human Protein Atlas. [HPA](https://www.proteinatlas.org/)
- STRING Database. [STRING](https://string-db.org/)
- DGIdb. [DGIdb](http://www.dgidb.org/)
- ChEMBL. [ChEMBL](https://www.ebi.ac.uk/chembl/)