Acute myeloid leukemia (AML) with mutated NPM1 at position 960 are observed in about 10% and 5% of NPM1-mutated AML; other mutations are very rare. Clinically, AML with NPM1 mutations is associated with a higher frequency of extramedullary involvement, which generally accounts for gingival hyperplasia, lymphadenopathy and myeloid sarcoma. In addition, NPM1 mutation status is uniquely associated with the presence of leukemia cutis, the infiltration of the skin by leukemia cells. However, the mechanisms underlying this infiltrative behavior are not yet fully understood. Our previous experimental data showed that the NPM1 mutant promoted migration and invasion of leukemia cells through up-regulation of matrix metalloproteases (MMPs). Notably, epithelial-mesenchymal transition (EMT), characterized by actin cytoskeleton reorganization, increased expression of MMPs and remodeling of the extracellular matrix, plays important roles in tumor invasion and metastasis. Therefore, further studies are needed to elucidate whether an EMT-like process is involved in the invasive phenotype of NPM1-mutated AML.

EMT is classically the process by which epithelial cells transdifferentiate into motile mesenchymal cells, and it is increasingly found to play a vital role in nonepithelial tumors, including hematologic malignancies. The hallmarks of the EMT program are the loss of epithelial markers, such as E-cadherin and ZO-1, and the acquisition of mesenchymal markers, including vimentin, N-cadherin and fibronectin. Intriguingly, it is reported that low expression of was involved in the invasive behavior of MLL-AF9-induced AML cells. These studies support the concept that EMT gene programs play a role in leukemia. Nevertheless, the association between EMT-related genes and NPM1-mutated AML has not yet been analyzed.
The reprogramming of gene expression during EMT is initiated and controlled by numerous signalling pathways that respond to extracellular cues. Among these pathways, transforming growth factor-β (TGF-β) family signalling has a prominent role. In the canonical TGF-β signalling pathway, TGF-β binds to its receptors, and the downstream Sma and Mad related family members 2 and 3 (Smad2/3) are subsequently phosphorylated. Activated Smad2/3 then interact with Smad4 and translocate to the nucleus, which results in the transcriptional activation of EMT-related genes. Several studies have shown that cytoplasmic promyelocytic leukaemia (cPML) appears to favor the phosphorylation of Smad2/3 and acts as an essential modulator of TGF-β signalling. Importantly, cPML could promote TGF-β-associated EMT and invasion in prostate malignancy. Recently, aberrant cytoplasmic localization of PML was observed in NPM1-mutated AML cells, and this PML delocalization was mediated by interaction with the NPM1 mutant, which implies that the NPM1 mutant could be implicated in regulating the expression of EMT-related genes via cPML in AML. In this study, we first identified the dysregulated EMT-related genes in NPM1-mutated AML from three publicly available datasets, and validated significantly reduced the invasive capability of leukemia cells, and further found that high expression of was connected with poor outcome in AML patients. These results provide, for the first time, insights into the involvement of an EMT-related gene in the pathogenesis of NPM1-mutated leukemia, making this protein an interesting target in leukemia.

Materials and Methods

Identification of differentially expressed genes

To identify differentially expressed genes (DEGs), we used three datasets consisting mainly of AML samples with or without NPM1 mutations: The Cancer Genome Atlas (TCGA) dataset (n = 179), the GSE34860 dataset (n = 79) and the GSE6891 dataset (n = 461).
The gene expression data and clinical data for TCGA were downloaded from the TCGA data portal. The gene expression data for GSE34860 and GSE6891 were downloaded from the Gene Expression Omnibus (GEO) website. The DEGs between t< 0.05 and FDR < 0.05 was selected to determine significant differences in gene expression. The numbers of overlapping DEGs among the three datasets were visualized in Venn diagrams.

Patient samples

Peripheral blood samples of 42 recently diagnosed AML patients, including 14 AML

