臧衛(wèi)東
摘 要 目的: 通過生物信息學(xué)分析乳腺癌中具有自更新能力的乳腺球樣本,挖掘與自更新能力有關(guān)的關(guān)鍵基因,為乳腺癌治療提供基礎(chǔ)和理論依據(jù)。方法:首先通過比較原位乳腺癌樣本(breast cancer, BC)與乳腺癌的乳腺球樣本(mammosphere samples, MS)的mRNA芯片表達(dá)數(shù)據(jù),獲得差異表達(dá)基因(differentially expressed genes,DEGs)。隨后構(gòu)建DEGs的蛋白與蛋白相互作用 (protein-protein interaction, PPI)網(wǎng)絡(luò),并從中篩選出一個高度關(guān)聯(lián)的子網(wǎng)絡(luò),最后對子網(wǎng)絡(luò)進(jìn)行功能富集分析。結(jié)果:MS和BC兩組樣本間共有1 083個DEGs。從這些DEGs構(gòu)建得到的PPI網(wǎng)絡(luò)中,獲得了一個包含49個DEGs的高度關(guān)聯(lián)的子網(wǎng)絡(luò),其中tspo、igf1、fn1 和cdk1為子網(wǎng)絡(luò)的核心基因。結(jié)論:這些核心基因可能是乳腺癌細(xì)胞中與自更新相關(guān)的基因。
關(guān)鍵詞 乳腺癌 乳腺球 自我更新 差異表達(dá)基因 蛋白與蛋白相互作用網(wǎng)絡(luò)
中圖分類號:R737.9 文獻(xiàn)標(biāo)識碼:A 文章編號:1006-1533(2018)01-0076-05
Analysis of critical genes related to self-renewal in the mammosphere model of breast cancer by bioinformatics
ZANG Weidong*
(Shanghai Fengheng Biotechnology Co., Ltd., Shanghai 200240, China)
ABSTRACT Objective: To explore the key genes related to self-renewal in breast cancer by bioinformatics, which may provide a basic theoretical basis for the treatment of breast cancer. Methods: The mRNA microarray data from breast cancer(BC) and mammosphere samples (MS) were compared. The protein-protein interaction (PPI) network of differentially expressed genes (DEGs) was constructed and a highly correlated subnetwork was screened out, and then the functional enrichment analysis was performed on the subnetwork. Results: There were 1 083 DEGs between MS and BC samples. Then the PPI network was constructed based on these DEGs. Subsequently, a highly correlated subnetwork containing 49 DEGs was obtained from the PPI network. Notably, tspo, igf1, fn1 and cdk1 were considered as the core genes of the subnetwork. Conclusion: These core genes may be associated with self-renewal in breast cancer cells.
KEY WORDS breast cancer; mammosphere; self-renewal; differentially expressed genes; protein-protein interaction
network
乳腺癌(breast cancer,BC)是發(fā)生在乳腺腺上皮組織的惡性腫瘤,多發(fā)生于女性,男性僅占1%,全世界每年約有100萬例新發(fā)病例和40萬死亡病例[1]。乳腺并不是維持人體生命活動的重要器官,所以原位乳腺癌并不致命;但癌細(xì)胞轉(zhuǎn)移后,會危及生命。乳腺癌細(xì)胞的一些子細(xì)胞系(如CD44+/CD24-/low細(xì)胞)能抵抗治療并導(dǎo)致癌癥復(fù)發(fā)[2]。CD44+/CD24-/low可以從乳腺癌組織中分離出來并通過體外移植到具備自更新(self-renewal)能力的乳腺球樣本(mammosphere samples,MS)中培養(yǎng)[3]。此外,MS培養(yǎng)可以為BC細(xì)胞的腫瘤誘導(dǎo)亞群的進(jìn)一步表征提供高度適宜的模型[4]。Creighton等[5]對原位乳腺癌樣本和乳腺癌的乳腺球樣本的生物芯片表達(dá)譜數(shù)據(jù)進(jìn)行分析,發(fā)現(xiàn)經(jīng)過傳統(tǒng)治療后殘留的CD44+/CD24-/low在MS樣本中具有高表達(dá)特征。Creighton等[5]認(rèn)為與上皮間充質(zhì)轉(zhuǎn)化(EMT)相關(guān)的靶蛋白或許能夠治療癌細(xì)胞并抑制BC復(fù)發(fā),但能抑制BC復(fù)發(fā)的目標(biāo)基因或蛋白質(zhì)在他們的研究中很少提及。本文利用生物信息學(xué)分析Creighton的基因芯片數(shù)據(jù),嘗試挖掘出與抗癌細(xì)胞治療和復(fù)發(fā)相關(guān)的關(guān)鍵基因,為乳腺癌的相關(guān)研究提供基礎(chǔ)和理論依據(jù)。
1 材料與方法
1.1 表達(dá)譜數(shù)據(jù)獲取
從Gene Expression Omnibus(GEO,http://www. ncbi.nlm.nih.gov/geo/)中選取下載實驗組GSE7515芯片表達(dá)數(shù)據(jù)[5]。此套表達(dá)譜數(shù)據(jù)集共有26個樣本,其中包括11個原位乳腺癌的樣本和15個乳腺癌的乳腺球樣本。該芯片采用Affymetrix Human Genome U133Plus 2.0 Array平臺進(jìn)行檢測。利用Affy軟件包中的GCRMA方法[6]對所有樣本mRNA表達(dá)數(shù)據(jù)進(jìn)行預(yù)處理,并從Probe ID轉(zhuǎn)換Gene Symbol并處理后,得到Gene Symbol對應(yīng)的表達(dá)矩陣,總共獲得19 851個Gene Symbols。endprint