为什么一些统计上不同表达的基因不是deg ?
1
0
进入编辑模式
Wuschel▴10
@wuschel - 15944
最后一次出现是七周前
胡集镇

在我的转录组学分析中,有一些我感兴趣的基因是显著不同的,但没有差异表达。我不知道我是否真的理解了DEG到底意味着什么,在我的DEG管道参数;

pval_adj_method <- 'BH' pval_cut <- 0.01 l2fc_cut <- 1

我有3个时间过程- 0/12/24小时4个基因型- WT, 3个突变体(aox1/2/3)我将每个突变体与它在给定时间点的WT值进行比较。

aox1_0h-WT_0h aox2_0h-WT_0h aox1_12h-WT_12h

等。

从这个输出中,只有少数组合显示它们是deg

* * Gene1_ * * aox2_0h。WT_0h下调**Gene2-** aox3_0h。WT_0h上调**Gene6_** aox2_12h。WT_12h衰减

然而,一旦我绘制了原始值,我看到其他突变体也统计上与他们的WT不同

这是什么意思?

deg(pval_cut = 0.01 l2fc_cut = 1)是否在给定的时间点比较的均在所有时间课程中比较?

例如,inf基因2,如果aox3被认为是DEG,为什么其他2个突变体不被认为是DEG ?如果有人能好心地解释一下,我将不胜感激。

注意:这些计数数据(dput out put)没有标准化。这个文件是在步骤1:合并数据分析管道的序列复制之前写的。

数据

结构(列表(AGI = C(“Gene_1”,“Gene_2”,“Gene_3”,“Gene_4”,“Gene_5”,“Gene_5”),AOX1_0H_1.TSV = C(12.19671398,215.2944799,1994.4740158,231.5254774,72.57597906),aox1_0h_4.tsv = C(20.483339,309.7008485,24.97567468,692.9926789,363.0050461,123.5888977),aox1_0h_5.tsv = C(12.02356715,99.9827504,11.36592332,276.0989805,142.2411187,58.85580698),aox1_12h_4.tsv = C(15.40599103,72.92196194,19.36412892,376.7248056,269.677545,99.21985896),aox1_12h_5.tsv = C(14.1934073,79.71249677,32.45167316,356.2905953,196.2274429,89.37924571),aox1_12h_6.tsv = C(24.80343931,95.02515421,34.05358939,440.896732,346.8153177,127.4466926),aox1_24h_4.tsv= C(6.184604587,196.0341138,25.18641024,258.9801928,223.9545066,33.84484312),aox1_24h_6.tsv = C(6.883819431,98.05429052,2.021393634,112.1958862,139.6212486,28.75295204),aox2_0h_3.tsv = C(25.53690512,175.1041763,28.87826692,459.5438353,433.4289707,137.3774675),AOX2_0H_4.TSV = C(10.18876629,280.2141447,32.48869835,624.7432893,323.5662966,87.89508861),aox2_0h_5.tsv = C(12.34945421,312.5272061,28.36719904,848.7432271,469.8874526,158.9474865),aox2_12h_1.tsv = C(19.08287299,137.9990764,29.84418565,377.4153307,314.9721922,95.42395784),aox2_12h_2.tsv格式= C(7.890962701,74.2399241,15.29254776,236.4643073,135.7776899,26.49812298),aox2_12h_3.tsv = C(3.678221008,62.60983209,11.67625132,200.3733542,100.7079901,31.6012183),aox2_24h_1.tsv = C(11.2162246,149.1030715,30.70364462,321.8824458,234.0832479,67.24515005),aox2_24h_4.tsv = C(13.99519719,902.6355571,41.48997933,501.303002,772.8283795,350.2515435),aox2_24h_5.tsv = C(7.867480669,361.1331651,14.63979526,218.9941277,330.2108122,147.5707818),aox3_0h_1.tsv = C(14.15880435,381.2255777,40.53415811,456.0047815,317.8303883,159.6022776),aox3_0h_2.t​​sv = C(16.7311523,291.2105347,27.30607326,339.5042008,179.0730805,61.18757328),aox3_0h_3.tsv = C(19.09795927,293.2638492,25.89886385,398.8993785,185.8289419,70.22345994),aox3_12h_1.tsv = C(81.20410372,541.3370048,85.56280843,974.0674911,836.3820935,353.6488805),aox3_12h_2.t​​sv = C(29.20942766,248.4582068,51.52438237,486.5817019,390.8069314,151.3860219),aox3_12h_3.tsv = C(46.13590166,273.2940355,63.7297075,503.9319503,395.8325311,157.8115329),aox3_24h_1.tsv = C(34.29811675,386.5204438,72.86154515,693.3497618,570.2158898,194.3375745),aox3_24h_2.t​​sv = C(6.882795111,92.6003284,26.22967721,228.5226723,120.5788092,49.8811201),aox3_24h_3.tsv = C(18.1408733,196.4513931,40.71817893,375.4835311,334.5830825,97.88584005),WT_0h_1.tsv = C(18.68220076,114.5750048,9.829645971,220.5843393,174.4842611,94.2738265),WT_0h_2.t​​sv = C(13.60937828,73.97033656,17.038227,214.8653415,155.5073735,100.5135226),WT_0h_3.tsv = C(16.96217952,76.57929551,16.07674964,141.6508369,145.1300591,73.15117509),WT_12h_2.t​​sv = C(5.05511416,11.33088644,4.189101647,44.84100697,44.55982506,18.2843175),WT_12h_3.tsv = C(0.990461611,0.949660867,1.699219964,15.91825777,15.35309945,9.38446604),WT_12h_5.tsv = C(5.933614952,46.86761146,10.29591467,122.7757985,69.69893277,56.94217399),WT_24h_1.tsv = C(4.029020182,31.21885208,10.11987433,100.5907921,76.64674385,37.26256017),WT_24h_4.tsv = C(3.054100241,102.1107272,18.4440106,121.8360378,154.6688737,44.27986257),WT_24h_5.tsv = C(8.133354312,97.5821868,10.70295122,176.7748047,164.02099,24.53833578)),类= C(“spec_tbl_df”,“tbl_df”,“tbl”,“data.frame”),row.names = c(na,-6l),spec =结构(列表(cols = list(agi = struct(list(),类 = c("collector_character", "collector")), aox1_0h_1.tsv = structure(list(), class = c("collector_double", "collector")), aox1_0h_4.tsv = structure(list(), class = c("collector_double", "collector")), aox1_0h_5.tsv = structure(list(), class = c("collector_double", "collector")), aox1_12h_4.tsv = structure(list(), class = c("collector_double", "collector")), aox1_12h_5.tsv = structure(list(), class = c("collector_double", "collector")), aox1_12h_6.tsv = structure(list(), class = c("collector_double", "collector")), aox1_24h_4.tsv = structure(list(), class = c("collector_double", "collector")), aox1_24h_6.tsv = structure(list(), class = c("collector_double", "collector")), aox2_0h_3.tsv = structure(list(), class = c("collector_double", "collector")), aox2_0h_4.tsv = structure(list(), class = c("collector_double", "collector")), aox2_0h_5.tsv = structure(list(), class = c("collector_double", "collector")), aox2_12h_1.tsv = structure(list(), class = c("collector_double", "collector")), aox2_12h_2.tsv = structure(list(), class = c("collector_double", "collector")), aox2_12h_3.tsv = structure(list(), class = c("collector_double", "collector")), aox2_24h_1.tsv = structure(list(), class = c("collector_double", "collector")), aox2_24h_4.tsv = structure(list(), class = c("collector_double", "collector")), aox2_24h_5.tsv = structure(list(), class = c("collector_double", "collector")), aox3_0h_1.tsv = structure(list(), class = c("collector_double", "collector")), aox3_0h_2.tsv = structure(list(), class = c("collector_double", "collector")), aox3_0h_3.tsv = structure(list(), class = c("collector_double", "collector")), aox3_12h_1.tsv = structure(list(), class = c("collector_double", "collector")), aox3_12h_2.tsv = structure(list(), class = c("collector_double", "collector")), aox3_12h_3.tsv = structure(list(), class = c("collector_double", "collector")), aox3_24h_1.tsv = structure(list(), class = c("collector_double", "collector")), aox3_24h_2.tsv = structure(list(), class = c("collector_double", "collector")), aox3_24h_3.tsv = structure(list(), class = c("collector_double", "collector")), WT_0h_1.tsv = structure(list(), class = c("collector_double", "collector")), WT_0h_2.tsv = structure(list(), class = c("collector_double", "collector")), WT_0h_3.tsv = structure(list(), class = c("collector_double", "collector")), WT_12h_2.tsv = structure(list(), class = c("collector_double", "collector")), WT_12h_3.tsv = structure(list(), class = c("collector_double", "collector")), WT_12h_5.tsv = structure(list(), class = c("collector_double", "collector")), WT_24h_1.tsv = structure(list(), class = c("collector_double", "collector")), WT_24h_4.tsv = structure(list(), class = c("collector_double", "collector")), WT_24h_5.tsv = structure(list(), class = c("collector_double", "collector"))), default = structure(list(), class = c("collector_guess", "collector")), skip = 1), class = "col_spec"))

4个基因型- WT, 3个突变体(aox1/ aox2/ aox3)

############## #数据分析管道#如下管道:链接https://github.com/wyguo/ThreeDRNAseq/blob/master/vignettes/user_manuals/3D_RNA-seq_command_line_user_manual.md # #具体步骤# run-tximport genes_counts < - txi_genes数美元  ##----->> 参数3 d分析pval_adj_method < -“黑洞”pval_cut < - 0.05 l2fc_cut < - 1 deltaPS_cut < - 0.1 DAS_pval_method <野生 ' ##----->> 两两对比的方法groups contrast <- c('aox1_0h-WT_0h', 'aox2_0h-WT_0h', 'aox1_12h-WT_12h', 'aox3_12h-WT_12h', 'aox3_12h-WT_12h', 'aox1_24h-WT_24h', 'aox2_24h-WT_24h', 'aox3_24h-WT_24h') # step2: DE基因# batch<- condition2design(condition = metadata$treat, batch. if) <- condition2design(condition = metadata$treat, batch. if) <- condition2design(condition = metadata$treat, batch. if)effect = NULL) #批量if(DE_pipeline == 'glmQL'){##----->> edgeR glmQL pipeline genes_3D_stat <- edgeR。管道(dge = genes_dge, design = design, deltaPS = NULL, contrast = contrast, diffAS = F, method = 'glmQL',调整。pval_adj_method)} ##保存结果DDD。数据genes_3D_stat < - genes_3D_stat美元  ################################################################################ ##----->> 摘要DE基因DE_genes < - summaryDEtarget(统计= genes_3D_stat DE美元。截断= c。pval = pval_cut log2FC = l2fc_cut DDD))。数据DE_genes < - DE_genes美元
<编辑> # < /编辑>
条块图库(tidyverse) df <- read_csv("data/Selected_Count_BR2.csv") longdata <- df %>% gather(key, value, -AGI) %>% separate(key, c("基因型","时间","复制"))library(plotrix) longdata2 <- longdata %>% group_by(AGI,基因型,时间)%>% summarise_each(funs(mean,sd,std.error)) %>% ungroup() %>% mutate(Time = factor(Time),基因型(基因型))=因素colnames (longdata2) longdata2美元基因型< -因子(longdata2基因型,美元水平= c(“WT”、“aox1”,“aox2”、“aox3”))#策划longdata2 % > % ggplot (aes (x = y = value_mean,填补=基因型))+ geom_bar(位置= position_dodge(),统计=“身份”)+ geom_errorbar (aes (ymin = value_mean - value_std。+ scale_fill_manual(values=c("#008000","#B8860B","#4169E1","#DC143C"))+ facet_wrap(~ AGI,scale = "free_y")

Picture1

RNASeqDataDEGreport刨边机•145次观点
添加评论
3.
进入编辑模式
@gordon-smyth
最后一次露面是3小时前
威奇,墨尔本,澳大利亚

我理解为什么你可能想要使用像threedrnaseq这样的闪亮应用程序工具,但使用包装包而不是使用底层的Bioconductor包直接限制了我们的帮助能力。如果你不理解ThreeDRNAseq的输出,那么你真的需要通过他们的github站点向该包的作者发送问题。

我是edgeR包的作者,但没有任何办法帮你。我没有看到任何edgeR函数在任何代码中显示在你的问题。我不知道你做了什么分析,看到了什么成果,甚至你的问题是什么。

如果你对直接尝试edgeR感兴趣,请看这里的工作流示例:

https://www.biocumon.org/packages/release/workflows/vignettes/rnaseqgeneedgerql/inst/doc/edgerql.html.

0
进入编辑模式

非常感谢,教授。

添加回复

登录然后再加上你的答案。

流量:过去一小时内访问了339个用户

使用本网站即表示接受我们的用户协议和隐私政策

由的2.3.6版本