钳掌柜龙虾 發表於 2019-9-16 20:07:00

R包对植物进行GO,KEGG注释

<p><span style="background-color: rgba(255, 255, 0, 1)"><strong>1、安装,加载所用到到R包</strong></span></p>
<p>用BiocManager安装,可同时加载依赖包</p>
<p><span style="color: rgba(0, 0, 255, 1)">source("https://bioconductor.org/biocLite.R")</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">BiocManager::install("clusterProfiler")</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">&nbsp;</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">library(clusterProfiler) ##富集分析</span><br><span style="color: rgba(0, 0, 255, 1)">library(topGO)      ###画GO图</span><br><span style="color: rgba(0, 0, 255, 1)">library(AnnotationHub) ##获取数据库</span><br><span style="color: rgba(0, 0, 255, 1)">library(BiocFileCache) ##依赖包</span><br><span style="color: rgba(0, 0, 255, 1)">library(dbplyr) ##依赖包</span><br><span style="color: rgba(0, 0, 255, 1)">library(pathview) ##看KEGG pathway</span></p>
<p><span style="background-color: rgba(255, 255, 0, 1)"><strong>2、利用annotataionHub去抓取目标orgDb</strong></span></p>
<p><span style="color: rgba(0, 0, 255, 1)">ah &lt;- AnnotationHub()&nbsp; ##收索所有orgdb,到ah</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">unique(ah$dataprovider) ##可查看数据注释来源</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">query(ah, "Apis cerana")&nbsp; ##查找目标物种</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">tar_org &lt;- ah[["AH62635"]] ##下载目标物种到org数据</span></p>
<p><span style="background-color: rgba(255, 255, 0, 1)"><strong>3、了解org数据库</strong></span></p>
<p>主要有5个函数</p>
<pre class="line-numberslanguage-go"><code class="language-go"><span class="token function">columns<span class="token punctuation">(x<span class="token punctuation">): 显示当前对象有哪些数据
<span class="token function">keytypes<span class="token punctuation">(x<span class="token punctuation">): 有哪些keytypes可以用作<span class="token keyword">select或keys的keytypes参数
<span class="token function">keys<span class="token punctuation">(x<span class="token punctuation">, keytype<span class="token punctuation">, <span class="token operator">...<span class="token punctuation">):返回当前数据对象的keys
<span class="token keyword">select<span class="token punctuation">(x<span class="token punctuation">, keys<span class="token punctuation">, columns<span class="token punctuation">, keytype<span class="token punctuation">, <span class="token operator">...<span class="token punctuation">):基于keys<span class="token punctuation">, columns和keytype以data<span class="token punctuation">.frame数据类型返回数据,可以是一对多的关系
<span class="token function">mapIds<span class="token punctuation">(x<span class="token punctuation">, keys<span class="token punctuation">, column<span class="token punctuation">, keytype<span class="token punctuation">, <span class="token operator">...<span class="token punctuation">, multiVals<span class="token punctuation">): 类似于<span class="token keyword">select,只不过就返回一个列。</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<p><strong>3.1&nbsp;以symbol形式展示</strong></p>
<p><code class="r functions">head</code><code class="r plain">(</code><code class="r functions">keys</code><code class="r plain">(tar_org,keytype =<span class="Apple-converted-space">&nbsp;</span></code><code class="r string">"SYMBOL"</code><code class="r plain">),30)&nbsp; ##默认为ENTREZID</code></p>
<p><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190915221439238-2116629297.png"></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><strong>3.2、可以查看gene类型</strong></p>
<pre class="line-numberslanguage-csharp"><code class="language-csharp"><span class="token function"><span style="color: rgba(0, 0, 255, 1)">keytypes</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">(tar_org</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">)</span>
<span class="token punctuation">[<span class="token number">1<span class="token punctuation">] <span class="token string">"ARACYC"       <span class="token string">"ARACYCENZYME" <span class="token string">"ENTREZID"   <span class="token string">"ENZYME"       <span class="token string">"EVIDENCE"   <span class="token string">"EVIDENCEALL"<span class="token string">"GENENAME"   
<span class="token punctuation">[<span class="token number">8<span class="token punctuation">] <span class="token string">"GO"         <span class="token string">"GOALL"      <span class="token string">"ONTOLOGY"   <span class="token string">"ONTOLOGYALL"<span class="token string">"PATH"         <span class="token string">"PMID"         <span class="token string">"REFSEQ"      
<span class="token punctuation">[<span class="token number">15<span class="token punctuation">] <span class="token string">"SYMBOL"       <span class="token string">"TAIR" </span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<p><strong>3.3、<code>select</code>则是根据你提供的key值去查找注释数据库,返回你需要的columns信息</strong></p>
<pre class="line-numberslanguage-ruby"><code class="language-ruby"><span class="token operator">&gt; <span style="color: rgba(0, 0, 255, 1)">select</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">(tar_org</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">, keys</span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token string"><span style="color: rgba(0, 0, 255, 1)">"AGO1"</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">, columns</span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">=c</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">(</span><span class="token string"><span style="color: rgba(0, 0, 255, 1)">"TAIR"</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,</span><span class="token string"><span style="color: rgba(0, 0, 255, 1)">"GO"</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">)</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,keytype </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token string"><span style="color: rgba(0, 0, 255, 1)">"SYMBOL"</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">)</span>
<span class="token string">'select()' returned <span class="token number">1<span class="token symbol">:many mapping between keys <span class="token keyword">and columns
   <span class="token constant">SYMBOL      <span class="token constant">TAIR         <span class="token constant">GO <span class="token constant">EVIDENCE <span class="token constant">ONTOLOGY
<span class="token number">1    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0004521      <span class="token constant">IDA       <span class="token constant">MF
<span class="token number">2    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0005515      <span class="token constant">IPI       <span class="token constant">MF
<span class="token number">3    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0005634      <span class="token constant">IDA       <span class="token constant">CC
<span class="token number">4    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0005737      <span class="token constant">IDA       <span class="token constant">CC
<span class="token number">5    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0005737      <span class="token constant">ISM       <span class="token constant">CC
<span class="token number">6    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0005737      <span class="token constant">TAS       <span class="token constant">CC
<span class="token number">7    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0005829      <span class="token constant">IDA       <span class="token constant">CC
<span class="token number">8    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0006306      <span class="token constant">RCA       <span class="token constant">BP
<span class="token number">9    <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0006342      <span class="token constant">RCA       <span class="token constant">BP
<span class="token number">10   <span class="token constant">AGO1 <span class="token constant">AT1G48410 <span class="token constant">GO<span class="token punctuation">:<span class="token number">0006346      <span class="token constant">RCA       <span class="token constant">BP</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<p>⚠️找到差异基因后,必须得确定你得基因号是对应ENTREZID 或者SYMBOL,,是属于哪种类型,若没有符合上述类型,可自行找到NCBI上得数据名称,进行blast更换名字。</p>
<p>&nbsp;</p>
<p><span style="background-color: rgba(255, 255, 0, 1)"><strong>4、进行作图</strong></span></p>
<p>统一将差异基因名字改为ENTREZID,防止在做GO分析的时候出现报错,需要将symbolID转换成ENTREZID:用<code>mapIds</code>函数就可以转换ID。</p>
<pre class="line-numberslanguage-bash"><span style="color: rgba(0, 0, 255, 1)"><code class="language-bash">DEG.entrez_id = mapIds(x = tar_org,               #### 数据库
                     keys = DEG.gene_symbol,    #####差异基因名字
                     keytype = "SYMBOL",      ####差异基因类型是SYMBOL
                     column = "ENTREZID")       #####转换为ENTREZID
</code></span></pre>
<p><code class="language-bash">&nbsp;这时就已经把symbolID转换成ENTREZID了,但会出现个别的转换不成功的情况,就是图中<code>NA</code>的地方,我们进行以下操作即可去掉:</code></p>
<pre class="line-numberslanguage-undefined"><span style="color: rgba(0, 0, 255, 1)"><code class="language-undefined">DEG.entrez_id = na.omit(DEG.entrez_id)</code></span></pre>
<p><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916143122669-1870334654.png"></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><span style="background-color: rgba(255, 255, 0, 1)"><strong>4.1、GO分析代码</strong></span></p>
<p>BP(Biological process)层面上的富集分析:</p>
<pre class="line-numberslanguage-go"><code class="language-go"><span style="color: rgba(0, 0, 255, 1)">erich</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">.</span><span class="token keyword"><span style="color: rgba(0, 0, 255, 1)">go</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">.BP </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token function"><span style="color: rgba(0, 0, 255, 1)">enrichGO</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">(gene </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= DEG</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">.entrez_id</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,      ###差异基因ID
                     OrgDb </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= tar_org</span><span class="token punctuation"><span class="token punctuation"><span class="token punctuation"><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,         ###数据库
                     keyType </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token string"><span style="color: rgba(0, 0, 255, 1)">"ENTREZID"</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,       ##基因ID类型
                     ont </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token string"><span style="color: rgba(0, 0, 255, 1)">"BP"</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,               ##对BP进行GO分析
                     pvalueCutoff </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token number"><span style="color: rgba(0, 0, 255, 1)">0.5</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">,         ###fisher检验对p值
                     qvalueCutoff </span><span class="token operator"><span style="color: rgba(0, 0, 255, 1)">= </span><span class="token number"><span style="color: rgba(0, 0, 255, 1)">0.5</span><span class="token punctuation"><span style="color: rgba(0, 0, 255, 1)">)         ###对p值进行校对对q值,一般大于p值</span><br><br><strong>作图:</strong><br></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<pre class="line-numberslanguage-go"><span style="color: rgba(0, 0, 255, 1)"><code class="language-go"><span class="token function">dotplot<span class="token punctuation">(erich<span class="token punctuation">.<span class="token keyword">go<span class="token punctuation">.BP<span class="token punctuation">)</span></span></span></span></span></span></code></span></pre>
<pre class="line-numberslanguage-go"><code class="language-go"><span class="token punctuation"><span class="token keyword"><span class="token punctuation"><span class="token operator"><span class="token function"><span class="token punctuation"><span class="token operator"><span class="token punctuation"><span class="token punctuation"><span class="token operator"><span class="token punctuation"><span class="token punctuation"><span class="token punctuation"><span class="token punctuation"><span class="token operator"><span class="token string"><span class="token punctuation"><span class="token operator"><span class="token string"><span class="token punctuation"><span class="token operator"><span class="token number"><span class="token punctuation"><span class="token operator"><span class="token number"><span class="token punctuation">解读BP层面富集分析图:<br>横坐标是<code>GeneRatio</code>,意思是说输入进去的基因,它每个term(纵坐标)站整体基因的百分之多少。圆圈的大小代表基因的多少,图中给出了最大的圆圈代表60个基因,圆圈的颜色代表<code>P-value</code>,也就是说<code>P-value</code>越小<code>gene count</code>圈越大,这事就越可信。</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<p><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916143444199-781945114.png"></p>
<p>&nbsp;</p>
<p>也可以画柱形图</p>
<pre class="line-numberslanguage-go"><span style="color: rgba(0, 0, 255, 1)"><code class="language-go"><span class="token function">barplot<span class="token punctuation">(erich<span class="token punctuation">.<span class="token keyword">go<span class="token punctuation">.CC<span class="token punctuation">)</span></span></span></span></span></span></code></span></pre>
<p><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916143540346-424574623.png"></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;一般GO分析画这两个图就可以了,有时也把GO分析画成树形图,可以更加帮助我们理解。</p>
<pre class="line-numberslanguage-css"><span style="color: rgba(0, 0, 255, 1)"><code class="language-css"><span class="token function">plotGOgraph<span class="token punctuation">(erich.go.BP<span class="token punctuation">)</span></span></span></code></span></pre>
<p>树状图很大,所以我们用代码把它存成pdf,学习下如何用代码&nbsp;</p>
<pre class="line-numberslanguage-go"><span style="color: rgba(0, 0, 255, 1)"><code class="language-go"><span class="token function">pdf<span class="token punctuation">(file<span class="token operator">=<span class="token string">"./enrich.go.bp.tree.pdf"<span class="token punctuation">,width <span class="token operator">= <span class="token number">10<span class="token punctuation">,height <span class="token operator">= <span class="token number">15<span class="token punctuation">)
<span class="token function">plotGOgraph<span class="token punctuation">(erich<span class="token punctuation">.<span class="token keyword">go<span class="token punctuation">.BP<span class="token punctuation">)
dev<span class="token punctuation">.<span class="token function">off<span class="token punctuation">(<span class="token punctuation">)</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></span><br>至此,GO分析就做完了 ----&gt; over<br><br><br><strong><span style="background-color: rgba(255, 255, 0, 1)">4.2、KEGG pathway 介绍<br></span></strong>KEGG pathway是最常用的功能注释数据库之一,可以利用KEGG 的API获取一个物种所有基因对应的pathway注释,human对应的API 链接如下<br><br>http://rest.kegg.jp/link/hsa/pathway      人类<br>http://rest.kegg.jp/link/soe/pathway      菠菜</pre>
<pre class="prettyprint"><code class="prism language-r has-numbering">path:hsa00010 hsa:10327
path:hsa00010 hsa:124
path:hsa00010 hsa:125
</code></pre>
<pre class="line-numberslanguage-go">第一列为pathway编号,第二列为基因编号。这里只提供了pathway编号,我们还需要pathway对应的描述信息,同样也可以通过以下API链接得到<br>http://rest.kegg.jp/list/<br>通过该链接可以获得如下内容</pre>
<p>path:map00010 Glycolysis / Gluconeogenesis<br>path:map00020 Citrate cycle (TCA cycle)<br>path:map00030 Pentose phosphate pathway<br>path:map00040 Pentose and glucuronate interconversions<br>path:map00051 Fructose and mannose metabolism</p>
<p><br>第一列为pathway编号,第二列为具体的描述信息。需要注意的是,pathway是一个跨物种的概念,原始的pathway编号为<code>map</code>或者<code>ko</code>加数字,对于特定物种,改成物种对应的三字母缩写, 比如human对应<code>hsa</code>, 所有拥有pathway信息的物种和对应的三字母缩写见如下链接</p>
<p>https://www.genome.jp/kegg/catalog/org_list.html</p>
<p>clusterProfiler也是通过KEGG API去获取物种对应的pathway注释,对于已有pathway注释的物种,我们只需要知道对应的三字母缩写, clusterProfiler就会联网自动获取该物种的pathway注释信息。</p>
<p>和GO富集分析类似,对于KEGG的富集分析也包含以下两种</p>
<ul>
<li>
<p>过表征分析 (over representation analysis, ORA)&nbsp; &nbsp; &nbsp; &nbsp; ###先会筛选,并挑选出我们感兴趣对基因</p>

</li>
<li>
<p>基因富集分析 (gene set enrichment analysis, GSEA)&nbsp; &nbsp; &nbsp; &nbsp;###不进行筛选</p>

</li>

</ul>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;<span style="color: rgba(0, 0, 255, 1)">enrich.KEGG.BP &lt;- enrichKEGG(gene = test_sample,&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; #### 差异基因ID&nbsp; ENTREZID</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">                keyType = "kegg",&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ####key类型</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">                organism = "soe",&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;###物种3字母</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">                pvalueCutoff = 0.05,&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&nbsp;</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; pAdjustMethod = "BH",</span></p>
<p><span style="color: rgba(0, 0, 255, 1)">                 qvalueCutoff = 0.1,)</span></p>
<p>&nbsp;</p>
<p><strong>4.2.1、柱状图</strong></p>
<p><span style="color: rgba(0, 0, 255, 1)">barplot(enrich.KEGG.BP, showCategory = 10)</span></p>
<p><span style="color: rgba(0, 0, 255, 1)"><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916151456177-2042129670.png"></span></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;横轴为该pathway的差异基因个数,纵轴为富集到的pathway的描述信息,<span class="Apple-converted-space">&nbsp;<code>showCategory</code>指定展示的pathway的个数<strong>,默认展示显著富集的top10个</strong>,即<strong>p.adjust最小的10个</strong>。注意的颜色对应p.adjust值,从小到大,对应蓝色到红色。</span></p>
<p>4.2.2、点图</p>
<p><span style="color: rgba(0, 0, 255, 1)">dotplot(enrich.KEGG.BP, showCategory = 10)</span></p>
<p><span style="color: rgba(0, 0, 255, 1)"><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916151800986-335656094.png"></span></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>横轴为GeneRatio, 代表该pathway下的差异基因个数占差异基因总数的比例,纵轴为富集到的pathway的描述信息, showCategory指定展示的pathway的个数,默认展示显著富集的top10个,即<strong>p.adjust最小的10个</strong>。图中点的颜色对应p.adjust的值,从小到大,对应蓝色到红色,大小对应该GO terms下的差异基因个数,个数越多,点越大。<br>&nbsp;</p>
<p><strong>4.2.3、Gene-Concept Network</strong></p>
<p>前面的<span class="Apple-converted-space">&nbsp;<del>两款神器</del><span class="Apple-converted-space">&nbsp;两个函数,都只能展示富集最显著的 GO term,而函数<span class="Apple-converted-space">&nbsp;<strong><code>cnetplot()</code></strong><span class="Apple-converted-space"><strong>&nbsp;</strong>可以将基因与生物学概念 (e.g.* GO terms or KEGG pathways) 的关系绘制成网状图。对于基因和富集的pathways之间的对应关系进行展示,如果一个基因位于一个pathway下,则将该基因与pathway连线,用法如下</span></span></span></span></p>
<pre class="line-numberslanguage-r"><span style="color: rgba(0, 0, 255, 1)"><code class="Rlanguage-r">cnetplot(enrich.KEGG.BP, showCategory = 5)<br><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916152347294-1934508506.png"></code></span></pre>
<p>&nbsp;</p>
<pre class="line-numberslanguage-r"></pre>
<p>&nbsp;图中灰色的点代表基因,黄色的点代表富集到的pathways, 默认画<strong>top5</strong>富集到的pathwayss, pathways节点的大小对应富集到的基因个数。<strong>数字就是基因ID,如果需要更换,可以更换keytype,或者直接在enrich.KEGG.BP 的结果中进行相同ID更换</strong></p>
<pre class="line-numberslanguage-r"><span style="color: rgba(0, 0, 255, 1)"><code class="Rlanguage-r">cnetplot(enrich.KEGG.BP,circular=T,###画为圈图</code></span></pre>
<pre class="line-numberslanguage-r"><span style="color: rgba(0, 0, 255, 1)"><code class="Rlanguage-r">                         colorEdge=T)      ##线条用颜色区分<br><br><br><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916154128328-1495189059.png"></code></span></pre>
<p>&nbsp;</p>
<pre class="line-numberslanguage-r"></pre>
<p>&nbsp;</p>
<p><strong>4.2.4、Enrichment Map</strong></p>
<p>Enrichment Map 可以将富集条目和重叠的基因集整合为一个网络图,相互重叠的基因集则趋向于成簇,从而易于分辨功能模型。对于富集到的pathways之间的基因重叠关系进行展示,如果两个pathway的差异基因存在重叠,说明这两个节点存在overlap关系,在图中用线条连接起来,用法如下</p>
<p><span style="color: rgba(0, 0, 255, 1)">emapplot(enrich.KEGG.BP)</span></p>
<p><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916162653210-1002041459.png"></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>每个节点是一个富集到的pathway, 默认画<strong>top30个</strong>富集到的pathways, 节点大小对应该pathway下富集到的差异基因个数,节点的颜色对应p.adjust的值,从小到大,对应蓝色到红色。</p>
<p>&nbsp;</p>
<p><strong>&nbsp;4.2.5&nbsp;<span class="Apple-converted-space">browseKEGG</span></strong></p>
<p><span style="color: rgba(0, 0, 255, 1)">browseKEGG(enrich.KEGG.BP,"soe00564")&nbsp; &nbsp; ###画出某一特定pathway的图</span></p>
<p><span style="color: rgba(51, 51, 0, 1)">关于KEGG解释,可以查看链接🔗</span></p>
<p><span style="color: rgba(51, 51, 0, 1)">https://www.jianshu.com/p/f90ed1c52079</span></p>
<p>&nbsp;</p>
<p><strong><span style="color: rgba(51, 51, 0, 1)">4.2.6&nbsp;&nbsp;</span><span class="Apple-converted-space"><code>pathview</code><span class="Apple-converted-space">&nbsp;包里的<span class="Apple-converted-space">&nbsp;<del>上帝视角</del><span class="Apple-converted-space">&nbsp;PATHVIEW!</span></span></span></span></strong></p>
<p><span style="color: rgba(0, 0, 255, 1)">pathview(gene.data = test_sample, ##是需要提供的基因向量,默认是Entrez_ID。其由gene.idtype决定</span><br><span style="color: rgba(0, 0, 255, 1)">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;pathway.id = "soe00564",      ###指的是在KEGG中的ID</span><br><span style="color: rgba(0, 0, 255, 1)">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; species = "soe",</span><br><span style="color: rgba(0, 0, 255, 1)">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; kegg.native = TRUE,###默认是TRUE输出完整pathway的png格式文件,反之输出仅是输入的基因列表的pdf文件。</span><br><span style="color: rgba(0, 0, 255, 1)">         )</span></p>
<p>感觉结果和&nbsp;<strong><span class="Apple-converted-space">browseKEGG 差不多</span></strong></p>
<p>&nbsp;</p>
<p><img src="https://img2018.cnblogs.com/blog/1388229/201909/1388229-20190916200623220-1819863216.png"></p>
<p>&nbsp;</p>
<p>持续整理。。。</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>关注下方公众号可获得更多精彩</p>
<p><img src="https://img2020.cnblogs.com/blog/1388229/202003/1388229-20200326151242394-1702163714.jpg"></p>
<p>&nbsp;</p>
<p>参考:https://www.jianshu.com/p/ae94178918bc&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; GO</p>
<p>https://www.jianshu.com/p/47b5ea646932?utm_source=desktop&amp;utm_medium=timeline&nbsp; &nbsp; &nbsp;GO</p>
<p>https://www.cnblogs.com/djx571/p/10271874.html&nbsp; &nbsp; &nbsp; &nbsp;GO</p>
<p>https://blog.csdn.net/weixin_43569478/article/details/83744384&nbsp; &nbsp; &nbsp;KEGG</p>
<p>https://www.jianshu.com/p/f90ed1c52079&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; KEGG</p>
<p>&nbsp;https://www.jianshu.com/p/e133ab3169fa&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; KEGG</p>
<p>&nbsp;</p><br><br>
来源:https://www.cnblogs.com/zhanmaomao/p/11529589.html
頁: [1]
查看完整版本: R包对植物进行GO,KEGG注释