清微 發表於 2021-10-25 10:56:00

MongoDB中如何优雅地删除大量数据

<p data-tool="mdnice编辑器">删除大量数据,无论是在哪种数据库中,都是一个普遍性的需求。除了正常的业务需求,我们需要通过这种方式来为数据库“瘦身”。</p>
<p data-tool="mdnice编辑器">为什么要“瘦身”呢?</p>
<ol data-tool="mdnice编辑器">
<li>
<p>表的数据量到达一定量级后,数据量越大,表的查询性能会越差。</p>
<p>毕竟数据量越大,B+树的层级会越高,需要的IO也会越多。</p>
</li>
<li>
<p>表的数据有冷热之分,将很多无用或很少用到的数据存储在数据库中会消耗数据库的资源。</p>
<p>譬如会占用缓存;会增加备份集的大小,进而影响备份的恢复时间等。</p>
</li>
</ol>
<p data-tool="mdnice编辑器">所以,对于那些无用的数据,我们会定期删除。</p>
<p data-tool="mdnice编辑器">对于那些很少用到的数据,则会定期归档。归档,一般是将数据写入到归档实例或抽取到大数据组件中。归档完毕后,会将对应的数据从原实例中删除。</p>
<p data-tool="mdnice编辑器">一般来说,这种删除操作涉及的数据量都比较大。</p>
<p data-tool="mdnice编辑器">对于这类删除操作,很多开发童鞋的实现就是一个简单的DELETE操作。看上去,简单明了,干净利落。</p>
<p data-tool="mdnice编辑器">但是,这种方式,危害性却极大。</p>
<p data-tool="mdnice编辑器">以 MySQL 为例:</p>
<ul data-tool="mdnice编辑器">
<li>
<p>会造成大事务</p>
<p>大事务会导致主从延迟,而主从延迟又会影响数据库的高可用切换。</p>
</li>
<li>
<p>回滚表空间会不断膨胀</p>
<p>在MySQL 8.0之前,回滚表空间默认是放到系统表空间中,而系统表空间一旦”膨胀“,就不会收缩。</p>
</li>
<li>
<p>锁定的记录多</p>
<p>相对而言,更容易导致锁等待。</p>
</li>
</ul>
<p data-tool="mdnice编辑器">即使是分布式数据库,如TiDB,如果一次删除了大量数据,这批数据在进行Compaction时有可能会触发流控。</p>
<p data-tool="mdnice编辑器">所以,对于线上的大规模删除操作,建议分而治之。具体来说,就是批量删除,每次只删除一部分数据,分多次执行。</p>
<p data-tool="mdnice编辑器">就如何删除大量数据,接下来我们看看MongoDB中的落地方案。</p>
<p data-tool="mdnice编辑器">本文主要包括以下四部分内容。</p>
<ol data-tool="mdnice编辑器">
<li>MongoDB中删除数据的三种方式。</li>
<li>三种方式的执行效率对比。</li>
<li>通过Write Concern规避主从延迟。</li>
<li>删除过程中碰到的Bug。</li>
</ol>
<h1 data-tool="mdnice编辑器"><span class="prefix"><span class="content">MongoDB中删除数据的三种方式</span></span></h1>
<p data-tool="mdnice编辑器">在MongoDB中删除数据,可通过以下三种方式:</p>
<ul data-tool="mdnice编辑器">
<li>
<p>db.collection.remove()</p>
<p>删除单个文档或满足条件的所有文档。</p>
</li>
<li>
<p>db.collection.deleteMany()</p>
<p>删除满足条件的所有文档。</p>
</li>
<li>
<p>db.collection.bulkWrite()</p>
<p>批量操作接口,可执行批量插入、更新、删除操作。</p>
</li>
</ul>
<p data-tool="mdnice编辑器">接下来,对比下这三种方式的执行效率。</p>
<h1 data-tool="mdnice编辑器"><span class="prefix"><span class="content">三种方式的执行效率对比</span></span></h1>
<p data-tool="mdnice编辑器">环境:MongoDB 3.4.4,副本集。</p>
<p data-tool="mdnice编辑器">测试思路:分别使用 remove、deleteMany、bulkWrite 删除 10w 条记录(每批删除 5000 条),交叉执行 5 次。</p>
<h2 data-tool="mdnice编辑器"><span class="prefix"><span class="content">1. remove</span></span></h2>
<pre class="custom" data-tool="mdnice编辑器"><code class="hljs"><span class="hljs-comment">//&nbsp;delete_date是删除条件<br><span class="hljs-keyword">var&nbsp;delete_date&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date(<span class="hljs-string">"2021-01-01T00:00:00.000Z");<br><span class="hljs-comment">//&nbsp;获取程序开始时间<br><span class="hljs-keyword">var&nbsp;start_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br><span class="hljs-comment">//&nbsp;获取满足删除条件的记录数<br>rows&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}}).count()<br>print(<span class="hljs-string">"total&nbsp;rows:",&nbsp;rows);<br><span class="hljs-comment">//&nbsp;定义每批需要删除的记录数<br><span class="hljs-keyword">var&nbsp;batch_num&nbsp;=&nbsp;<span class="hljs-number">5000;<br><span class="hljs-keyword">while&nbsp;(rows&nbsp;&gt;&nbsp;<span class="hljs-number">0)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;rows也可理解为剩余记录数<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;如果剩余记录数小于batch_num,则将剩余记录数赋值给batch_num<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;为什么要怎么做,后面会提到。<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">if&nbsp;(rows&nbsp;&lt;&nbsp;batch_num)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;batch_num&nbsp;=&nbsp;rows;<br>&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;获取满足删除条件的最小的5000个_id(ObjectID)<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">var&nbsp;cursor&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}},&nbsp;{<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).sort({<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).limit(batch_num);<br>&nbsp;&nbsp;&nbsp;&nbsp;rows&nbsp;=&nbsp;rows&nbsp;-&nbsp;batch_num;<br>&nbsp;&nbsp;&nbsp;&nbsp;cursor.forEach(<span class="hljs-function"><span class="hljs-keyword">function&nbsp;(<span class="hljs-params">each_row)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;通过remove删除记录,这里指定了"justOne":&nbsp;true,每次只能删除一条记录。<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;为了避免误删除,这里同时指定了主键和删除条件。<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;db.test_collection.remove({<span class="hljs-string">'_id':&nbsp;each_row[<span class="hljs-string">"_id"],&nbsp;<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-string">'$lt':&nbsp;delete_date}},&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"justOne":&nbsp;<span class="hljs-literal">true,<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-attr">w:&nbsp;<span class="hljs-string">"majority"<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;})<br>&nbsp;&nbsp;&nbsp;&nbsp;});<br>}<br><span class="hljs-comment">//&nbsp;获取程序结束时间<br><span class="hljs-keyword">var&nbsp;end_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br><span class="hljs-comment">//&nbsp;两者的差值,即为程序执行时长<br>print((end_time&nbsp;-&nbsp;start_time)&nbsp;/&nbsp;<span class="hljs-number">1000);<br></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<h2 data-tool="mdnice编辑器"><span class="prefix"><span class="content">2. deleteMany</span></span></h2>
<p data-tool="mdnice编辑器">实例思路同remove类似,只不过会将待删除的_id放到一个数组中,最后再通过deleteMany一次性删除。</p>
<p data-tool="mdnice编辑器">具体代码如下:</p>
<pre class="custom" data-tool="mdnice编辑器"><code class="hljs"><span class="hljs-keyword">var&nbsp;delete_date&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date(<span class="hljs-string">"2021-01-01T00:00:00.000Z");<br><span class="hljs-keyword">var&nbsp;start_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br>rows&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}}).count()<br>print(<span class="hljs-string">"total&nbsp;rows:",&nbsp;rows);<br><span class="hljs-keyword">var&nbsp;batch_num&nbsp;=&nbsp;<span class="hljs-number">5000;<br><span class="hljs-keyword">while&nbsp;(rows&nbsp;&gt;&nbsp;<span class="hljs-number">0)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">if&nbsp;(rows&nbsp;&lt;&nbsp;batch_num)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;batch_num&nbsp;=&nbsp;rows;<br>&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">var&nbsp;cursor&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}},&nbsp;{<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).sort({<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).limit(batch_num);<br>&nbsp;&nbsp;&nbsp;&nbsp;rows&nbsp;=&nbsp;rows&nbsp;-&nbsp;batch_num;<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">var&nbsp;delete_ids&nbsp;=&nbsp;[];<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;将满足条件的主键值放入到数组中。<br>&nbsp;&nbsp;&nbsp;&nbsp;cursor.forEach(<span class="hljs-function"><span class="hljs-keyword">function&nbsp;(<span class="hljs-params">each_row)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;delete_ids.push(each_row[<span class="hljs-string">"_id"]);<br>&nbsp;&nbsp;&nbsp;&nbsp;});<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-comment">//&nbsp;通过deleteMany一次删除5000条记录。<br>&nbsp;&nbsp;&nbsp;&nbsp;db.test_collection.deleteMany({<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">'_id':&nbsp;{<span class="hljs-string">"$in":&nbsp;delete_ids},<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"createTime":&nbsp;{<span class="hljs-string">'$lt':&nbsp;delete_date}<br>&nbsp;&nbsp;&nbsp;&nbsp;},{<span class="hljs-attr">w:&nbsp;<span class="hljs-string">"majority"})<br>}<br><span class="hljs-keyword">var&nbsp;end_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br>print((end_time&nbsp;-&nbsp;start_time)&nbsp;/&nbsp;<span class="hljs-number">1000);<br></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<h2 data-tool="mdnice编辑器"><span class="prefix"><span class="content">3. bulkWrite</span></span></h2>
<p data-tool="mdnice编辑器">实现思路同deleteMany类似,也是将待删除的_id放到一个数组中,最后再调用bulkWrite进行删除。</p>
<p data-tool="mdnice编辑器">具体代码如下:</p>
<pre class="custom" data-tool="mdnice编辑器"><code class="hljs"><span class="hljs-keyword">var&nbsp;delete_date&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date(<span class="hljs-string">"2021-01-01T00:00:00.000Z");<br><span class="hljs-keyword">var&nbsp;start_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br>rows&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}}).count()<br>print(<span class="hljs-string">"total&nbsp;rows:",&nbsp;rows);<br><span class="hljs-keyword">var&nbsp;batch_num&nbsp;=&nbsp;<span class="hljs-number">5000;<br><span class="hljs-keyword">while&nbsp;(rows&nbsp;&gt;&nbsp;<span class="hljs-number">0)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">if&nbsp;(rows&nbsp;&lt;&nbsp;batch_num)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;batch_num&nbsp;=&nbsp;rows;<br>&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">var&nbsp;cursor&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}},&nbsp;{<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).sort({<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).limit(batch_num);<br>&nbsp;&nbsp;&nbsp;&nbsp;rows&nbsp;=&nbsp;rows&nbsp;-&nbsp;batch_num;<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">var&nbsp;delete_ids&nbsp;=&nbsp;[];<br>&nbsp;&nbsp;&nbsp;&nbsp;cursor.forEach(<span class="hljs-function"><span class="hljs-keyword">function&nbsp;(<span class="hljs-params">each_row)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;delete_ids.push(each_row[<span class="hljs-string">"_id"]);<br>&nbsp;&nbsp;&nbsp;&nbsp;});<br>&nbsp;&nbsp;&nbsp;&nbsp;db.test_collection.bulkWrite(<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-attr">deleteMany:&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"filter":&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">'_id':&nbsp;{<span class="hljs-string">"$in":&nbsp;delete_ids},<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"createTime":&nbsp;{<span class="hljs-string">'$lt':&nbsp;delete_date}<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;],<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{<span class="hljs-attr">ordered:&nbsp;<span class="hljs-literal">false},<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{<span class="hljs-attr">writeConcern:&nbsp;{<span class="hljs-attr">w:&nbsp;<span class="hljs-string">"majority",&nbsp;<span class="hljs-attr">wtimeout:&nbsp;<span class="hljs-number">100}}<br>&nbsp;&nbsp;&nbsp;&nbsp;)<br>}<br><span class="hljs-keyword">var&nbsp;end_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br>print((end_time&nbsp;-&nbsp;start_time)&nbsp;/&nbsp;<span class="hljs-number">1000);<br></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<p data-tool="mdnice编辑器">接下来,看看三者的执行效率。</p>
<table>
<thead>
<tr><th>删除方式</th><th>平均执行时间(s)</th><th>第一次</th><th>第二次</th><th>第三次</th><th>第四次</th><th>第五次</th></tr>
</thead>
<tbody>
<tr>
<td>remove</td>
<td>47.341</td>
<td>49.606</td>
<td>48.487</td>
<td>49.314</td>
<td>47.572</td>
<td>41.727</td>
</tr>
<tr>
<td>deleteMany</td>
<td>16.951</td>
<td>16.566</td>
<td>18.669</td>
<td>17.932</td>
<td>18.66</td>
<td>12.928</td>
</tr>
<tr>
<td>bulkWrite</td>
<td>16.476</td>
<td>17.247</td>
<td>14.181</td>
<td>16.151</td>
<td>18.403</td>
<td>16.397</td>
</tr>
</tbody>
</table>
<p data-tool="mdnice编辑器">结合表中的数据,可以看出,</p>
<ol data-tool="mdnice编辑器">
<li>执行最慢的是remove,执行最快的是bulkWrite,前者差不多是后者的 2.79 倍。</li>
<li>deleteMany 和 bulkWrite 的执行效率差不多,但就语法而言,前者比后者简洁。</li>
</ol>
<p data-tool="mdnice编辑器">所以线上如果要删除大量数据,推荐使用 deleteMany + ObjectID 进行批量删除。</p>
<h1 data-tool="mdnice编辑器"><span class="prefix"><span class="content">通过 Write Concern 规避主从延迟</span></span></h1>
<p data-tool="mdnice编辑器">虽然是批量删除,但在MySQL中,如果没控制好节奏,还是很容易导致主从延迟。在MongoDB中,其实也有类似的担忧,不过我们可以通过 Write Concern 进行规避。</p>
<p data-tool="mdnice编辑器">Write Concern,可理解为写安全策略,简单来说,它定义了一个写操作,需要在几个节点上应用(Apply)完,才会给客户端反馈。</p>
<p data-tool="mdnice编辑器">看下面这个原理图。</p>
<p><img src="https://img2020.cnblogs.com/blog/576154/202110/576154-20211025105328379-1640974837.png" alt="" loading="lazy"></p>
<p id="1635130407372">&nbsp;</p>
<p data-tool="mdnice编辑器">图中是一个一主两从的副本集,设置了w: "majority",代表一个写操作,需要等待副本集中绝大多数节点(本例中是两个)应用完,才能给客户端反馈。</p>
<p data-tool="mdnice编辑器">在前面的代码中,无论是remove,deleteMany还是bulkWrite方法,都设置了w: "majority"。</p>
<p data-tool="mdnice编辑器">之所以这样设置,一方面是为了保证数据的安全性,毕竟删除操作能在多个节点落盘,另一方面,还能有效降低批量操作可能导致的主从延迟风险。</p>
<p data-tool="mdnice编辑器">Write Concern的完整语法如下,</p>
<pre class="custom" data-tool="mdnice编辑器"><code class="hljs">{&nbsp;<span class="hljs-attr">w:&nbsp;&lt;value&gt;,&nbsp;j:&nbsp;&lt;boolean&gt;,&nbsp;wtimeout:&nbsp;&lt;number&gt;&nbsp;}<br></span></code></pre>
<p data-tool="mdnice编辑器">其中,</p>
<p data-tool="mdnice编辑器">w:指定节点数或tags。其有如下取值:</p>
<ul data-tool="mdnice编辑器">
<li>
<p>&lt;number&gt;:显式指定节点数量。</p>
<p>设置为0,无需Server端反馈。</p>
<p>设置为1,只需Primary节点反馈。</p>
<p>设置为2,在副本集中,需要一个Primary节点(Primary节点必需)和一个Secondary节点反馈。</p>
<p>需要注意的是,这里的Secondary节点必须是数据节点,可以是隐藏节点、延迟节点或Priority为 0 的节点,但仲裁节点(Arbiter)绝对不行。</p>
<p>一般来说,设置的节点数越多,数据越安全,写入的效率也会越低。</p>
</li>
<li>
<p>majority:副本集大多数节点。</p>
<p>与上面不一样的是,这里的Secondary节点不仅要求是数据节点,它的votes(members.votes)还必须大于0。</p>
</li>
<li>
<p>&lt;custom write concern name&gt;:指定tags。</p>
<p>tag,顾名思义,是给节点打标签。常用于多数据中心部署场景。</p>
<p>如一个集群,有5个节点,跨机房部署。其中3个节点在A机房,另外2个节点在B机房,因为对数据的安全性、一致性要求很高,我们希望写操作至少能在A机房的2个节点落盘,B机房的1个节点落盘。</p>
<p>对于这种个性化的需求,只有通过tags才能实现。</p>
<p>具体使用,可参考:https://docs.mongodb.com/manual/tutorial/configure-replica-set-tag-sets/#configure-custom-write-concern。</p>
</li>
</ul>
<p data-tool="mdnice编辑器">j:是否需要等待对应操作的日志持久化到磁盘中。</p>
<p data-tool="mdnice编辑器">在MongoDB中,一个写操作会涉及到三个动作:更新数据,更新索引,写入oplog,这三个动作要么全部成功,要么全部失败,这也是MongoDB单行事务的由来。</p>
<p data-tool="mdnice编辑器">对于每个写操作,WiredTiger都会记录一条日志到 journal 中。</p>
<p data-tool="mdnice编辑器">日志在写入journal之前,会首先写入到 journal buffer(最大128KB)中。</p>
<p data-tool="mdnice编辑器">Journal buffer会在以下场景持久化到 journal 文件中:</p>
<ul data-tool="mdnice编辑器">
<li>
<p>副本集中,当有操作等待oplog时。</p>
<p>这类操作包括:针对oplog最新位置点的扫描查询;Causally consistent session中的读操作;对于Secondary节点,每次批量应用oplog后。</p>
</li>
<li>
<p>Write Concern 设置了 j: true。</p>
</li>
<li>
<p>每100ms。</p>
<p>由 storage.journal.commitIntervalMs 参数指定。</p>
</li>
<li>
<p>创建新的 journal 文件时。</p>
<p>当 journal 文件的大小达到100MB时会自动创建一个新的journal 文件。</p>
</li>
</ul>
<p data-tool="mdnice编辑器">wtimeout:超时时长,单位ms。</p>
<p data-tool="mdnice编辑器">不设置或设置为0,命令在执行的过程中,如果遇到了锁等待或节点数不满足要求,会一直阻塞。</p>
<p data-tool="mdnice编辑器">如果设置了时间,命令在这个时间内没有执行成功,则会超时报错,具体报错信息如下:</p>
<pre class="custom" data-tool="mdnice编辑器"><code class="hljs">rs:PRIMARY&gt;&nbsp;db.test.insert({<span class="hljs-string">"a":&nbsp;<span class="hljs-number">1},&nbsp;{<span class="hljs-attr">writeConcern:&nbsp;{<span class="hljs-attr">w:&nbsp;<span class="hljs-string">"majority",&nbsp;<span class="hljs-attr">wtimeout:&nbsp;<span class="hljs-number">100}})<br>WriteResult({<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"nInserted":&nbsp;<span class="hljs-number">1,<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"writeConcernError":&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"code":&nbsp;<span class="hljs-number">64,<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"codeName":&nbsp;<span class="hljs-string">"WriteConcernFailed",<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"errInfo":&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"wtimeout":&nbsp;<span class="hljs-literal">true<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;},<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"errmsg":&nbsp;<span class="hljs-string">"waiting&nbsp;for&nbsp;replication&nbsp;timed&nbsp;out"<br>&nbsp;&nbsp;&nbsp;&nbsp;}<br>})<br></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<h1 data-tool="mdnice编辑器"><span class="prefix"><span class="content">删除过程中遇到的Bug</span></span></h1>
<p data-tool="mdnice编辑器">其实,最开始的删除程序是下面这个版本。</p>
<pre class="custom" data-tool="mdnice编辑器"><code class="hljs"><span class="hljs-keyword">var&nbsp;delete_date&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date(<span class="hljs-string">"2021-01-01T00:00:00.000Z");<br><span class="hljs-keyword">var&nbsp;start_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br><span class="hljs-keyword">var&nbsp;batch_num&nbsp;=&nbsp;<span class="hljs-number">5000;<br><span class="hljs-keyword">while&nbsp;(<span class="hljs-number">1&nbsp;==&nbsp;<span class="hljs-number">1)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">var&nbsp;cursor&nbsp;=&nbsp;db.test_collection.find({<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-attr">$lt:&nbsp;delete_date}},&nbsp;{<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).sort({<span class="hljs-string">"_id":&nbsp;<span class="hljs-number">1}).limit(batch_num);<br>&nbsp;&nbsp;&nbsp;&nbsp;delete_ids&nbsp;=&nbsp;[]<br>&nbsp;&nbsp;&nbsp;&nbsp;cursor.forEach(<span class="hljs-function"><span class="hljs-keyword">function&nbsp;(<span class="hljs-params">each_row)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;delete_ids.push(each_row[<span class="hljs-string">"_id"])<br>&nbsp;&nbsp;&nbsp;&nbsp;});<br><br>&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">if&nbsp;(delete_ids.length&nbsp;==&nbsp;<span class="hljs-number">0)&nbsp;{<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-keyword">break;<br>&nbsp;&nbsp;&nbsp;&nbsp;}<br>&nbsp;&nbsp;&nbsp;&nbsp;db.test_collection.deleteMany({<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">'_id':&nbsp;{<span class="hljs-string">"$in":&nbsp;delete_ids},<br>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span class="hljs-string">"createtime":&nbsp;{<span class="hljs-string">'$lt':&nbsp;delete_date}<br>&nbsp;&nbsp;&nbsp;&nbsp;},&nbsp;{<span class="hljs-attr">w:&nbsp;<span class="hljs-string">"majority"})<br>}<br><span class="hljs-keyword">var&nbsp;end_time&nbsp;=&nbsp;<span class="hljs-keyword">new&nbsp;<span class="hljs-built_in">Date();<br>print((end_time&nbsp;-&nbsp;start_time)&nbsp;/&nbsp;<span class="hljs-number">1000);<br></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></code></pre>
<p data-tool="mdnice编辑器">相对于效率对比章节的版本,这个版本的代码简洁不少。</p>
<ol data-tool="mdnice编辑器">
<li>不用额外获取需要删除的记录数。</li>
<li>batch_num在整个执行过程中也是不变的。</li>
</ol>
<p data-tool="mdnice编辑器">但用这个版本在线上删除数据时,发现了一个问题。</p>
<p data-tool="mdnice编辑器">在删除到最后一批时,程序会hang在那里。重试了多次依然如此。分析如下:</p>
<ul data-tool="mdnice编辑器">
<li>
<p>最后一批的文档数小于batch_num时,会出现这个问题。</p>
<p>删除同实例下另外一个集合,也出现了类似的问题。</p>
<p>但在测试环境,删除一个简单的集合却没有复现出来,怀疑这个Bug与线上集合的记录过长有关。</p>
</li>
<li>
<p>cursor只是一个迭代对象,并不是查询结果。基于cursor可以分批返回记录,类似于Python中的迭代器。</p>
<p>最后一批也不是完全没有返回,而是在返回100条之后才hang在那里。</p>
</li>
<li>
<p>不使用sort没有这个问题。</p>
<p>为什么要使用sort呢?这样可保证得到的id是有序且在物理上的存储是相邻的。这样,在执行批量删除操作时,效率也会相对较高。</p>
<p>经过实际测试,当要删除的数据量较大时,使用sort的效率确实比不使用的要高。</p>
<p>如果删除的数据量较小,使不使用sort则没多大区别。</p>
</li>
</ul>
<h1 data-tool="mdnice编辑器"><span class="prefix"><span class="content">总结</span></span></h1>
<p data-tool="mdnice编辑器">从最佳实践的角度出发,无论是在哪种数据库中,如果都删除(更新)大量数据,都建议分而治之,分批执行。</p>
<p data-tool="mdnice编辑器">在MongoDB中,如果要删除大量数据,推荐使用deleteMany + ObjectID进行批量删除。</p>
<p data-tool="mdnice编辑器">为了保证操作的安全性及规避批量操作带来的主从延迟风险,建议在执行删除操作时,将Write Concern设置为w: "majority"。</p>
<h1 data-tool="mdnice编辑器"><span class="prefix"><span class="content">参考</span></span></h1>
<p data-tool="mdnice编辑器"> Journaling&nbsp;</p>
<p data-tool="mdnice编辑器"> Write Concern</p><br><br>
来源:https://www.cnblogs.com/ivictor/p/15457454.html
頁: [1]
查看完整版本: MongoDB中如何优雅地删除大量数据