杨应清 發表於 2023-8-21 00:00:00

MegaCli 监控raid状态

<p>MegaCli<span>是一款管理维护硬件</span><span>RAID</span><span>软件,可以通过它来了解当前</span><span>raid</span><span>卡的所有信息,包括 </span><span>raid</span><span>卡的型号,</span><span>raid</span><span>的阵列类型,</span><span>raid </span><span>上各磁盘状态,等等。通常,我们对硬盘当前的状态不太好确定,一般通过机房人员巡检来完成,有没有通过软件的方式来检查确定这个问题呢。</span><span>MegaCli</span><span>就可以做到,一般通过 </span><span>MegaCli </span><span>的</span>Media Error Count: 0 Other Error Count: 0 这两个数值来确定阵列中磁盘是否有问题;<span>Medai Error Count </span><span>表示磁盘可能错误,可能是磁盘有坏道,这个值不为</span><span>0</span><span>值得注意,数值越大,危险系数越高,</span><span>Other Error Count </span><span>表示磁盘可能存在松动,可能需要重新再插入。</span><span>MegaCli </span><span>可以对阵列中所有的磁盘进行检测,我们可以通过脚本的方式来检测相关参数,从而通知管理人员。</span></p>
<p> </p>
<p>一、 下载<span>MegCli,</span><span>目前针对公司</span><span>DB</span><span>数据库是</span><span>IBM</span><span>的服务器,直接从</span><span>IBM</span><span>官方下载,如果</span></p>
<p>其它服务器的,使用各官方下载或者 <span>LSI</span><span>网站上进行相关下载。一般来说,是通用的。这个包适用</span><span>32 /64</span><span>位操作系统平台 </span></p>
<p>下载地址:ftp://download2.boulder.ibm.com/ecc/sar/CMA/XSA/ibm_utl_sraidmr_megacli-8.00.48_linux_32-64.zip</p>
<p>二、 安装</p>
<p>#unzip ibm_utl_sraidmr_megacli-8.00.48_linux_32-64.zip</p>
<p>#cd linux</p>
<p># rpm -ivh Lib_Utils-1.00-09.noarch.rpm  MegaCli-8.00.48-1.i386.rpm</p>
<p> </p>
<p>三 使用命令及参数</p>
<p>命令使用:<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDInfo -Lall -aALL 查raid级别<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpAllInfo -aALL 查raid卡信息<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDList -aALL 查看硬盘信息<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -aAll 查看电池信息<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -FwTermLog -Dsply -aALL 查看raid卡日志<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -adpCount 【显示适配器个数】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpGetTime –aALL 【显示适配器时间】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpAllInfo -aAll 【显示所有适配器信息】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDInfo -LALL -aAll 【显示所有逻辑磁盘组信息】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDList -aAll 【显示所有的物理信息】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -GetBbuStatus -aALL |grep ‘Charger Status’ 【查看充电状态】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -GetBbuStatus -aALL【显示BBU状态信息】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -GetBbuCapacityInfo -aALL【显示BBU容量信息】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -GetBbuDesignInfo -aALL 【显示BBU设计参数】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -AdpBbuCmd -GetBbuProperties -aALL 【显示当前BBU属性】<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -cfgdsply -aALL 【显示Raid卡型号,Raid设置,Disk相关信息】<br>
3.磁带状态的变化,从拔盘,到插盘的过程中。<br>
Device |Normal|Damage|Rebuild|Normal<br>
Virtual Drive |Optimal|Degraded|Degraded|Optimal<br>
Physical Drive |Online|Failed –&gt; Unconfigured|Rebuild|Online<br>
4.查看磁盘缓存策略<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDGetProp -Cache -L0 -a0<br>
or<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDGetProp -Cache -L1 -a0<br>
or<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDGetProp -Cache -LALL -a0<br>
ro<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDGetProp -Cache -LALL -aALL<br>
ro<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDGetProp -DskCache -LALL -aALL<br>
5.设置磁盘缓存策略<br>
缓存策略解释:</p>
<p>WT (Write through<br>
WB (Write back)<br>
NORA (No read ahead)<br>
RA (Read ahead)<br>
ADRA (Adaptive read ahead)<br>
Cached<br>
Direct<br>
例子:<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDSetProp WT|WB|NORA|RA|ADRA -L0 -a0<br>
or<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDSetProp -Cached|-Direct -L0 -a0<br>
or<br>
enable / disable disk cache<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDSetProp -EnDskCache|-DisDskCache -L0 -a0<br>
6.创建一个 raid5 阵列,由物理盘 2,3,4 构成,该阵列的热备盘是物理盘 5<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -CfgLdAdd -r5 WB Direct -Hsp -a0<br>
7.创建阵列,不指定热备<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -CfgLdAdd -r5 WB Direct -a0<br>
8.删除阵列<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -CfgLdDel -L1 -a0<br>
9.在线添加磁盘<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDRecon -Start -r5 -Add -PhysDrv -L1 -a0<br>
10.阵列创建完后,会有一个初始化同步块的过程,可以看看其进度。<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDInit -ShowProg -LALL -aALL<br>
或者以动态可视化文字界面显示<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDInit -ProgDsply -LALL -aALL<br>
11.查看阵列后台初始化进度<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDBI -ShowProg -LALL -aALL<br>
或者以动态可视化文字界面显示<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -LDBI -ProgDsply -LALL -aALL<br>
12.指定第 5 块盘作为全局热备<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDHSP -Set [-EnclAffinity] [-nonRevertible] -PhysDrv -a0<br>
13.指定为某个阵列的专用热备<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDHSP -Set [-Dedicated [-Array1]] [-EnclAffinity] [-nonRevertible] -PhysDrv -a0<br>
14.删除全局热备<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDHSP -Rmv -PhysDrv -a0<br>
15.将某块物理盘下线/上线<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDOffline -PhysDrv -a0<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDOnline -PhysDrv -a0<br>
16.查看物理磁盘重建进度<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDRbld -ShowProg -PhysDrv -a0<br>
或者以动态可视化文字界面显示<br>
#/opt/MegaRAID/MegaCli/MegaCli64 -PDRbld -ProgDsply -PhysDrv -a0</p>
<p>磁带状态的变化,从拔盘,到插盘的过程中:<br>
Device |Normal|Damage|Rebuild|Normal<br>
Virtual Drive |Optimal|Degraded|Degraded|Optimal<br>
Physical Drive |Online|Failed –&gt; Unconfigured|Rebuild|Online</p>
<p> </p>
<p>1、 查看<span>RAID</span><span>级别</span></p>
<p><img src="https://zhuji.jb51.net/uploads/img/20230517/8ded7f34693ae16a9050706af06e35af.jpg" width="300" height="191"></p>
<p>2、 查看<span>RAID</span><span>卡信息,主要输出</span><span>RAID</span><span>卡的一些配置信息</span></p>
<p><img src="https://zhuji.jb51.net/uploads/img/20230517/ec801bfb73ea4fbe5c78da9b1affcc1d.jpg" width="300" height="176"></p>
<p> </p>
<p>RAID Level : Primary-1, Secondary-0, RAID Level Qualifier-0<br>
Size : 1.086 TB<br>
State : Optimal<br>
Strip Size : 128 KB<br>
Number Of Drives per span:4 //表示每4个物理盘做成一个RAID1盘组<br>
Span Depth : 2 //表示共2个RAID1盘组做成了RAID10</p>
<p>3、 查看所有硬盘的信息</p>
<p>/opt/MegaCli64  -pdlist –aALL</p>
<p><img src="https://zhuji.jb51.net/uploads/img/20230517/83cf96b36add30b04d14b1c30ec8be77.jpg" width="300" height="201"></p>
<p>4、 查看当前<span>raid</span><span>缓存状态,</span>raid<span>缓存状态设置为</span><span>wb</span><span>的话要注意电池放电事宜,设置电池放电模式为自动学习模式</span></p>
<p>/opt/MegaRAID/MegaCli/MegaCli64   -ldgetprop  -dskcache -lall  -aall</p>
<p><img src="https://zhuji.jb51.net/uploads/img/20230517/62bc81c164312fdaae2823731616bb29.jpg" width="300" height="49"></p>
<p>5、 raid <span>电池设置相关</span></p>
<p>查看电池状态信息<span>(</span>Display BBU Status Information)<br>
MegaCli -AdpBbuCmd -GetBbuStatus -aN|-a0,1,2|-aALL<br>
MegaCli -AdpBbuCmd -GetBbuStatus -aALL</p>
<p>查看电池容量(Display BBU Capacity Information)<br>
MegaCli -AdpBbuCmd -GetBbuCapacityInfo -aN|-a0,1,2|-aALL<br>
MegaCli -AdpBbuCmd -GetBbuCapacityInfo –aALL</p>
<p>查看电池设计参数(Display BBU Design Parameters)<br>
MegaCli -AdpBbuCmd -GetBbuDesignInfo -aN|-a0,1,2|-aALL<br>
MegaCli -AdpBbuCmd -GetBbuDesignInfo –aALL</p>
<p>查看电池属性(Display Current BBU Properties)<br>
MegaCli -AdpBbuCmd -GetBbuProperties -aN|-a0,1,2|-aALL<br>
MegaCli -AdpBbuCmd -GetBbuProperties –aALL</p>
<p>设置电池为学习模式为循环模式(Start BBU Learning Cycle)<br>
Description Starts the learning cycle on the BBU.<br>
No parameter is needed for this option.<br>
MegaCli -AdpBbuCmd -BbuLearn -aN|-a0,1,2|-aALL</p>
<p>设置磁盘的缓存模式和访问方式 (Change Virtual Disk Cache and Access Parameters)<br>
Description Allows you to change the following virtual disk parameters:<br>
-WT (Write through), WB (Write back): Selects write policy.<br>
-NORA (No read ahead), RA (Read ahead), ADRA (Adaptive read ahead): Selects read policy.<br>
-Cached, -Direct: Selects cache policy.<br>
-RW, -RO, Blocked: Selects access policy.<br>
-EnDskCache: Enables disk cache.<br>
-DisDskCache: Disables disk cache.<br>
MegaCli -LDSetProp { WT | WB|NORA |RA | ADRA|-Cached|Direct} |<br>
{-RW|RO|Blocked} |<br>
{-Name} |<br>
{-EnDskCache|DisDskCache} –Lx |<br>
-L0,1,2|-Lall -aN|-a0,1,2|-aALL<br>
MegaCli -LDSetProp WT -L0 -a0</p>
<p>显示磁盘缓存和访问方式(Display Virtual Disk Cache and Access Parameters)<br>
MegaCli -LDGetProp -Cache | -Access | -Name | -DskCache -Lx|-L0,1,2|<br>
-Lall -aN|-a0,1,2|-aALL<br>
Displays the cache and access policies of the virtual disk(s):<br>
-WT (Write through), WB (Write back): Selects write policy.<br>
-NORA (No read ahead), RA (Read ahead), ADRA (Adaptive read ahead): Selects read policy.<br>
-Cache, -Cached, Direct: Displays cache policy.<br>
-Access, -RW, -RO, Blocked: Displays access policy.<br>
-DskCache: Displays physical disk cache policy.</p>
<p> </p>
<p>Megaraid 必知必会 使用LSI的megaraid可以对raid进行有效监控。别的厂商比如HP,IBM也有自己的raid API<br>
MegaCli -ldinfo -lall -aall<br>
查询raid级别,磁盘数量,容量,条带大小。<br>
MegaCli -cfgdsply -aALL |grep Policy<br>
查询控制器cache策略<br>
MegaCli -LDSetProp WB -L0 -a0<br>
设置write back功能<br><span>MegaCli -LDSetProp CachedBadBBU -L0 -a0<br>
设置即使电池坏了还是保持WB功能</span><br>
MegaCli -AdpBbuCmd -BbuLearn a0<br>
手动充电<br>
MegaCli -FwTermLog -Dsply -aALL<br>
查询日志</p>
<p>显示适配器个数: MegaCli -adpCount</p>
<p>显示所有适配器信息: MegaCli -AdpAllInfo -aAll<br>
Critical Disks : 0<br>
Failed Disks : 0</p>
<p>显示所有逻辑磁盘组信息: MegaCli -LDInfo -LALL -aAll<br>
显示所有的物理信息: MegaCli -PDList -aAll<br>
Media Error Count: 0<br>
Other Error Count: 0</p>
<p>查看充电状态: MegaCli -AdpBbuCmd -GetBbuStatus -aALL<br>
Learn Cycle Requested : No<br>
Fully Charged : Yes</p>
<p>显示BBU(后备电池)状态信息: MegaCli -AdpBbuCmd -GetBbuStatus -aALL<br>
显示BBU容量信息: MegaCli -AdpBbuCmd -GetBbuCapacityInfo -aALL<br>
显示BBU设计参数: MegaCli -AdpBbuCmd -GetBbuDesignInfo -aALL<br>
显示当前BBU属性: MegaCli -AdpBbuCmd -GetBbuProperties -aALL<br>
显示Raid卡型号,Raid设置,Disk相关信息: MegaCli -cfgdsply -aALL<br>
查看Cache 策略设置: MegaCli -cfgdsply -aALL |grep -i Policy<br>
Current Cache Policy: WriteBack, ReadAheadNone, Direct, Write Cache OK if Bad BBU<br>
查看充电进度百分比: MegaCli -AdpBbuCmd -GetBbuStatus -aALL</p>
<p>各种设备和磁盘的不同状态:<br>
Device |Normal|Damage|Rebuild|Normal<br>
Virtual Drive |Optimal|Degraded|Degraded|Optimal<br>
Physical Drive |Online|Failed –&gt; Unconfigured|Rebuild|Online</p>
<p>6、 通过脚本检测<span>RAID </span><span>磁盘状态</span></p>
<p>Linux<span>下脚本</span></p>
<p> </p><pre class="brush:bash;toolbar:false">#!/bin/bash
#check raid disk status
MEGACLI="/opt/MegaRAID/MegaCli/MegaCli64 "

$MEGACLI -pdlist -aALL  | grep "Firmware state" | awk -F : '{print $2}' | awk -F , '{print $1}' &gt;/tmp/fireware.log

$MEGACLI -pdlist -aALL  | grep -E "Media Error|Other Error" | awk -F : '{print $2}' &gt;/tmp/disk.log

for i in `cat &lt; /tmp/disk.log`
do

if [ $i -ne 0 ]
then
curl "http://xxxxxxB&amp;state=ALARM&amp;description=raid_disk_error"
fi

done

for i in `cat &lt; /tmp/fireware.log`
do

if [ $i !=  Online ]
then
curl "http://xxxxxxstate=ALARM&amp;description=raid_disk_offline"
fi

done</pre><p> </p>
<p> </p>
<p> </p>
<p>Windows <span>下脚本</span></p>
<p>Windows<span>下脚本用的工具是</span><span>gnu for windows</span><span>平台的一些软件,如 </span><span>bash grep awk cat</span></p>
<p>通过<span>bash</span><span>直接调用脚本</span></p>
<p>如:G:\raid_check\unixtools&gt;bash.exe  G:\disk.sh</p>
<p> </p><pre class="brush:bash;toolbar:false">#check raid disk status
MEGACLI="//G/raid_check/MegaCli.exe"
GREP="//G/raid_check/unixtools/grep.exe"
AWK="//G/raid_check/unixtools/awk.exe"
CAT="//G/raid_check/unixtools/cat.exe"
CURL="//G/raid_check/unixtools/curl.exe"

$MEGACLI -pdlist -aALL  | $GREP "Firmware state" |$AWK -F: '{print $2}' |$AWK -F , '{print $1}' &gt;//c/fireware.log

$MEGACLI -pdlist -aALL  | $GREP -E "Media Error|Other Error" | $AWK -F : '{print $2}' &gt; //c/disk.log

for i in `$CAT c:/disk.log`
do

if [ $i -ne 0 ]
then
$CURL "http://xxxxxx&amp;description=raid_disk_error"
fi

done

for i in `$CAT c:/fireware.log`
do
if [ $i != Online ]
then
$CURL "http://xxxxx&amp;state=ALARM&amp;description=raid_disk_offline"
fi</pre><p> </p>
<p> </p>
<p>在UC上看到一篇文章挺详细的,转载下</p>
<p>转自:http://blog.chinaunix.net/uid-25135004-id-3139293.html<br>
本站整理:http://www.ttlsa.com/html/3121.html</p>
頁: [1]
查看完整版本: MegaCli 监控raid状态