Common Hardware Fault Examples (Sun)
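1. If the system logs /var/adm/messages* report AFT-type errors for a CPU or memory, report the fault and have the part replaced promptly.
Check that the number of CPUs is correct: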
psrinfo
0       on-line   since 01/26/07 11:22:07
2       on-line   since 01/26/07 11:22:05
16      on-line   since 01/26/07 11:22:07
18      on-line   since 01/26/07 11:22:07
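The CPU count check above can be automated. The following is a minimal sketch, assuming the psrinfo output has already been captured as text. The function names and the expected count of 4 are illustrative assumptions, not part of the original procedure.

```python
import re

# Sample psrinfo output, copied from the listing above.
SAMPLE = """\
0       on-line   since 01/26/07 11:22:07
2       on-line   since 01/26/07 11:22:05
16      on-line   since 01/26/07 11:22:07
18      on-line   since 01/26/07 11:22:07
"""

# One line per processor: <id> <state> since <timestamp>
LINE_RE = re.compile(r"^(\d+)\s+(\S+)\s+since\s+(.+)$")


def parse_psrinfo(text):
    """Return {cpu_id: state} for each processor line in psrinfo output."""
    cpus = {}
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if m:
            cpus[int(m.group(1))] = m.group(2)
    return cpus


def check_cpus(text, expected_count):
    """Return a list of problems: a wrong CPU count or CPUs that are not on-line."""
    cpus = parse_psrinfo(text)
    problems = []
    if len(cpus) != expected_count:
        problems.append(f"expected {expected_count} CPUs, found {len(cpus)}")
    for cpu_id, state in sorted(cpus.items()):
        if state != "on-line":
            problems.append(f"CPU {cpu_id} is {state}")
    return problems


if __name__ == "__main__":
    # expected_count=4 is an assumption for this sample machine.
    print(check_cpus(SAMPLE, expected_count=4))
```

An empty list means the count matches and every listed processor is on-line. Any other result is a reason to look at prtdiag and the system logs.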
Alternatively, /usr/platform/sun4u/sbin/prtdiag -v shows a more detailed view of the system hardware configuration.
prtdiag -v | more
System Configuration: Sun Microsystems sun4u Sun Fire V440
System clock frequency: 177 MHZ
Memory size: 4GB
========================= CPUs ====================================
                E$          CPU                   CPU     Temperature
CPU  Freq       Size        Implementation        Mask    Die  Amb.  Status  Location
---------------------------------------------------------------------
0    1593 MHz   1MB         SUNW,UltraSPARC-IIIi  3.4     -    -     online  -
1    1593 MHz   1MB         SUNW,UltraSPARC-IIIi  3.4     -    -     online  -
(remainder omitted)
A. Example of a CPU error message. In this example, CPU 18 has reported an error:
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 289920 kern.info] NOTICE: [AFT0] UCC Event detected by CPU18 in User mode at TL=0, errID 0x00420f56.380eacb0
Jun 27 17:50:30 v440     AFSR 0x00000400
Jun 27 17:50:30 v440     Fault_PC 0xfe1696a8 Esynd 0x0026
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 173042 kern.info] [AFT0] errID 0x00420f56.380eacb0 Data Bit 19 was in error and corrected
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 832860 kern.info] [AFT2] errID 0x00420f56.380eacb0 PA=0x000000a0.b2532b00
Jun 27 17:50:30 v440     E$tag 0x000004a0.b2400001 E$state_0 Shared
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x00) 0xfe409880.fe40b1e4 0xfe4133f8.fe1ee4a4 ECC 0x03f
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x10) 0xfe409370.fe40b120 0xfe40d77c.fe418850 ECC 0x17a
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x20) 0xfe410fb8.fe40c874 0xfe07bdd0.fe406ad0 ECC 0x1a8
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x30) 0xfe40fe14.fe4104c0 0xfe40df10.fe4052c4 ECC 0x14e
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 929717 kern.info] [AFT2] D$ data not available
Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 335345 kern.info] [AFT2] I$ data not available
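To see which CPU the AFT events point at without reading every line, the event header lines can be extracted and counted. This is a minimal sketch, assuming each syslog message sits on a single line as in /var/adm/messages. The regular expression and function names are illustrative, and the pattern is derived only from the sample above, so other AFT message variants may need adjustments.

```python
import re
from collections import Counter

# Header line of an AFT event: captures the event class, the CPU and the errID.
AFT_RE = re.compile(
    r"\[AFT\d\]\s+(?P<kind>.*?)\s*Event detected by CPU(?P<cpu>\d+)"
    r".*?errID\s+(?P<errid>0x[0-9a-f]+\.[0-9a-f]+)"
)

# Two lines taken from the example above.
SAMPLE = [
    "Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 289920 kern.info] NOTICE: "
    "[AFT0] UCC Event detected by CPU18 in User mode at TL=0, "
    "errID 0x00420f56.380eacb0",
    "Jun 27 17:50:30 v440 SUNW,UltraSPARC-IV: [ID 173042 kern.info] [AFT0] "
    "errID 0x00420f56.380eacb0 Data Bit 19 was in error and corrected",
]


def aft_events(lines):
    """Yield (cpu, kind, errid) for every AFT event header line."""
    for line in lines:
        m = AFT_RE.search(line)
        if m:
            yield int(m.group("cpu")), m.group("kind"), m.group("errid")


def events_per_cpu(lines):
    """Count AFT event headers per detecting CPU."""
    return Counter(cpu for cpu, _, _ in aft_events(lines))


if __name__ == "__main__":
    for cpu, count in events_per_cpu(SAMPLE).items():
        print(f"CPU {cpu}: {count} AFT event(s)")
```

Only the header line of each event is counted. The follow-up lines that share the same errID (AFSR, E$Data and so on) are ignored on purpose, so one hardware event is not counted several times.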
B. Example of a memory error. The log shows that the memory module /N0/SB4/P3/E1J7300 is faulty.
May 14 17:39:20 hdb-lc lw8: [ID 408692 kern.notice] Main, up 153 days 12:05:38, Memory 8,512,064
May 14 21:39:20 hdb-lc lw8: [ID 994892 kern.notice] Main, up 153 days 16:05:38, Memory 8,208,768
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 838864 kern.info] NOTICE: [AFT0] First Error UCC Event detected by CPU19 in User mode at TL=0, errID 0x002f28e2.e95593c0
May 14 22:25:38 hdb-lc     AFSR 0x00000400
May 14 22:25:38 hdb-lc     Fault_PC 0x100fb8e60 Esynd 0x0001 /N0/SB4/P3/E1J7300
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 450664 kern.info] [AFT0] errID 0x002f28e2.e95593c0 Check Bit 0 was in error and corrected
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 248406 kern.info] [AFT2] errID 0x002f28e2.e95593c0 PA=0x00000023.f67bce40
May 14 22:25:38 hdb-lc     E$tag 0x0000008f.d9249049 E$state_1 Shared
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x00) 0xde10a064.80a3e000 0x1240027e.01000000 ECC 0x02a
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x10) 0xd80d601e.80a32005 0x0240015c.80a7202b ECC 0x13a
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x20) 0x124000f5.01000000 0xd2176000.d406a030 ECC 0x186
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x30) 0x80a2400a.1640002d 0x01000000.c65e6180 ECC 0x1e8
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 929717 kern.info] [AFT2] D$ data not available
May 14 22:25:38 hdb-lc SUNW,UltraSPARC-IV: [ID 335345 kern.info] [AFT2] I$ data not available
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 828558 kern.info] NOTICE: [AFT0] UCC Event detected by CPU19 in User mode at TL=0, errID 0x002f28e2.e95593c0
May 14 22:25:49 hdb-lc     AFSR 0x00200400
May 14 22:25:49 hdb-lc     Fault_PC 0x100fb8e60 Esynd 0x0001 /N0/SB4/P3/E1J7300
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 450664 kern.info] [AFT0] errID 0x002f28e2.e95593c0 Check Bit 0 was in error and corrected
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 248406 kern.info] [AFT2] errID 0x002f28e2.e95593c0 PA=0x00000023.f67bce40
May 14 22:25:49 hdb-lc     E$tag 0x0000008f.d9249049 E$state_1 Shared
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x00) 0xde10a064.80a3e000 0x1240027e.01000000 ECC 0x02a
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x10) 0xd80d601e.80a32005 0x0240015c.80a7202b ECC 0x13a
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x20) 0x124000f5.01000000 0xd2176000.d406a030 ECC 0x186
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 895151 kern.info] [AFT2] E$Data (0x30) 0x80a2400a.1640002d 0x01000000.c65e6180 ECC 0x1e8
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 929717 kern.info] [AFT2] D$ data not available
May 14 22:25:49 hdb-lc SUNW,UltraSPARC-IV: [ID 335345 kern.info] [AFT2] I$ data not available
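In the example above, the suspect part is named by the component path that follows the Esynd value. A short sketch for pulling out those paths and counting repeats might look like this. It assumes the path is written as a single token after Esynd, as it appears in this document, and the function name is illustrative.

```python
import re
from collections import Counter

# "Esynd <hex> <component path>"; lines without a path are ignored.
ESYND_RE = re.compile(r"Esynd\s+(0x[0-9a-fA-F]+)\s+(/\S+)")

# Lines taken from the examples above; the first one carries no path.
SAMPLE = [
    "Jun 27 17:50:30 v440     Fault_PC 0xfe1696a8 Esynd 0x0026",
    "May 14 22:25:38 hdb-lc     Fault_PC 0x100fb8e60 Esynd 0x0001 /N0/SB4/P3/E1J7300",
    "May 14 22:25:49 hdb-lc     Fault_PC 0x100fb8e60 Esynd 0x0001 /N0/SB4/P3/E1J7300",
]


def suspect_components(lines):
    """Count how often each component path appears after an Esynd value."""
    counts = Counter()
    for line in lines:
        m = ESYND_RE.search(line)
        if m:
            counts[m.group(2)] += 1
    return counts


if __name__ == "__main__":
    for path, count in suspect_components(SAMPLE).most_common():
        print(f"{path}: {count} event(s)")
```

A path that keeps recurring across events is the natural candidate to quote when reporting the fault.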
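2. Disk errors
a. In the output of the format command, the corresponding disk entry shows "type unknown" or "driver not found".
b. Check the disk information in the output of iostat -En and make sure the Media Error count is 0.
c. For software RAID built with SDS, report the fault promptly if any of the following occurs:
• The metadb output contains lines whose flags begin with an uppercase letter (uppercase flags indicate a replica in an error state).
• In the metastat output, the corresponding RAID volume shows a state other than Okay.
• The system log contains warning messages about metadevices (md).
The following shows normal metadb and metastat output, followed by warning messages from the system log.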
bash-2.03# metadb
        flags           first blk       block count
     a m  p  luo        16              1034            /dev/dsk/c1t0d0s7
     a    p  luo        1050            1034            /dev/dsk/c1t0d0s7
     a    p  luo        2084            1034            /dev/dsk/c1t0d0s7
     a    p  luo        16              1034            /dev/dsk/c1t1d0s7
     a    p  luo        1050            1034            /dev/dsk/c1t1d0s7
     a    p  luo        2084            1034            /dev/dsk/c1t1d0s7
bash-2.03# metastat | more
d0: Mirror
    Submirror 0: d1
      State: Okay
    Submirror 1: d2
      State: Okay
    Pass: 1
    Read option: roundrobin (default)
    Write option: parallel (default)
    Size: 55092864 blocks
d1: Submirror of d0
    State: Okay
    Size: 55092864 blocks
    Stripe 0:
        Device      Start Block  Dbase  State  Hot Spare
        c1t0d0s0    0            No     Okay
d2: Submirror of d0
    State: Okay
    Size: 55092864 blocks
    Stripe 0:
        Device      Start Block  Dbase  State  Hot Spare
        c1t1d0s0    0            No     Okay
Nov 13 20:25:23 v440 md_stripe: [ID 641072 kern.warning] WARNING: md: d32: read error on /dev/dsk/c1t1d0s3
Nov 13 20:25:24 v440 md_mirror: [ID 104909 kern.warning] WARNING: md: d32: /dev/dsk/c1t1d0s3 needs maintenance
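The metadb and metastat checks described above can be sketched as two small text filters. This is only an illustration working on captured command output. The function names are hypothetical, and the faulty rows in the sample (a replica flagged W and a "Needs maintenance" state) are made-up inputs, not output from the machine in this document.

```python
import re

# Hypothetical metadb output: the second replica carries an uppercase flag.
METADB_SAMPLE = """\
        flags           first blk       block count
     a m  p  luo        16              1034            /dev/dsk/c1t0d0s7
      W   p  l          16              1034            /dev/dsk/c1t1d0s7
"""

# Hypothetical metastat output with one submirror needing maintenance.
METASTAT_SAMPLE = """\
d0: Mirror
    Submirror 0: d1
      State: Okay
    Submirror 1: d2
      State: Needs maintenance
d1: Submirror of d0
    State: Okay
"""

STATE_RE = re.compile(r"^\s*State:\s*(.+?)\s*$")
METADEVICE_RE = re.compile(r"^(d\d+):")


def bad_metadb_replicas(text):
    """Return device paths of replicas whose flag field has an uppercase letter."""
    bad = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 4 or not fields[-1].startswith("/dev/"):
            continue  # header or blank line
        # Everything before the two numeric columns and the device is flags.
        flags = "".join(fields[:-3])
        if any(ch.isupper() for ch in flags):
            bad.append(fields[-1])
    return bad


def bad_metastat_states(text):
    """Return (top-level metadevice, state) for each State: line that is not Okay."""
    current = None
    bad = []
    for line in text.splitlines():
        m = METADEVICE_RE.match(line)
        if m:
            current = m.group(1)
            continue
        m = STATE_RE.match(line)
        if m and m.group(1) != "Okay":
            bad.append((current, m.group(1)))
    return bad


if __name__ == "__main__":
    print(bad_metadb_replicas(METADB_SAMPLE))
    print(bad_metastat_states(METASTAT_SAMPLE))
```

The metastat filter reports the top-level block in which a bad state appears, so a failing submirror shows up under its parent mirror. The per-stripe table rows are skipped because they do not start with "State:".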
d. The system logs /var/adm/messages* contain block error messages for a disk.
For example:
Nov 13 20:25:18 v440 scsi: [ID 107833 kern.warning] WARNING: /pci@9,600000/SUNW,qlc@2/fp@0,0/ssd@w500000e0114799c1,0 (ssd0):
Nov 13 20:25:18 v440     Error for Command: read(10)    Error Level: Retryable
Nov 13 20:25:18 v440 scsi: [ID 107833 kern.notice]     Requested Block: 111216288    Error Block: 111216305
Nov 13 20:25:18 v440 scsi: [ID 107833 kern.notice]     Vendor: FUJITSU    Serial Number: 0530C049EM
Nov 13 20:25:18 v440 scsi: [ID 107833 kern.notice]     Sense Key: Media Error
Nov 13 20:25:18 v440 scsi: [ID 107833 kern.notice]     ASC: 0x11 ( 0x1, FRU: 0x0
Nov 13 20:25:19 v440 scsi: [ID 243001 kern.warning] WARNING: /pci@9,600000/SUNW,qlc@2/fp@0,0 (fcp0):
Nov 13 20:25:19 v440     FCP: WWN 0x500000e0114799c1 reset successfully
Nov 13 20:25:19 v440 scsi: [ID 107833 kern.warning] WARNING: /pci@9,600000/SUNW,qlc@2/fp@0,0/ssd@w500000e0114799c1,0 (ssd0):
Nov 13 20:25:19 v440     Error for Command: read(10)    Error Level: Retryable
Nov 13 20:25:19 v440 scsi: [ID 107833 kern.notice]     Requested Block: 111216288    Error Block: 111216305
Nov 13 20:25:19 v440 scsi: [ID 107833 kern.notice]     Vendor: FUJITSU    Serial Number: 0530C049EM
Nov 13 20:25:19 v440 scsi: [ID 107833 kern.notice]     Sense Key: Media Error
Nov 13 20:25:19 v440 scsi: [ID 107833 kern.notice]     ASC: 0x11 ( 0x1, FRU: 0x0
Nov 13 20:25:20 v440 scsi: [ID 243001 kern.warning] WARNING: /pci@9,600000/SUNW,qlc@2/fp@0,0/ssd@w500000e0114799c1,0 (ssd0):
Nov 13 20:25:20 v440     SCSI transport failed: reason 'reset': retrying command
Nov 13 20:25:22 v440 scsi: [ID 107833 kern.warning] WARNING: /pci@9,600000/SUNW,qlc@2/fp@0,0/ssd@w500000e0114799c1,0 (ssd0):
Nov 13 20:25:22 v440     Error for Command: read(10)    Error Level: Retryable
Nov 13 20:25:22 v440 scsi: [ID 107833 kern.notice]     Requested Block: 111216288    Error Block: 111216305
Nov 13 20:25:22 v440 scsi: [ID 107833 kern.notice]     Vendor: FUJITSU    Serial Number: 0530C049EM
Nov 13 20:25:22 v440 scsi: [ID 107833 kern.notice]     Sense Key: Media Error
Nov 13 20:25:22 v440 scsi: [ID 107833 kern.notice]     ASC: 0x11 ( 0x1, FRU: 0x0
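When a log contains many such entries, it helps to group the reported error blocks by device. The sketch below assumes the message layout shown above (a WARNING line naming the device instance, followed by a line with "Error Block:"). The function name is illustrative.

```python
import re
from collections import defaultdict

# WARNING line: device path followed by the driver instance in parentheses.
DEVICE_RE = re.compile(r"WARNING:\s+(/\S+)\s+\((\w+)\):")
BLOCK_RE = re.compile(r"Error Block:\s*(\d+)")

# Lines taken from the example above.
SAMPLE = [
    "Nov 13 20:25:18 v440 scsi: [ID 107833 kern.warning] WARNING: "
    "/pci@9,600000/SUNW,qlc@2/fp@0,0/ssd@w500000e0114799c1,0 (ssd0):",
    "Nov 13 20:25:18 v440     Error for Command: read(10)    Error Level: Retryable",
    "Nov 13 20:25:18 v440 scsi: [ID 107833 kern.notice]     "
    "Requested Block: 111216288    Error Block: 111216305",
    "Nov 13 20:25:19 v440 scsi: [ID 243001 kern.warning] WARNING: "
    "/pci@9,600000/SUNW,qlc@2/fp@0,0 (fcp0):",
    "Nov 13 20:25:19 v440     FCP: WWN 0x500000e0114799c1 reset successfully",
]


def error_blocks_by_device(lines):
    """Map a driver instance (for example ssd0) to the set of reported error blocks."""
    result = defaultdict(set)
    current = None
    for line in lines:
        m = DEVICE_RE.search(line)
        if m:
            current = m.group(2)  # remember which device the next lines belong to
            continue
        m = BLOCK_RE.search(line)
        if m and current is not None:
            result[current].add(int(m.group(1)))
    return dict(result)


if __name__ == "__main__":
    for device, blocks in error_blocks_by_device(SAMPLE).items():
        print(device, sorted(blocks))
```

Repeated retries on the same block, as in this example, collapse into a single entry, which makes it easier to tell one bad block from a disk that is failing in many places.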
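3. Network interface problems
For example, when the network connection drops intermittently, the system logs /var/adm/messages* contain entries like the following: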
Mar 16 23:12:30 v440 genunix: [ID 408789 kern.warning] WARNING: ce0: fault detected external to device; service degraded
Mar 16 23:12:30 v440 genunix: [ID 451854 kern.warning] WARNING: ce0: xcvr addr:0x01 - link down
Mar 16 23:14:06 v440 genunix: [ID 408789 kern.notice] NOTICE: ce0: fault cleared external to device; service available
Mar 16 23:14:06 v440 genunix: [ID 451854 kern.notice] NOTICE: ce0: xcvr addr:0x01 - link up 100 Mbps full duplex
Mar 16 23:14:16 v440 genunix: [ID 408789 kern.warning] WARNING: ce0: fault detected external to device; service degraded
Mar 16 23:14:16 v440 genunix: [ID 451854 kern.warning] WARNING: ce0: xcvr addr:0x01 - link down
Mar 16 23:14:54 v440 genunix: [ID 408789 kern.notice] NOTICE: ce0: fault cleared external to device; service available
Mar 16 23:14:54 v440 genunix: [ID 451854 kern.notice] NOTICE: ce0: xcvr addr:0x01 - link up 100 Mbps full duplex
Mar 16 23:51:39 v440 genunix: [ID 408789 kern.warning] WARNING: ce0: fault detected external to device; service degraded
Mar 16 23:51:39 v440 genunix: [ID 451854 kern.warning] WARNING: ce0: xcvr addr:0x01 - link down
Mar 16 23:53:11 v440 genunix: [ID 408789 kern.notice] NOTICE: ce0: fault cleared external to device; service available
Mar 16 23:53:11 v440 genunix: [ID 451854 kern.notice] NOTICE: ce0: xc
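An intermittent link shows up as repeated "link down" and "link up" pairs for the same interface. A minimal sketch for counting these transitions per interface is given below. The pattern is derived from the ce0 messages above and is an assumption for other drivers, and the function name is illustrative.

```python
import re
from collections import Counter

# Matches "<iface>: xcvr addr:0x.. - link up|down" in WARNING and NOTICE lines.
LINK_RE = re.compile(
    r"(?:WARNING|NOTICE):\s+(\w+):\s+xcvr addr:\s*0x[0-9a-fA-F]+\s*-\s*link (up|down)"
)

# Lines taken from the example above.
SAMPLE = [
    "Mar 16 23:12:30 v440 genunix: [ID 408789 kern.warning] WARNING: ce0: "
    "fault detected external to device; service degraded",
    "Mar 16 23:12:30 v440 genunix: [ID 451854 kern.warning] WARNING: ce0: "
    "xcvr addr:0x01 - link down",
    "Mar 16 23:14:06 v440 genunix: [ID 451854 kern.notice] NOTICE: ce0: "
    "xcvr addr:0x01 - link up 100 Mbps full duplex",
    "Mar 16 23:14:16 v440 genunix: [ID 451854 kern.warning] WARNING: ce0: "
    "xcvr addr:0x01 - link down",
]


def link_transitions(lines):
    """Count (interface, 'up' or 'down') link state changes."""
    counts = Counter()
    for line in lines:
        m = LINK_RE.search(line)
        if m:
            counts[(m.group(1), m.group(2))] += 1
    return counts


if __name__ == "__main__":
    for (iface, state), count in sorted(link_transitions(SAMPLE).items()):
        print(f"{iface} link {state}: {count}")
```

Several down events within a short period, as in the log above, suggest checking the cable, the switch port and the interface itself.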