linux系统下硬盘无法读写,但是服务器上硬盘没有告警,确定故障硬盘的信息

news/2024/12/20 3:07:40/

 

onecli收集FFDC日志:

1、将onecli文件拷贝到本地临时目录下(比如/tmp);

2、 确保onecli文件具有可执行属性,执行命令:chmod +x lnvgy_utl_lxceb_onecli01n-3.2.0_rhel_x86-64.bin;

3、 执行onecli工具(需要用ROOT用户),执行命令:./ lnvgy_utl_lxceb_onecli01n-3.2.0_rhel_x86-64.bin;

4、稍等一段时间,onecli程序会将收集到的信息保存到/var/log/Lenovo_Support 目录下

 

storcli收集RAIDlog方法如下:

1、解压缩到Linux/ tmp文件夹

 

2、安装rpm -Uvh /tmp/Linux/storcli-*

 

3、执行shell命令

 

>>>>>>>>>>>>>>>>>> 

 

/opt/MegaRAID/storcli/storcli64 /c0 show all > show_all.txt

 

/opt/MegaRAID/storcli/storcli64 /c0 show events file=mr_events0.log

 

(Note: provide "mr_events0.log file once you finish with this command)

 

/opt/MegaRAID/storcli/storcli64 /c0 show termlog > termlog.txt

 

/opt/MegaRAID/storcli/storcli64 /c0/bbu show all > bbu_show_all.txt

 

/opt/MegaRAID/storcli/storcli64 /c0/cv show all > cv_show_all.txt

 

/opt/MegaRAID/storcli/storcli64 /c0/dall show all > dall_show_all.txt

 

/opt/MegaRAID/storcli/storcli64 /c0/eall show all > eall_show_all.txt

 

/opt/MegaRAID/storcli/storcli64 /c0/eall/sall show all > sall_show_all.txt

 

/opt/MegaRAID/storcli/storcli64 /c0/vall show all > vall_show_all.txt

 

故障硬盘判断方式:

1:找到有问题的挂载点:

[root@bdp-bat-worker01 ~]# ls /data/

ls: cannot access /data/disk12: Input/output error

disk1 disk10 disk11 disk12 disk2 disk3 disk4 disk5 disk6 disk7 disk8 disk9

2:找到故障盘:

root@bdp-bat-worker01 ~]# df -Th

Filesystem Type Size Used Avail Use% Mounted on

devtmpfs devtmpfs 252G 0 252G 0% /dev

tmpfs tmpfs 252G 96K 252G 1% /dev/shm

tmpfs tmpfs 252G 819M 251G 1% /run

tmpfs tmpfs 252G 0 252G 0% /sys/fs/cgroup

/dev/mapper/centos-root xfs 450G 97G 354G 22% /

/dev/sdm2 xfs 1014M 146M 869M 15% /boot

/dev/sde xfs 7.3T 6.4T 919G 88% /data/disk5

/dev/sdl xfs 7.3T 6.5T 840G 89% /data/disk12

/dev/sdh xfs 7.3T 6.4T 908G 88% /data/disk8

/dev/sdk xfs 7.3T 6.4T 924G 88% /data/disk11

/dev/sdb xfs 7.3T 6.4T 919G 88% /data/disk2

/dev/sdj xfs 7.3T 6.4T 923G 88% /data/disk10

/dev/sdi xfs 7.3T 6.4T 924G 88% /data/disk9

/dev/sdg xfs 7.3T 6.4T 924G 88% /data/disk7

/dev/sda xfs 7.3T 6.4T 912G 88% /data/disk1

/dev/sdm1 vfat 200M 12M 189M 6% /boot/efi

/dev/sdf xfs 7.3T 6.4T 924G 88% /data/disk6

/dev/sdd xfs 7.3T 6.4T 924G 88% /data/disk4

/dev/sdc xfs 7.3T 6.4T 922G 88% /data/disk3

/dev/mapper/centos-home xfs 438G 41M 438G 1% /home

tmpfs tmpfs 51G 0 51G 0% /run/user/0

cm_processes tmpfs 252G 75M 252G 1% /run/cloudera-scm-agent/process

tmpfs tmpfs 51G 0 51G 0% /run/user/1001

3:找到故障盘的信息:

[root@bdp-bat-worker01 ~]# smartctl -all /dev/sdl

smartctl 7.0 2018-12-30 r4883 [x86_64-linux-3.10.0-1127.el7.x86_64] (local build)

Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

 

=======> INVALID ARGUMENT TO -l: l

tlog,N[,RANGE], nvmelog,N,SIZE <=======

 

Use smartctl -h to get a usage summary

 

[root@bdp-bat-worker01 ~]# smartctl --all /dev/sdl

smartctl 7.0 2018-12-30 r4883 [x86_64-linux-3.10.0-1127.el7.x86_64] (local build)

Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

 

=== START OF INFORMATION SECTION ===

Vendor: LENOVO

Product: ST8000NM001A X

Revision: LCBA

Compliance: SPC-5

User Capacity: 8,001,563,222,016 bytes [8.00 TB]

Logical block size: 512 bytes

Physical block size: 4096 bytes

LU is fully provisioned

Rotation Rate: 7200 rpm

Form Factor: 3.5 inches

Logical Unit id: 0x5000c500de2fe497

Serial number: WSD2RTGB0000E1430X0R

Device type: disk

Transport protocol: SAS (SPL-3)

Local Time is: Thu Dec 12 14:18:38 2024 CST

SMART support is: Available - device has SMART capability.

SMART support is: Enabled

Temperature Warning: Enabled

 

=== START OF READ SMART DATA SECTION ===

SMART Health Status: OK

 

Grown defects during certification <not available>

Total blocks reassigned during format <not available>

Total new blocks reassigned = 992

Power on minutes since format <not available>

Current Drive Temperature: 40 C

Drive Trip Temperature: 65 C

 

Manufactured in week 20 of year 2021

Specified cycle count over device lifetime: 50000

Accumulated start-stop cycles: 33

Specified load-unload count over device lifetime: 600000

Accumulated load-unload cycles: 44045

Elements in grown defect list: 1018

 

Error counter log:

           Errors Corrected by Total Correction Gigabytes Total

               ECC rereads/ errors algorithm processed uncorrected

           fast | delayed rewrites corrected invocations [10^9 bytes] errors

read: 0 82 0 82 85 192163.006 1

write: 0 0 100 100 447 89552.595 0

verify: 0 2 7 9 40 583.376 0

 

Non-medium error count: 0

 

No Self-tests have been logged

4:在FFDC中找到盘的序号


http://www.ppmy.cn/news/1556544.html

相关文章

Rk3588 FFmpeg 拉流 RTSP, 硬解码转RGB

RK3588 ,基于FFmpeg, 拉取RTSP,使用 h264_rkmpp 实现硬解码. ⚡️ 传送 ➡️ RK3588, FFmpeg 拉流 RTSP, mpp 硬解码转RGBRk3588 FFmpeg 拉流 RTSP, 硬解码转RGBUbuntu x64 架构, 交叉编译aarch64 FFmpeg mppRK3588 , mpp硬编码rgb, 保存MP4视频文件.</

最新消息!ChatGPT已集成到苹果操作系统!

12月11日&#xff0c;OpenAI宣布ChatGPT将集成到苹果iOS、iPadOS和macOS操作系统中&#xff0c;用户可以直接在这些设备上访问ChatGPT的功能。 通过此次宣布内容来看&#xff0c;ChatGPT不再局限于单独的应用程序&#xff0c;用户可以在苹果设备上更便捷地使用它。这意味着&…

圆梦:借助云开发 CloudBase实现你的游戏开发梦想

最近我发现AI产品在不断涌现新动向&#xff0c;尤其是一些技术巨头推出的创新产品。例如&#xff0c;今天我们要探讨的是腾讯云开发的云开发 CloudBase&#xff0c;如果你之前没有听说过这个名字&#xff0c;那可能还记得腾讯云推出的另一个产品——微搭。没错&#xff0c;Clou…

3D 生成重建034-NerfDiff借助扩散模型直接生成nerf

3D 生成重建034-NerfDiff借助扩散模型直接生成nerf 文章目录 0 论文工作1 论文方法2 实验结果 0 论文工作 感觉这个论文可能能shapE差不多同时期工作&#xff0c;但是shapE是生成任意种类。 本文提出了一种新颖的单图像视图合成方法NerfDiff&#xff0c;该方法利用神经辐射场 …

模板方法模式详解:定义程序骨架与框架设计

目录 1. 什么是模板方法模式2. 为什么需要模板方法模式3. 模板方法模式的结构4. 实现示例5. 钩子方法的使用6. 最佳实践与注意事项 1. 什么是模板方法模式 模板方法模式是一种行为型设计模式&#xff0c;它在一个方法中定义一个算法的骨架&#xff0c;将某些步骤延迟到子类中…

HUGGINFACE NLP-dataset

1 What if my dataset isn’t on the Hub? 1.1 Working with local and remote datasets 1.1.1 supports several common data formats, CSV & TSV csv load_dataset("csv", data_files"my_file.csv") Text files text load_dataset("text&quo…

【Liunx-后端开发软件安装】Liunx安装minio

【Liunx-后端开发软件安装】Liunx安装nginx 使用安装包安装 一、简介 MinIO 是一个高性能的对象存储系统&#xff0c;专为处理大规模非结构化数据而设计。它完全兼容 Amazon S3 API&#xff0c;这使得 MinIO 不仅可以作为本地存储解决方案&#xff0c;还能轻松地与基于云的服务…

金蝶云苍穹踩过的坑(慢慢更新)

IDEA不能用最新版&#xff0c;不然搜不到金蝶的插件。 我用的是2024.1.7/2023.1.7 IDEA里增加金蝶插件库的地址也变了&#xff0c;现在是 https://tool.kingdee.com/kddt/idea-updatePlugins.xml 金蝶云苍穹部署在服务器 MAC本地IDEA调试的时候&#xff0c;登录N次能成功一次…