Disc dying ???

antipodean

Active Member
I suspect my hard disc is dying.

Symptoms are the machine suddenly has started locking up, seems like every 15-30 minutes.. In some cases the linux software keeps running and can be accessed across the network, other times the whole thing locks up.

Does this sound reasonable?

From the custom software disc diagnostics ...


Disk Information
SMART Status PASSED
Model Family Seagate Pipeline HD 5900.2
Device Model ST3500312CS
Serial Number 9VV51LC3
LU WWN Device Id 5 000c50 02d7f0f75
Firmware Version SC13
User Capacity 500,107,862,016 bytes [500 GB]
Sector Size 512 bytes logical/physical
Rotation Rate 5900 rpm
Device is In smartctl database [for details use: -P show]
ATA Version is ATA8-ACS T13/1699-D revision 4
SATA Version is SATA 2.6, 3.0 Gb/s (current: 3.0 Gb/s)
Local Time is Thu Oct 6 22:55:22 2016 BST
SMART support is Available - device has SMART capability.
SMART support is Enabled
Attributes
ID Name Flags Raw Value Value Worst Threshold Life Left Notes
1 Raw_Read_Error_Rate POSR-- 68071868 114 099 006 -
3 Spin_Up_Time PO---- 0 097 097 000 -
4 Start_Stop_Count -O--CK 4288 096 096 020 95% -
5 Reallocated_Sector_Ct PO--CK 0 100 100 036 100% -
7 Seek_Error_Rate POSR-- 557347578 087 060 030 -
9 Power_On_Hours -O--CK 41120 054 054 000 54% -
10 Spin_Retry_Count PO--C- 0 100 100 097 100% -
12 Power_Cycle_Count -O--CK 2144 098 098 020 98% -
184 End-to-End_Error -O--CK 0 100 100 099 -
187 Reported_Uncorrect -O--CK 0 100 100 000 -
188 Command_Timeout -O--CK 0 100 100 000 -
189 High_Fly_Writes -O-RCK 315 001 001 000 -
190 Airflow_Temperature_Cel -O---K 53 047 (53°C) 037 (63°C) 045 (55°C) In_the_past
194 Temperature_Celsius -O---K 53 053 063 000 -
195 Hardware_ECC_Recovered -O-RC- 68071868 049 034 000 -
197 Current_Pending_Sector -O--C- 0 100 100 000 -
198 Offline_Uncorrectable ----C- 0 100 100 000 -
199 UDMA_CRC_Error_Count -OSRCK 0 200 200 000 -
 
How about unplugging the disc. Would that help diagnose the problem and perhaps eliminate the HDD?
 
It would, and that is one of the diagnostic steps in the link posted (presuming you mean the "disk" rather than the "disc" - I'm not sure where you would find one of those in a 'FOX!).
 
I see no evidence of disk problems.

Oh, that is a shame, I thought it might be a quick way out of fixing a crashing machine ... :(

My initial thought had been either a power supply problem, or the video chip is dying, the way it keeps locking up.
 
One of mine periodically does something similar - locks up for no apparent reason. I haven't got to the bottom of the problem yet. Mine isn't disk either though.
 
It could be a problem with the TSR buffer. You can reset that easily enough by tuning to a data channel (e.g. 200) and running the tsr/reset diagnostic. Otherwise follow Black Hole's guide.
 
Both of my boxes do this from time to time.
I've managed to reduce the occurrence by ensuring I do a warm reset
at least once a day.
When it locks sometimes a remote control will allow warm cycling
sometimes I need the front panel and other times I have to power
off at the back (then run e2fsck -c)
I guess it could be any number of things but if I had to put money
on it I'd say a memory leak or perhaps buffer overflow somewhere.

That being said I did read somewhere that the front panels were
a bit unreliable themselves.
 
Back
Top