I think my disc is failing

I was quite impressed by this - it did what I wanted to do on my PC anyway (although not Humax related).
 
Probably my final update on this thread, as my new 1Tb drive is now fitted.
Here's what I did.

1) Made sure I had the latest copy of the schedule, BUT this turned out (as posted elsewhere) not to have been necessary. It was restored automatically.
2) Remove the drive. It's a very easy dismantle and removal. One cross-head screwdriver is all that was required. The disk is fitted into a single unit that also contains a fan, so take care to remove the fan connector from the main board before pulling the drive unit out.
3) Next is where I had the only problem. I put both old and new drives into a dual external USB caddy, but my laptop did not recognize the drives. I hoped to at least see drive letters but nope - nothing.
4) I installed the Seagate DiscWizard software, as it can easily clone a drive, but this software will not run unless it can identify a Seagate or Maxtor drive in the system, and it would NOT detect my Seagate drives via USB. So, unless I fitted them internally to a desktop machine I was unable to do any cloning. (I thought I had Acronis True Image here but I didn't!)
5) So, next I installed the new drive, untouched, into the Humax, and from here things got very good! Good old hummy identified the drive, told me I needed to format it and after a couple of button clicks off it went. For a 1Tb drive it completed in a couple of minutes, which surprised me.
6) After formatting, used the 'Back' button to return to the main media menu, checked the schedule and everything was there. No need to restore from my backup.
7) Plugged the old drive into a USB caddy and it was recognized, so I just set the media to USB, located the video files, and selected the top menu for copy, used opt+ and set it to copy back to the new drive. It's running now and will probably keep going for a couple of days, though I'm not really sure of this.
8) Connected via the WebIf page and I was given the option to download all the WebIf software again. After letting it finish the download and resetting the Humax, everything was again as before.

End result looking good. I hoped to do a fast disk clone, but this is working fine, provided we don't have any power failures while it runs!

I hope this helps people thinking about replacing their hard drive. I'm pretty sure that this method would also work for updating the drive from 500Gb to 1Tb.

All the best to those who provided valuable information and gave me the confidence to do this. I can now reformat the old drive and see if I can fix any remaining sector errors and keep it as a spare.
Alan.
 
The schedule isn't stored on the drive, only the custom firmware schedule backups.

You probably want to edit your post: for "cross-thread" try "cross-head", and for "filled" try "fitted" (I know I know, predictive spell check!). I will delete this paragraph later.
 
I just got the message below, and I am running full disk check (option 3) from the maintenance menu as suggested

!! WARNING !!

There appear to be some hardware problems with the internal hard disk on this device.

Disk pending sector count is: 1
Disk offline sector count is: 1

Go to disk diagnostics


Having a quick check on the progress ( option 4) I get this message.

[Attached image: screenshot of the disk check progress message]

Does the "Completed: read failure" part mean it is done? It has only been running 20 minutes and I can only hear fan noise.
 
Yes, it's done and has found the bad block. If you run the fix-disk process (option 1) then it should repair it for you.
 
Cheers af. When I tried running option 3 again it said it was running and would finish at 19:50.

I will run option 1
 
Check disk now completed.

I got the previous warning message again when I connected via WebIf and clicked on the "acknowledge current disk faults" button. I now get a message I can't clear. Any advice please?


!! WARNING !!

There appear to be some hardware problems with the internal hard disk on this device.

Disk pending sector count is: 0 (was 1)
Disk offline sector count is: 0 (was 1)

Don't panic; for help, visit wiki.hummy.tv
 
I had the same myself after acknowledging the new count, but when I closed WebIf and logged in again it was gone. For me it only remained gone a short time, which is why I replaced the disk, but for you it might be different. It depends on whether the bad sector count rises again.
 
I just got the message below, and I am running full disk check (option 3) from the maintenance menu as suggested

!! WARNING !!

There appear to be some hardware problems with the internal hard disk on this device.

Disk pending sector count is: 1
Disk offline sector count is: 1

Go to disk diagnostics

I have just got this message again, this time with a count of 3, which seems a little strange as I have not had a count of 2. Besides, the counter was reset to zero after the last occurrence.

The hard disk data is below. If I understand the second block of data correctly, it is consistently failing at the same place (and this is the same place as last time). Does this mean I should not worry and just let the disk error correction deal with the error?

Should I be transferring off all my files and hunting down the receipt, which I believe has six months left?

Code:
Attributes
ID    Name    Flags    Raw Value    Value    Worst    Thresh    Type    Updated    When Failed
1    Raw_Read_Error_Rate    POSR--    98051273    115    098    006    Pre-fail    Always    -
3    Spin_Up_Time    PO----    0    095    095    000    Pre-fail    Always    -
4    Start_Stop_Count    -O--CK    3922    097    097    020    Old_age    Always    -
5    Reallocated_Sector_Ct    PO--CK    3    100    100    036    Pre-fail    Always    -
7    Seek_Error_Rate    POSR--    296689179    084    060    030    Pre-fail    Always    -
9    Power_On_Hours    -O--CK    8065    091    091    000    Old_age    Always    -
10    Spin_Retry_Count    PO--C-    0    100    100    097    Pre-fail    Always    -
12    Power_Cycle_Count    -O--CK    1961    099    099    020    Old_age    Always    -
184    End-to-End_Error    -O--CK    0    100    100    099    Old_age    Always    -
187    Reported_Uncorrect    -O--CK    0    100    100    000    Old_age    Always    -
188    Command_Timeout    -O--CK    0    100    100    000    Old_age    Always    -
189    High_Fly_Writes    -O-RCK    0    100    100    000    Old_age    Always    -
190    Airflow_Temperature_Cel    -O---K    51    049    039    045    Old_age    Always    In_the_past
194    Temperature_Celsius    -O---K    51    051    061    000    Old_age    Always    -
195    Hardware_ECC_Recovered    -O-RC-    98051273    043    028    000    Old_age    Always    -
197    Current_Pending_Sector    -O--C-    0    100    100    000    Old_age    Always    -
198    Offline_Uncorrectable    ----C-    0    100    100    000    Old_age    Offline    -
199    UDMA_CRC_Error_Count    -OSRCK    0    200    200    000    Old_age    Always    -

Self-test logs
No.    Description    Status    Remaining    When    First Error LBA
# 1    Short offline    Completed: read failure    90%    7927    219658
# 2    Extended offline    Completed: read failure    90%    7927    219658
# 3    Extended offline    Completed: read failure    90%    7927    219658
# 4    Extended offline    Completed: read failure    90%    7926    219658
# 5    Short offline    Completed without error    00%    7764    -
# 6    Short offline    Completed without error    00%    7372    -
# 7    Short offline    Completed without error    00%    6295    -
# 8    Short offline    Completed without error    00%    5015    -
# 9    Short offline    Completed without error    00%    4351    -
#10    Short offline    Completed without error    00%    4007    -
#11    Short offline    Completed without error    00%    3972    -
#12    Short offline    Completed without error    00%    2434    -
#13    Short offline    Completed without error    00%    1791    -
#14    Short offline    Completed without error    00%    865    -
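
For anyone wanting to check the "failing at the same place" reading without eyeballing the log, here is a minimal Python sketch. It assumes the self-test rows have been pasted in as plain text, with the First Error LBA as the last field on each row; the function name `failing_lbas` is made up for this illustration and is not part of the custom firmware.

```python
from collections import Counter

# A few self-test rows copied from the log in this post (plain text).
SELF_TEST_LOG = """\
# 1    Short offline    Completed: read failure    90%    7927    219658
# 2    Extended offline    Completed: read failure    90%    7927    219658
# 3    Extended offline    Completed: read failure    90%    7927    219658
# 4    Extended offline    Completed: read failure    90%    7926    219658
# 5    Short offline    Completed without error    00%    7764    -
"""


def failing_lbas(log_text):
    """Count how many failed self-tests reported each First Error LBA."""
    counts = Counter()
    for line in log_text.splitlines():
        tokens = line.split()
        # Only look at log rows, which start with '#'.
        if not tokens or not tokens[0].startswith("#"):
            continue
        lba = tokens[-1]  # last field is the LBA, or '-' when the test passed
        if "failure" in line and lba.isdigit():
            counts[int(lba)] += 1
    return counts


print(failing_lbas(SELF_TEST_LOG))
```

With the rows above, all four failed tests point at LBA 219658, which matches the reading that the disk is failing at one place rather than at scattered locations.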
 
It is not showing any pending sector errors at the moment but it is not clear if the problem at LBA 219658 was fixed. The results in the second block are from the last time that fix-disk or smartctl was run. You could try running fix-disk from CF 2.19 to see if it fixes the problem.

A reallocated sector count of single figures is not really a problem unless it keeps rising. I find that it tends to have a few bad days where it will increase, then it will be stable for quite a long time. Mine was stable at zero for a long time, then increased to about 95 over a few days and was stable for a few months. It then had another burst to its current reading of 148, where it also has been stable for a few months. When this figure gets to over 1000 it is probably time to think about replacing the disk.
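
The rule of thumb above can be written down as a small Python sketch. This is only one reading of the advice in this post, not an official threshold: the 1000 limit and the "single figures" cut-off come from the paragraph above, while the function name and the wording of the returned labels are invented for this example.

```python
def assess_reallocated(history, replace_above=1000):
    """Classify a series of Reallocated_Sector_Ct raw values, oldest first.

    Thresholds follow the rule of thumb in this thread; they are a
    forum heuristic, not a manufacturer specification.
    """
    if not history:
        raise ValueError("need at least one reading")
    latest = history[-1]
    if latest > replace_above:
        return "think about replacing the disk"
    if len(history) > 1 and latest > history[-2]:
        return "rising: keep checking"
    if latest < 10:
        return "single figures and not rising: not really a problem"
    return "stable: keep an eye on it"


# Readings loosely based on the figures quoted in this post.
print(assess_reallocated([0, 3, 3]))
print(assess_reallocated([0, 95, 148, 148]))
print(assess_reallocated([148, 1200]))
```

Feeding it a log of readings taken every few days would show whether the count is in one of its "bad days" bursts or has settled down again.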
 
Thanks xyz123, for the comprehensive reply.

I did run the fix-disk again but it found no issue. I would obviously wish to get the disk replaced under warranty, i.e. before next May. Apart from being a little noisy it seems to work okay. It would be difficult to return the HDR on the basis of CF diagnostics.

I have started to copy everything off to an external drive in case it needs to go back. I plan to reformat the HDD and copy it all back. Do you think it will help (particularly with the disk noise) or am I wasting my time? It is a very slow process.
 
I don't think so. I don't know how to convert the 'when' into a date. But I think the top two lines are yesterday's test in maintenance mode.

I have noticed that the error rate values in ID 1&3 are climbing quite quickly.

Attributes
ID Name Flags Raw Value Value Worst Thresh Type Updated When Failed
1 Raw_Read_Error_Rate POSR-- 77940004 111 098 006 Pre-fail Always -
3 Spin_Up_Time PO---- 0 095 095 000 Pre-fail Always -
4 Start_Stop_Count -O--CK 3926 097 097 020 Old_age Always -
5 Reallocated_Sector_Ct PO--CK 3 100 100 036 Pre-fail Always -
7 Seek_Error_Rate POSR-- 298176378 084 060 030 Pre-fail Always -
9 Power_On_Hours -O--CK 8087 091 091 000 Old_age Always -
10 Spin_Retry_Count PO--C- 0 100 100 097 Pre-fail Always -
12 Power_Cycle_Count -O--CK 1963 099 099 020 Old_age Always -
184 End-to-End_Error -O--CK 0 100 100 099 Old_age Always -
187 Reported_Uncorrect -O--CK 0 100 100 000 Old_age Always -
188 Command_Timeout -O--CK 0 100 100 000 Old_age Always -
189 High_Fly_Writes -O-RCK 0 100 100 000 Old_age Always -
190 Airflow_Temperature_Cel -O---K 49 051 039 045 Old_age Always In_the_past
194 Temperature_Celsius -O---K 49 049 061 000 Old_age Always -
195 Hardware_ECC_Recovered -O-RC- 77940004 045 028 000 Old_age Always -
197 Current_Pending_Sector -O--C- 0 100 100 000 Old_age Always -
198 Offline_Uncorrectable ----C- 0 100 100 000 Old_age Offline -
199 UDMA_CRC_Error_Count -OSRCK 0 200 200 000 Old_age Always -
Self-test logs
No. Description Status Remaining When First Error LBA
# 1 Short offline Completed without error 00% 8065 -
# 2 Short offline Completed without error 00% 8065 -
# 3 Short offline Completed: read failure 90% 7927 219658
# 4 Extended offline Completed: read failure 90% 7927 219658
# 5 Extended offline Completed: read failure 90% 7927 219658
# 6 Extended offline Completed: read failure 90% 7926 219658
# 7 Short offline Completed without error 00% 7764 -
# 8 Short offline Completed without error 00% 7372 -
# 9 Short offline Completed without error 00% 6295 -
#10 Short offline Completed without error 00% 5015 -
#11 Short offline Completed without error 00% 4351 -
#12 Short offline Completed without error 00% 4007 -
#13 Short offline Completed without error 00% 3972 -
#14 Short offline Completed without error 00% 2434 -
#15 Short offline Completed without error 00% 1791 -
#16 Short offline Completed without error 00% 865 -
 
IDs 1 & 3 are not usually associated with "potential indicators of imminent electromechanical failure". Keep an eye on IDs 5, 197 and 198 instead. More info HERE.
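
As a rough illustration of watching just those three attributes, the Python sketch below pulls the raw values for IDs 5, 197 and 198 out of a pasted attribute table. It assumes whitespace-separated rows in the column order shown in this thread (ID, Name, Flags, Raw Value, ...); the function name `watched_raw_values` is hypothetical.

```python
WATCHED_IDS = {5, 197, 198}

# Rows copied from the attribute table posted above.
ATTRIBUTE_ROWS = """\
1 Raw_Read_Error_Rate POSR-- 77940004 111 098 006 Pre-fail Always -
5 Reallocated_Sector_Ct PO--CK 3 100 100 036 Pre-fail Always -
197 Current_Pending_Sector -O--C- 0 100 100 000 Old_age Always -
198 Offline_Uncorrectable ----C- 0 100 100 000 Old_age Offline -
"""


def watched_raw_values(table_text, watched=WATCHED_IDS):
    """Return {attribute name: raw value} for the watched attribute IDs."""
    values = {}
    for line in table_text.splitlines():
        tokens = line.split()
        # Skip headers and anything that is not an attribute row.
        if len(tokens) < 4 or not tokens[0].isdigit():
            continue
        if int(tokens[0]) in watched and tokens[3].isdigit():
            values[tokens[1]] = int(tokens[3])  # 4th column is the raw value
    return values


print(watched_raw_values(ATTRIBUTE_ROWS))
```

For the table above this reports 3 reallocated sectors and zero pending or offline-uncorrectable sectors, while ignoring the large raw figure for attribute 1.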
 
The 'when' values in the self test logs are taken from its count of power on hours. Attribute 9 gives the current power on hours so this would suggest that the test was performed 8087 - 8065 = 22 hours ago. This assumes that it has not been turned off during that time.
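
The same subtraction as a short Python sketch, with an optional step to turn it into an approximate clock time. The function names are invented for this example, the timestamp passed in is an arbitrary illustration, and the result is only right under the assumption already stated above: the box has been powered on continuously since the test.

```python
from datetime import datetime, timedelta


def hours_since_test(power_on_hours, test_when):
    """Hours of powered-on time since a self-test ('When' column vs attribute 9)."""
    if test_when > power_on_hours:
        raise ValueError("test timestamp is later than current power-on hours")
    return power_on_hours - test_when


def approx_test_time(now, power_on_hours, test_when):
    """Approximate wall-clock time of the test, assuming no power-off since."""
    return now - timedelta(hours=hours_since_test(power_on_hours, test_when))


print(hours_since_test(8087, 8065))  # 22

# 'now' here is a made-up example timestamp, not taken from the thread.
print(approx_test_time(datetime(2013, 10, 10, 12, 0), 8087, 8065))
```

If the box spends time in standby with the disk powered down, the real test time will be earlier than this estimate.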
 