Tuesday, December 20, 2011

Mac drive diagnostics: TechTools Pro and Drive Genius find problems OS X missed

I've been having suspicious application crashes lately. In the Mac world, that suggests a hardware problem. (Once upon a time software was the cause, but these days it's hardware.)

I knew from some backup issues that I had 3 unreadable files. That suggests my 24+ month old 1TB iMac drive is dying youngish. It was time for some diagnostics, so I plugged in my old Apple Keyboard and mouse. (Most non-trivial diagnostic work requires a wired keyboard and mouse; Apple's bluetooth keyboard/mouse drivers may be unavailable when needed.)

After I deleted the bad (non-critical happily) files I ran Disk Utility - but the drive passed. Then I ran my Apple Hardware Test - extended, and loop mode. Still no problems.

I didn't believe it. Something had to be wrong. So I checked out Disk Warrior, Disk Genius and TechTool Pro - 3 reputable diagnostic apps. They're all $100. Disk Warrior has a good reputation, but Disk Genius has a trial version. It found about 58 bad blocks -- out of 1.8 billion. That seems a modest number, but DG said I needed to replace the drive. (Incidentally, Disk Genius has a built in uninstall feature -- very nice. Yes, the Mac needs an OS level uninstaller.)

I decided to get a second opinion. Andy M clued me to a MacUpdate bundle, so I got TechTool Pro 6 for $50 (plust a bunch of other apps I don't care about). It one-upped Disk Genius; as it found bad blocks it told me which of them had files (none in this case).

TTP also found bad blocks - 56 (so two less than Disk Genius, but I don't make much of that either way).

I wasn't sure what to make of this. After all, 56 out of 1.8 billion is minuscule. Unfortunately, a modern SATA drive shouldn't have any bad blocks. The excellent TTP manual explains why ...
... TechTool Pro should not normally report bad blocks for these types of drives. The drive controller in them automatically tries to map out bad blocks as they are encountered. It will do this unless either the bad block is in a critical area that cannot be mapped out at the moment or the bad block table is full. If this occurs, TechTool Pro will report a bad block and you will ultimately need to do a low level reinitialization of the drive. When the drive is reinitialized, the entire platter is accessible so that bad blocks can be mapped out if possible no matter where they occur...

.. You can use Apple's Disk Utility to reinitialize your drive. Be sure to choose the Security Option to "zero out data." Choosing this option will map out bad blocks, if possible, during the reinitialization. This may take several hours (depending on the size of your drive). If the reinitialization is successful, the drive should be fine at that point. We suggest, however, that you do a Surface Scan a few times in the next month or two just to be sure no new bad blocks are developing. If they are, then the drive is probably failing and you should consider replacing it. If a low level reinitialization fails, this indicates the drive is faulty and needs to be replaced...
So the bad blocks I see now are probably a small fraction of the number that have already been mapped out. I'm seeing the overflow, including blocks that went bad after they'd been written to.

My i5 iMac is 24 months and 2 weeks old - so it's past even my AMEX extended warranty (by two weeks!). If the drive were user serviceable (like my old G5 iMac!) I'd simply replace it. Since it's not a user serviceable I'll probably bring a new drive and the machine to FirstTech in Minneapolis for a $200 24 hour turnaround replacement. I'll make a bootable clone before I do that. (My usual Carbon Copy Cloner backups are to an encrypted image for offsite transfer, so not bootable.)

If it doesn't I'll try the reinitialization. (See Update - this drive is on death row.)

Update 12/21/2011: Various notes and reflections the day after ...
  • 16 months is a short lifespan for a hard drive. I bought this machine early in its lifecycle, I wonder if there will be more failures in this product line.
  • Modern drives don't write to bad blocks. Based on the dates of the files that were involved the involved blocks went bad in the past month. That fits with Carbon Copy Cloner not complaining until recently. (See my backup issue post for a twist to this story.)
  • I'm glad I bought TechTools Pro - I think I'll get good use of it. From what I know now though, I didn't really need it. In a modern drive a single bad block in a file, especially a relatively recently written file, means replacement. Carbon Copy Cloner told me 3 files were bad.
  • The TTP manual suggests reformatting. I suspect that might work if there was an initial formatting problem, but in this case I know existing blocks are going bad. This drive is on death row.
  • Carbon Copy Cloner complained about bad sectors in files during backup, but Time Machine didn't. That may be because Time Machine only reads files that have changed?
  • Most of the bad sectors are in unused parts of the drive. I suspect they were randomly distributed but were hidden by the drive OS as they were discovered in the parts of the drive that have been used (about 500GB of 1TB).
  • Drive Utility and Hardware Test didn't find any problems, but both Drive Genius 3 and Tech Tools Pro failed the drive and Carbon Copy Cloner complained too. The SMART diagnostics still pass the drive - even today! I'm a bit surprised; this isn't rocket science. Apple could do better. TechTools Pro gives more SMART diagnostics than Disk Utility -- my drive was complaining about heat (it's cold in this room!)
  • It's clearly worth running TechTools Pro or equivalent drive scan on a new drive then every few months. (11/23/11: TTP just crashed during a routine drive scan. I'm not impressed.)
  • I think Windows scandisk/chkdsk are superior diagnostic tools to Disk Utility.
  • TechTools Pro DVD includes an image for burning a PPC DVD. Nice touch. I still have an old PPC.
  • It's good to know the drive is dying before it dies. I have time to do extra backups and to move selected files to other machines -- including my Aperture and iPhoto Libraries and perhaps iTunes.
  • When you consider that iMac 27" hard drives are NOT user serviceable, the iMac is more expensive than it seems. The iMac G5 was entirely user serviceable. Design has its price.
  • Since I know the drive is dying I've disconnected my clone backup. It's my known good repository. I'll take it to my office then create a new clone, then disconnect that clone. I won't be saving data to this machine, I'll be treating it as a "guest" machine until I get it serviced. I may turn off Time Machine too.
  • OWC (Other World Computing - great Mac shop) showed me how to find my Model Identifier (System Profiler), it's iMac11,1. I can only go to 2TB of storage. I'm not confident that their options are correct however.
See also:

Backups - why you need two methods and abundant paranoia

I can't say I feel good about my backups. I believe data wants to die; it wants to be free of the burden of order. Against the despair of data, even the best backup is barely adequate.

Consider tonight, when everything almost failed - Time Capsule and Carbon Copy Cloner alike.

The Time Capsule serves all the machines in our home over a wireless network. I was surprised at first that backup would work over wireless, but it does. Each machine has its own unencrypted disk image; one on the TC's old internal 500 GB drive, two others have images on an external 2TB drive. The TC sits in a closet upstairs;  it's unlikely to be stolen but fire would destroy it. I have done 1-2 file Time Machine restores from that image, so I know it can work. The only test of a backup, of course, is a restore.

I don't trust Time Machine as much as old-time DantzRetrospect, but it seems Apple has gotten most of the bugs out.

I trust Carbon Copy Cloner [3] more. Each day it clones my server, on which all the important data lives. It's more than a cloner; CCC keeps copies of changed or deleted files in "_CCC Archives". I've configured CCC to use an encrypted image it automatically mounts every night. Since that backup is encrypted I can take it offsite, which I do every few weeks. Ok, every month or two. Offsite rotation relies on me, so it's prone to failure. Still, even if the house burns, I am unlikely to lose more than a month of images and videos. I can live with that.

So I have two backup methods, both fully automated, both relatively independent [2]. If each is 95% reliable each day, then the chance both fail on a given day is 1/400. If the daily chance of a server drive failure is 1/1000, the odds of all three failing on the same day are about 1/400,000 [2], [4]

Tonight though, my data got within a few miles of the cliff it wants to meet.

My server has been having worrisome memory exception (EXC_BAD_ACCESS) crashes, and a TV show I  recently downloaded had a file error [1]. There's something wrong on my 2yo i5 iMac; I need to run Apple Hardware Test (again). So I know my server data is at risk.

Time Capsule has had problems too -- it's reporting a "communications error" periodically. I think that error message is  a scarlet herring related to the iMac issues, but clearly I can't trust that backup.

Happily there's good old CCC -- but when I restarted my server for the first time in weeks it reported a problem. The backup drive didn't mount. That was easy to diagnose -- I'd unplugged it. Probably when I was debugging my Aperture crash 3 weeks ago. Why didn't CCC report the error? Maybe it had crashed.

I wasn't that close to data loss -- but I was in a bad neighborhood. As paranoid as I am, I'm almost not paranoid enough.

It's good to have two fully automatic and completely independent backup methods. Data wants to die, and backup is still an unsolved problem.

-fn-

[1] Incidentally, you can't easily report a purchase problem to Apple until they process a charge, and to reduce transaction costs they wait a few days before they process. This is very annoying! Also, the UI for reporting a purchase problem is suspiciously clumsy. More on that experience when I see what they do.
[2] In reality they common failure points of course - me, computer memory, etc. There is the older offsite backup though, so complete and total data loss is probably less than 1/1,000,000.
[3] Donationware. I donated. I wish donation ware apps would let us set a 'reminder' so I could donate yearly. I suppose I should just make donationware donations every year on my birthday against the apps I use.
[4] I'd love to have automated offsite backup too, but I've never foundan offsite vendor I trusted and I expect ISPs to eventually charge for bandwidth use.

See also:

Update 12/21/2011: I was closer to the cliff than I realized.

Tuesday, December 13, 2011

Twitter to WordPress via ifttt - limitations

Weeks after Google's Day of Infamy i'm still failing to fully replace Google Reader Shares. Recently I gave up on Tumblr, Posterous, the zombie version of Google Reader, and some screen-scraping attempts to turn G+ streams into feeds.

Lately I've been focusing on my @jgordonshare tweets and tonight I tried using ifttt to create a WordPress feed-equipped archive of tweets.

It was easy to setup the ifttt task to turn the tweets into WP posts. I used a "1 button install" Dreamhost [1] WordPress instance I've been testing. I had to turn on XML-RPC publishing (used by Windows Live Writer, MarsEdit, etc) and provide a WordPress username and password [2].

The ifttt doesn't trigger immediately after tweet creation. I assume it checks the Twitter stream every 15-30 minutes. I manually triggered a check from the ifttt dashboard.

Here's an example of what I got

Just testing iftt tweeting to wp (sorry). http://t.co/GbR8Qdud

... Just testing iftt tweeting to wp (sorry). http://t.co/GbR8Qdud...

Yeah,  not to impressive. The problem is a tweet is simply a string, it has no special structure, no way to distinguish URL from my commentary from page title from annotation (not that there's room for all that). Tweets are much simpler entities than old-style Google Reader shares.

The experiment did work, but the result isn't terribly interesting.

So the quest goes on ...

[1] Use the code "KATEVA" or this link and you are supposed to get 50% off your 1st year costs and I get an equal saving as credit.
[2] Obviously you should create a user for this purpose and create a unique password. IFTTT has to know your credentials.

See also:

Friday, December 09, 2011

How to learn what your current AT&T mobile contracted services are

AT&T has the lowest customer service rating of American mobile phone companies. Of course that's like asking what's worse - Ebola or Rabies?

This is not because of their retail staff. They must give them powerful drugs, because, despite working for a moderately evil corporation, they're remarkably cheerful.

Their web site though, that's part of what makes them "Rabies" rather than just "Ebola".

For example, for the past few days I've been trying to follow up on some extensive bill slashing changes. In particular I've been trying to find a current contract summary for our family plan. I think I've found the best that ATT offers, but they have one of the worst web sites I've ever come across. [1]

As of Nov 2011 try this. Don't click on the tabs, but mouse over to see the substructures. Note you may have to authenticate repeatedly.

  • Go to your AT&T mobile account page. Look at the top menu structure. It will say myAT&T, with "tabs" like "Overview", "Bill & Payments" and so on. Depending on the services you use some are not useful, but they will still appear. The tabs that are useful for a mobile-only customer are
    • Bill & Payments: see current bill statement
    • Wireless: usage and recent activity
    • Profile: user information
  • From user information look for "Contract Information". Click Customer Service Summary and Contract. Now you get a popup window. In there you find several options including two that, despite their names, both show a similar PDF (these links may actually work as shortcuts once you're authenticated):
    • Wireless customer agreement: This is the real deal. CSS plus six pages that summarize your true contract [2]
    • Customer service summary (wireless): Just the CSS
    • Alternatively, AT&T's "Your Phone CSS" email provides this link with goes to a screen I can't find when I navigate the site, in fact it seems to be a outside of the tabs they define and possibly a separate web site: https://www.wireless.att.com/olam/loginAction.olamexecute?target=CSS. There's a trick here. Unless you read carefully, you'll hit the "continue" button -- that will just take you back to the main site. Instead, look for the link under the wireless number drop down and click that. You get the PDF contract summary.

Note that under the Wireless tab is a "Rate Plan" link, but it only shows voice plan.

So, in summary, to determine your actually currently contracted services for a family plan you need to print/view a PDF for each individual family member and do the sums to produce an integrated view.

I wonder if there's a medical term for the psychosis induced by dealing with AT&T?

Next up: How to track SMS use.

[1] The primary flaw is that AT&T mixes marketing with service. There are other reasons, but that's the primary dysfunction. What I want to know is mixed with what they want to sell me. A secondary reason, is that they choose not to invest in areas that allow customers to see what their contracts are. Those investments have a low return on investment and will not contribute to someone making VP. It's just a happy accident that it works this way.

[2] Update: AT different times I got different PDFs for different users. Only my wife, a secondary number, produced the full six page report when I requested a "wireless customer agreement".

Update 12/11/11: I've studied the PDFs in more detail. They are quite hard to interpret, particularly for a family plan, but, with some study, they do show the current contract. I think the "wireless customer agreement" is a copy of the original contract, but the "customer service summary" is the current contract.

Wednesday, December 07, 2011

Deleting Google Profile breaks picasa share links

I  recently deleted my Google Profile. The link explains why.

I expected consequences and I am not disappointed.

Today many of my share links to my Picasa albums have broken. The albums are still gone, but the share links don't work.

It's easy to see why. Here's a link I shared last week to one of my recent albums. I've bolded the problem (omitting the auth key)

https://picasaweb.google.com/113810027503326386174/WildcatVsEagleNov2011

Here's today's link to the same album ...

https://picasaweb.google.com/jfaughnan/WildcatVsEagleNov2011

When I deleted my profile I removed my 2007 identity113810027503326386174. With the advent of G+ that identifier was the basis of shared image links. When I removed G+, those links broke.

Fortunately I hadn't shared that many albums recently. I got singed, but not burned.

I feel like I escaped the burning house of G+ just in time. Future exits will be far more difficult -- if not impossible.

Monday, December 05, 2011

Google doc share links cannot be changed

Google uses the secret share link feature in a few places. In several places the share link can be changed. I think of that as an essential feature; a way to correct a share mistake or deal with an escaped link.

Google Apps shared item links, however, cannot be changed.

Nobody talks about this.

Sometimes I think I'm the only person in the world who actually uses Google Docs.

Saturday, December 03, 2011

Blogger is dying more quickly than expected

I've been expecting Blogger to die, but since Google has been porting it to the "new look" I thought it had a year or two left.

Today when I search in Blogger I'm getting results sorted from oldest to newest, and search ends at 2006. I'm seeing this in both the old and new UI. In the new UI, of course, search is very limited; in particular you can't navigate large numbers of results readily.

Blogger is showing more of these inconsistent and failed behaviors. If Blogger were human, I'd say it had an untreatable cancer.

The good news is that WordPress.com's Blogger Import works fairly well -- though it omits all draft posts.

The bad news is that blogging is tracing the same trajectory I remember with web site authoring. In the 90s web site authoring tools were accessible to relatively non-technical amateurs. By the 00s that market had gone away, and quality "content management systems" were aimed at professionals and businesses. I'm seeing the same thing happening with blogging -- it's becoming a professional activity dominated rather than an amateur form of communication. The number of interesting blogs expressing personal opinions rather than muted convention is also diminishing.

These things are easier to understand if you live in a climate that has ice and snow and darkness.