Which Verification Metrics Are Appropriate for Rare-Event Classification Problems?