: float32|int8|int32(NHWC|NC4HW4) output: float32|int8|int32 Metal: half->int32, int32->half, uint8->half OpenCL Cast