Redefining Token Economics: How Inference Hardware Choices Impact Real-Time Voice Agent Latency and Cost