TREAD: Token Routing for Efficient Architecture-Agnostic Diffusion Training