Slicing Is All You Need: Towards a Universal One-Sided Distributed MatMul