Learning What and Where: Disentangling Location and Identity Tracking Without Supervision