Multi-Modal Architecture for Cricket Highlights Generation: Using Computer Vision and Large Language Model