This paper systematically describes the Fosafer system designed for the Mandarin Audio-Visual Speech Recognition (MAVSR) Challenge 2025 Track 2. The purpose of Track 2 is to evaluate the performance ...