Abstract: This paper explores the potential of utilizing the Whispers model to create unified interfaces for audio-to-text in the context of Natural Language Processing (NLP). It offers possibilities ...