Abstract: The prevalent use of Byte Pair Encoding (BPE) in Large Language Models (LLMs) facilitates robust handling of subword units and avoids issues of out-of-vocabulary words. Despite its success, ...
Abstract: For Tibetan speech recognition, the accuracy is often limited due to several challenges, including the heavy reliance of end-to-end systems on large amounts of annotated data, inconsistent ...