stereoplegic 's Collections
HyPoradise: An Open Baseline for Generative Speech Recognition with
Large Language Models
Paper
• 2309.15701
• Published
• 2
CoLLD: Contrastive Layer-to-layer Distillation for Compressing
Multilingual Pre-trained Speech Encoders
Paper
• 2309.07707
• Published
• 1
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo
Labelling
Paper
• 2311.00430
• Published
• 56
Reproducing Whisper-Style Training Using an Open-Source Toolkit and
Publicly Available Data
Paper
• 2309.13876
• Published
• 1
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large
Language Models
Paper
• 2309.10707
• Published
• 2
Massive End-to-end Models for Short Search Queries
Paper
• 2309.12963
• Published
• 1
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework
for Speech Recognition
Paper
• 2310.06434
• Published
• 4
MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics
Transcription
Paper
• 2108.02625
• Published
• 1
Beyond Universal Transformer: block reusing with adaptor in Transformer
for automatic speech recognition
Paper
• 2303.13072
• Published
• 1
Continual Learning for Monolingual End-to-End Automatic Speech
Recognition
Paper
• 2112.09427
• Published
• 1
ILASR: Privacy-Preserving Incremental Learning for Automatic Speech
Recognition at Production Scale
Paper
• 2207.09078
• Published
• 1
Multilingual Byte2Speech Models for Scalable Low-resource Speech
Synthesis
Paper
• 2103.03541
• Published
Bilingual End-to-End ASR with Byte-Level Subwords
Paper
• 2205.00485
• Published