Computer-implemented method of creating a translation model for low resource language pairs and a machine translation system using this translation model
Abstract:
A computer-implemented method for creating a translation model for low resource language pairs and applicable on noisy inputs utilizing several approaches: choosing particular input corpora covering in-domain noisy and clean texts as well as unrelated but larger general parallel texts, performing several chosen methods of creating synthetic parallel corpora and filtering, pre-processing, deduplicating and concatenating training corpora.
Information query
Patent Agency Ranking
0/0