[美国公开]US10842264

System and iterative method for lexicon, segmentation and language model joint optimization

著录项

申请号US10842264
申请日20040510
公开号US20040210434A1
公开日20041021
申请(专利权)人Microsoft Corporation
发明人Hai-Feng WangChang-Ning HuangKai-Fu LeeShuo DiJianfeng GaoDong-Feng CaiLee-Feng Chien
地址US WA Redmond
主分类号G06F017/27
分类号
G06F017/27

摘要

A method for optimizing a language model is presented comprising developing an initial language model from a lexicon and segmentation derived from a received corpus using a maximum match technique, and iteratively refining the initial language model by dynamically updating the lexicon and re-segmenting the corpus according to statistical principles until a threshold of predictive capability is achieved.

System and iterative method for lexicon, segmentation and language model joint optimization

著录项

摘要

信息查询

网页搜索

学术搜索