详细信息
藏文同元码与基本集相互转换的规则与实现 被引量:1
Regulars and realization in code transform between Tibetan Tongyuan codes and component sets
文献类型:期刊文献
中文题名:藏文同元码与基本集相互转换的规则与实现
英文题名:Regulars and realization in code transform between Tibetan Tongyuan codes and component sets
作者:武光利[1];于洪志[1];柳春[1,2]
第一作者:武光利
机构:[1]西北民族大学中国民族语言文字信息技术重点实验室,兰州730030;[2]甘肃中医学院公共课部,兰州730000
第一机构:西北民族大学中国民族语言文字信息技术重点实验室,兰州730030
年份:2009
卷号:45
期号:29
起止页码:134
中文期刊名:计算机工程与应用
外文期刊名:Computer Engineering and Applications
收录:CSTPCD;;北大核心:【北大核心2008】;CSCD:【CSCD2011_2012】;
基金:国家高技术研究发展计划(863)(No.AA2006010101)~~
语种:中文
中文关键词:藏文;拉丁转写;同元编码;基本集;编码转换
外文关键词:Tibetan; Latin transliteration; Tongyuan code; component set; code transform
摘要:在当今的计算机信息处理过程中,不同文字处理平台上相同字符的不同编码问题,即文字处理的不兼容,是一个亟待解决的重要问题。而在藏文信息处理的研究中,藏文的编码转换也是一个研究热点。藏文的文本、网站大多采用同元编码方式,而微软的Vista操作系统采用的是基本集的编码方式,所以两种编码的转换在藏文信息处理领域是非常重要的。主要介绍了藏文同元编码与基本集的相互转换技术,采用了将藏文按照拉丁转写拆分的方法,利用层数作为藏文同元编码字符结构与基本集编码字符结构的桥梁,通过一系列规则,实现了两种编码的相互转换。
Nowadays,in the processing course of computer information,the problem of using different codes to stand for the same characters on different characters processing platform,that is to say,the non-compatible of characters processing is a main problem to be settled.Well,in the research of Tibetan information processing,the research of Tibetan codes transforming is a hot point.Most Tibetan texts and websites use the Tongyuan codes while the Vista OS of Microsoft uses component sets.Therefore,in the field of Tibetan information processing,the codes transforming between these two is rather important.This paper mainly talks about transformational technique between Tibetan Tongyuan codes and component sets.The method of splitting Tibetan characters using Latin transliteration is taken.Tiers are taken as the bridge of Tibetan Tongyuan codes character structure and component set character structure,using a set of rules,to accomplish the transform of these two codes.
参考文献:
正在载入数据...