phương hướng mới cho tự điển chữ...

12
Phương hướng mi cho Tđin chNôm (Online Dictionary of Nôm Characters, a New Approach) Nguyn Hu Vinh (also known as Yuan Yu-Rong) Quality Control Manager, Industrial Technology Research Institute, Hsin Chu, Taiwan Abstract With the current computer and Internet technologies, it is feasible to compose a clear, precise, complete, easy-to-use electronic Nom characters dictionary that opens the door to new researching capabilities. The method of collecting Nom characters by searching through Nom literature is recommended as a useful and convincing solution. Each Nom character in the dictionary needs to be accompanied with quoted excerpts from multiple Nom writings existed from old time. The dictionary also needs to explain the composing structure of each character in the context of illustrated examples. An electronic Nom dictionary needs to incorporate current modern technologies which make it easy to broadcast through the Internet and easy to search (by Quoc Ngu, character, Unicode number, or by a set of characters, …). Most importantly, it has to be compatible among computers and computer platforms. In order to satisfy these demands, a Nom font conforming to Unicode standards needs to be created. In addition, a database structure needs to be defined from the beginning with essential fields such as characters, Unicode number, Quoc Ngu pinyin, structure of the characters, examples, reference sources, etc. Two user interfaces are required: an end-user’s interface and a developer’s interface. The end-user’s interface provides searching and querying capabilities to retrieve data from the database. The developer’s interface allows the Working Committee to record data and to collaborate all aspects of the work. Besides for the normal searching capabilities, the dictionary can also be expanded to provide statistical and translating tools. The Committee with different levels of knowledge and experiences not only in Nom characters and literature but also in computer technology, from different parts of the world, will bring more complete and comprehensive results, which are much more difficult to achieve by an individual. To ensure consistency and quality of the dictionary, the Committee needs to establish a working process, which conforms to proven project management, software development and quality assurance principles. Lastly, we will share some of design specifications of recently released Online Dictionary of Nom Characters with Excerpts. Tóm lược Vi các phương tin đin toán và Internet hin có, vic biên son mt quyn tđin chNôm đin trõ ràng, chính xác, đầy đủ, ddùng và có thmra nhng chiu hướng nghiên cu mi điu rt khthi. Phương thc thu nhp dkin bng cách lt soát các hình thái chNôm tnhng văn bn Nôm được xem như là mt gii pháp hu ích và có tính thuyết phc. Mi mt chNôm trong tđin đều phi được dn chng bng các thí dcthtrích dn tcác văn bn Nôm khác nhau. Tđin còn cn phi có phương cách gii thích cu trúc ca chNôm theo ngnghĩa ca ch. Mt tđin chNôm đin tcn phi được htrbi nhng phương tin kthut hin đại như dtruyn thông qua mng Internet, dtra cu (bng âm Quc Ng, mt ch, mã ch, hoc tp hp ch, v.v), và nht là có tính cách quc tế, máy nào cũng dùng được. Để đáp ng được các nhu cu đó, cn có mt bfont chNôm theo mã chun quc tế (Unicode). Thêm vào đó, cn có mt cơ sdkin (database) được xác định ngay tbước đầu để cha đựng

Upload: nguyenkhuong

Post on 21-May-2018

229 views

Category:

Documents


2 download

TRANSCRIPT

  • Phng hng mi cho T in ch Nm (Online Dictionary of Nm Characters, a New Approach)

    Nguyn Hu Vinh

    (also known as Yuan Yu-Rong) Quality Control Manager, Industrial Technology Research Institute, Hsin Chu, Taiwan

    Abstract With the current computer and Internet technologies, it is feasible to compose a clear, precise, complete, easy-to-use electronic Nom characters dictionary that opens the door to new researching capabilities. The method of collecting Nom characters by searching through Nom literature is recommended as a useful and convincing solution. Each Nom character in the dictionary needs to be accompanied with quoted excerpts from multiple Nom writings existed from old time. The dictionary also needs to explain the composing structure of each character in the context of illustrated examples. An electronic Nom dictionary needs to incorporate current modern technologies which make it easy to broadcast through the Internet and easy to search (by Quoc Ngu, character, Unicode number, or by a set of characters, ). Most importantly, it has to be compatible among computers and computer platforms. In order to satisfy these demands, a Nom font conforming to Unicode standards needs to be created. In addition, a database structure needs to be defined from the beginning with essential fields such as characters, Unicode number, Quoc Ngu pinyin, structure of the characters, examples, reference sources, etc. Two user interfaces are required: an end-users interface and a developers interface. The end-users interface provides searching and querying capabilities to retrieve data from the database. The developers interface allows the Working Committee to record data and to collaborate all aspects of the work. Besides for the normal searching capabilities, the dictionary can also be expanded to provide statistical and translating tools. The Committee with different levels of knowledge and experiences not only in Nom characters and literature but also in computer technology, from different parts of the world, will bring more complete and comprehensive results, which are much more difficult to achieve by an individual. To ensure consistency and quality of the dictionary, the Committee needs to establish a working process, which conforms to proven project management, software development and quality assurance principles. Lastly, we will share some of design specifications of recently released Online Dictionary of Nom Characters with Excerpts. Tm lc Vi cc phng tin in ton v Internet hin c, vic bin son mt quyn t in ch Nm in t r rng, chnh xc, y , d dng v c th m ra nhng chiu hng nghin cu mi l iu rt kh thi. Phng thc thu nhp d kin bng cch lt sot cc hnh thi ch Nm t nhng vn bn Nm c xem nh l mt gii php hu ch v c tnh thuyt phc. Mi mt ch Nm trong t in u phi c dn chng bng cc th d c th trch dn t cc vn bn Nm khc nhau. T in cn cn phi c phng cch gii thch cu trc ca ch Nm theo ng ngha ca ch. Mt t in ch Nm in t cn phi c h tr bi nhng phng tin k thut hin i nh d truyn thng qua mng Internet, d tra cu (bng m Quc Ng, mt ch, m ch, hoc tp hp ch, v.v), v nht l c tnh cch quc t, my no cng dng c. p ng c cc nhu cu , cn c mt b font ch Nm theo m chun quc t (Unicode). Thm vo , cn c mt c s d kin (database) c xc nh ngay t bc u cha ng

    http://www.nomfoundation.org/Conf2006/NHVinh_tudien_nom.pdf

  • cc thng tin v mt ch Nm nh mt ch, m ch, m Hn Vit, m c, cu trc, vn liu, v.v K n l mt giao din cho ngi dng (end-users interface), cho php ngi dng tra cu v tm kim (search) cc thng tin cha ng trong kho d kin ny mt cch d dng v nhanh chng. Ngoi ra, cn mt Giao din (interface) dnh ring cho Ban Bin Tp (BBT) nhp d kin vo kho d kin v tho lun nhng vn lin quan n mi kha cnh ca t in. Ngoi cc chc nng tra cu ca mt t in thng thng, t in cn c th m rng h tr cng vic thng k v phin m. Mt BBT vi nhiu kh nng v trnh khc nhau v c ch Nm ln k thut in ton, c ng nhiu ni trn th gii, s mang li nhng thnh qu m mt c nhn kh c th lm ni. BBT cn phi xc nh cc quy trnh lm vic r rng, vi mt s nguyn tc v qun l d n (project management), pht trin phn mm (software development) v qun l cht lng (sofware quality assurance) bo m tnh thng nht v cht lng ca t in. Cui cng l vi kha cnh ca quyn T in Ch Nm Trch Dn c thit k theo chiu hng trn. Ti sao T in Nm trn mng Mt t in c xem l thc dng v hu ch ti thiu phi hi nhng yu t: r rng, chnh xc, y , d dng v hiu qu. i vi mt t in ch Hn th vic sp xp tra tm ch l mt trong nhng yu t u tin hng u ph hp vi nhng yu t nu trn. Trong mt t in ch Nm, th cc yu t trn li c phn phc tp hn na. Ch Nm khng c in ch, vic xc nh hnh v ngha ca ch cha c r rng, cho nn cng vic lm nn mt t in ch Nm gp nhng tr ngi cn phi gii quyt. Ch Nm nn c sp xp th no cho tin li, tra cu cho nhanh chng, nn c thm nhng d kin cn thit no, lm sao ghi li c r rng v chnh xc hnh v ngha, dng trong trng no, thuc vn cnh no, tc phm no, ti sao c nh th ny th n v.v R rng, mt s vn nh trn cn phi c v gii quyt thch ng. Vi s p dng k thut my in ton vo cng vic bo tn ch Nm ngy cng tr nn thc t, th mt cun t in ch Nm online trn mng Internet c kh nng gii quyt mt s ln vn nan gii nu trn. M ch no v loi phng no thch hp T nhiu nm nay, mt s cng ty nc ngoi nh Cng ty Dynacomware ca i Loan thit k mt loi phng c bit cho ch Nm, tuy nhin nhng khng bit v l do g phng ny khng thy cho lu hnh trn th trng. Ring Mojikyo Institute Japan th cho ra th trng phn mm ni ting Kim Tch Vn T Knh ( 10 ) chy di Windows 2000 hoc Japanese version XP, nhng ch c khong 2000 ch Nm, vi li phn mm ny khng s dng c vi Unicode, cho nn khng thc dng i vi ngi Vit. Ngy nay, mt s ln ch Nm c chun ho v cp m Unicode nn s xc tin vic ch to cc phng ch Nm dng TrueType ng dng cho cc phn mm my in ton tr nn quan trng, cn thit v cp bch. Chn mt loi phng thch hp pht trin cc phn mm ng dng ch Nm phi l iu suy ngh u tin ca cc nh thit k phn mm. Trc khi ch Nm c m ha vo Unicode, th trong nc cng c ngi xc tin vic ch to phng, song v chng ta cha c c mt b m quc gia, do vic ch to phng ri vo tnh trng b tc. Ngay c sau khi ch Nm c a vo Unicode, cc phn mm ng dng cng cha ti. Mt phn bi l cc phn mm dng lm data processing nh Microsoft Word cng cha quan tm, mt phn bi l hin th ch Unicode trong cc vng CJK Unified Ideographs ca Unicode i hi k thut hin i v tin tin nht mi thc hin c. V d, hin th ch trong vng CJK Unified Ideographs B, Microsoft a ra k thut surrogate x l, va phc tp va i hi my tnh phi x l c a ngn ng v chy vi vn tc nhanh. Vi iu kin k thut v kinh nghim v phng ch rt hn ch nc ta hin nay, kh lng c th thc hin

  • c tt mt cng vic s v phc tp nh vy trong khi vic pht trin cc phn mm ng dng Unicode trn th gii ang cn nm trong thi k phi thai. Vi li, s ch Nm c cp m Unicode d c khong 10,000 ch, song con s ny vn ch c xem nh l rt khim tn. Cn rt nhiu ch Nm cha c m Unicode nm nhan nhn trong cc tc phm Nm. Ch hai vic nu trn, cng lm cho vic pht trin cc phn mm cn bn cho vic ng dng ch Nm nh Bn g, T in, database, webpage v.v u b tr ngi ln. dng c ch Nm vi b m Unicode, mt im k thut cn x l l cc ch Nm cha c m Unicode phi a vo vng m dng ring (Private Use Area) ca Unicode. Song m trong vng ny thng c cc phn mm khc s dng, nn hay xy ra phng ca cc phn mm xung t ln nhau. Mun trnh tr ngi ny, th c th s dng cc vng Private khc, song li b tr ngi k thut surrogate nh ni trn. Vic thit k mt b phng thc dng cho ch Nm l iu khng phi d dng. Ch Nm v b xp nm ri rc trong cc vng CJK Unified Ideographs (4E00-9FAF), CJK Unified Ideographs Extension A (3400-4DBF), CJK Unified Ideographs Extension B (20000-2A6DF) v sau ny s l vng CJK Unified Ideographs Extension C, do nu mt b phng ch Nm ch lc ra cc ch Nm c trong cc vng ch m loi b cc ch Hn trong vng th s gii hn tnh cht thc tin ca phng khi dng chung vi ch Hn trong cng mt ng dng (aplication). Nhng nu bao gm ton b ch Hn c trong cc vng th b phng tr nn to ln, knh cng v nht l i hi thi gian v cng sc rt l to ln thit k nn b phng ! May rng, chng ta c mt bc tin t ph l vic ra i ca hai b truetype phng Hn Nm A v Hn Nm B hin th ch Nm v ch Hn m Unicode do Chn Nguyn Quc Bo (Germany), T Minh Tm (USA), Ni sinh Thin Vin Vin Chiu (Vit Nam) v cc tc gi ca b T in Ch Nm Trch Dn online cng nhau thc hin. B phng Han Nom A v B x l ch Nm rt tin tin v mt k thut cho n s lng ch v cng ln nm trong ton th cc vng CJK Unified Ideographs, m sau ny c th dng c lun cho Extension C! Ngay c China v Taiwan, cho ti by gi vn cha c 1 b phng nh b phng Han Nom A/B chng ta ang c. Japan, t chc Mojikyo Institute vn cha c b phng t ti phm vi ln trong Unicode nh b phng Han Nom A v B ny. Vi iu kin k thut v kinh nghim v ch cn rt hn ch nc ta hin nay, vic thc hin c cng vic s v phc tp nh vy l iu khng phi d dng v ng khch l. Vi 2 b phng ln ny, chng ta c th x l v hin th c tt c cc ch Hn v ch Nm c m trong cc vng ch ca Unicode. Ngoi ra, b phng ny cng m ha tm thi khong hn 2000 ch Nm cha c m Unicode v vo 2 vng Private Use bt u t U+E000 v U+F0000. Tm li, s pht trin phn mm ng dng ch Nm c kh thi v thc dng hay khng u hon ton ph thuc vo loi phng v m ch chn dng cho phn mm , nhng yu t cn ti nh sau: 1) M Unicode hay m quc gia hay l ring ca mt phn mm, ca t chc no , 2) Nu l Unicode, c nn pht trin b phng trong bao gm cc loi ch vung khc cng nm trong cng mt vng ch, 3) Nn c bao nhiu ch cha c m Unicode nhng c m ha tm thi? v 4) Chn vng Private Use u?, nu t ngoi vng U+E000 th phi c phng cch no hin th chng v nu dng Iexplorer th k thut Surrogate by gi gii hn hin th ch trong vng B m thi. Computer platforms quan trng hay khng? Ni ti T in Online th ngi ta ngh ngay ti mt trang web trong ngi dng c th tra cu ngay ch . Thng thng, cc webpages u c vit bng dng Hypertext Markup Language (html), nhng html c nhng bt li nh l phi online, vo web mi xem c, v ngi khc c th ly v d dng, ly nguyn c website v lm data ring cho h! Nu ti cc phn mm b tc khc ngoi vic thit k bng dng Hypertext Markup Language, th mt t in online cng phi nn ch trng n cc yu t khc, nh d di chuyn (migration) vo cc

  • platforms khc, vn tc nhanh chm v.v. Mt trong nhng khuynh hng hin ti cho application programs l s dng ngn ng Java thit k phn mm. Ngn ng Java vi nhng u im nh l ch cn coding 1 ln, khng cn thay i g c nhng c th chy online ln offline; v c th chuyn (migration) vo tt c cc platforms khc nh Windows, Macintosh, Linux, Pocket PC, Palm, Cellular phone v.v. Chng ta khng th chuyn i platform linh ng nh th m khng dng ti Java. Ngoi Java ra, cha c programming language no c th lm c chuyn ny so vi cc language khc. Ch c java applet mi c th embed c trong mt webpage. Tin li l bt c ni u, nh ti th vin, i du lch vng quanh th gii, u cng c th tra cu t in c m khng cn phi c web bowsers. Khuyt im ca Java l chy hi chm, nhng c th b li bng cch nng cp my tnh ln hng vn tc cao hn, hin i hn, v d sao th x l ch Nm vn i hi my mc hin i v tin tin mi lm c. Mc d s dng Java programming trong mt t in online lm cho vic hin th ch c phn chm, song trong bi cnh hin ti, t in ch Nm online chy trn nhiu platforms l iu cn thit cho nhiu ngi dng khc nhau. Mt phng php tra ch mi v c bit

    Hin nay, nhng t in c ch thch ch Nm c b i Nam Quc m T V ca Hunh Tnh Ca, Dictionarium Anamitico Latinum ca AJ. L. Taberd, Dictionnaire Annamite Francais ca J. F. M. Gnibrel, Dicitionnaire Annamitte Francaise ca Bouet etc. Nhng sch xa ny ch yu l nhng t in tra ch t m Quc Ng, nn cc mc ch u c sp xp theo mu t Latinh. Trc , trong khong th k XVI n XIX cn c cc sch c xem nh l t in Hn Vit ghi ting Vit bng ch Nm, nh Ch Nam Ngc m Gii Ngha thi L s, Nht Dng Thng m ca Phan nh H thi Nguyn s, T Hc Ton Yu ca Ng Thi Nhm thi cha Nguyn, T c Thnh Ch T Hc Gii Ngha ca vua T c v.v u cn c vo nhng cch sp xp no lin h n ch Hn, nn mun tra tm 1 ch Nm u rt kh khn. Vo thi cui th k XX sau ny, cn c i T in Ch Nm ca V Vn Knh, Bng Phin m Nm Vit ca Trng nh Tn, T in Ch Nm ca Takeuchi Yonosuke, T in T Vng Lch S Ch Nm ca Paul Schneider v.v mi cun u c nhng cch sp xp ch theo nhng h thng khc nhau, nhng tu chung mt vn nan gii ca mt cun t in ch Nm trn giy l lm th no sp xp ch cho c h thng, d dng, d tra cu, nhanh chng v c hiu qu. T in ch Nm nn c sp xp nh th no? Cch xp t c ngi ta ngh ngay n l cch xp t theo b th nh cc t in ch Hn. Nhng ch Nm li rt phc tp, v tr ca phn c xem nh l b th li khng nht nh, kh nhn thy b th ch yu u, ly phn vit bn tri, bn phi, hay l bn trn, bn di, hay l ly phn biu lm b th u khng c quy nh nht thit. Mt iu bt thnh vn quyt nh u l b th cho ch Nm l ly phn biu hay l b th ca phn biu lm b th cho ton ch. Tuy nhin, iu ny tr thnh khng ng sau khi ch Nm c a vo Unicode. Cc chuyn gia Tu, c th khng hiu hoc v thin kin cho khng cho m cho nhng ch Nm c hnh dng trng hp vi ch Hn. C nhng ch Nm trng dng ch Hn nh ch n , ch ny l 1 ch Nm c hnh dng vi ch Hn , Unicode khng cp thm m cho ch, chng ta phi dng chung vi ch Hn, v v vy ch Nm ny c b th l thay v nh chng ta ngh nh th. C nhng ch Nm ngha ging nhau, phn biu ging nhau nhng v tr ca n i vi phn biu m khc nhau, Unicode li xp vo cc b th khc nhau, thay v xp vo bt th ca phn biu ca cc ch , v d: Hai ch ui , ny c xp vo hai b th khc nhau . Ch Nm mn m Hn Vit ca ch Hn ghi m, nn b th ca ch Hn khng biu th c ngha ca nhiu m Nm ca cng mt ch Nm trng dng ch Hn . Li cn c nhng ch Nm c bit nh (lm), (xng),(y),(tiu),(c),(v),(xong) th nn xp vo b

  • th no cho thch hp. Nh th, vn nhn nh v sp xp b th trong ch Nm qu l mt l mt iu cha gii quyt c. Do vy, sp xp ch trong t in ch Nm theo b th khng phi l iu n tha nht. Cch sp xp khc l xp t theo s nt. Thc ra th cch xp t theo s nt cng phi kt hp vi cch xp b th. Cn cch xp theo tng s nt cng khng km phn phc tp i vi ngi Vit nhng theo Trn Vn Gip th phng php ny l gin d v tin li nht so vi cc cch xp t khc. Cn nhng ch Nm trng vi ch Hn th cn c th mn cch sp xp theo cch c PinYin, ZhuYin hoc cc phng php xp t c th ca China hay l ca Taiwan v.v. Nhng cch xp t khc v mi hn nh l xp theo s m hoc theo phng php m bn gc (t gic hiu m), cch ny cng khoa hc v thc t, ch cn trng ch v tm hnh ngay, nhng phi nh ti bn ch s th cng phin toi lm m khng phi ngi Vit no cng rnh cch nhn mt ch Nm. Hin ti, cch sp xp ch trong mt t in ch Nm trn giy thng c cc phng cch sp xp thng dng nh trong cc t in ch Hn: bng m Nm, m PinYin ca China, Zhuyin ca Taiwan, bng phng php m bn gc, theo s m Unicode, bng s nt kt hp vi b th, bng tng s nt v.v. Mi phng cch u c ch ng ring ca n. Nh vy, i vi phng cch sp xp tra tm ch cho mt t in ch Nm online th sao? Ngoi cc phng cch tra tm nu trn, t in ch Nm online cn phi cho php tm kim (searching) ch Nm "theo mt ch vit", ngha l, cn c trn cc thnh phn cu to nn ch Nm . y phi l mt chc nng c bit v phi lm sao cho chc nng tm kim ny ca t in lm vic mt cch thng minh, d hiu, d dng, nhanh chng v c hiu qu. Ngi dng t in, khi gp mt ch Nm c th khng bit m c l g, khng bit thuc b th no, khng r s m Unicode, nhng chc chn l bit c thnh phn cu to nn ch , ch cn tm kim (searching) ch ny bng cch tra theo tp hp ca nhiu thnh phn ca ch v s tm ra nhng ch Nm c nhng thnh phn c trong t in. Li dng c tnh tm kim (search) nhanh chng ca computer, th vic sp xp ch Nm trong database sao cho vic tra tm ch c tr nn d dng v thng minh hn. Database ca t in cho Nm online cn phi c mt vi Trng (fields) cha ng nhng thng tin (information) cn thit v tp hp thnh phn cu to nn mt ch Nm, sau ny li dng mt searching algorithm no tra tm (searching) ch Nm c thnh phn cu to ghi trong fields ca ch trong c s d liu (database). i vi mt t in ch Nm online th y l mt phng cch tm kim hiu qu nht cho ti nay, nht l cho ngi dng khi tra tm mt ch Nm l khng r b th, khng bit m c, khng bit m ch. Nu thnh phn ca ch Nm ny bao gm b th ca ch, th c th ni rng y l phng php tng qut ha phng cch tm bng b th nhng nhanh chng v c hiu qu hn nhiu. V d: Tra tm ch , i vi t in bng giy, c th v khng bit m Nm, t in khng c phng php m bn gc, ngi tra t in c th phi tn th gi tra loi b th, nhng tra b th no, bng b , bng b , hay bng b , va mt th gi va mt nhc! Vi th d ch ny, ngi dng t in Nm online hoc l searching bng 2 keys: v c th tm ra cc ch sau: ,,,,, hoc l searching bng 2 keys: v c th tm ra cc ch ,,,,,, hoc l searching bng 2 keys: v c th tm ra cc ch ,,,,,, hoc l searching c bng 3 keys: , , th c th tm ra ngay cc ch ,. Tin li ca cch tra theo tp hp thnh phn ca ch ny l nhanh chng v hiu qu v ngi dng c th da trn mt ch tra tm khi phi cn bit bt k d kin no khc nh b th, m Unicode, m c v.v.

    Tm li, ngoi cc cch sp xp ch Nm theo cc phng cch c dng trong cc T in ch Hn v Nm trn giy, mt t in ch Nm online cn phi c thm nhng phng cch nhn din v tra tm ch bng cch cho php ngi dng tm kim (searching) bng thnh

  • phn to nn ch . Cho n nay, y l mt phng cch c xem nh l c hiu qu nht tra tm ch trong mt t in ch Nm online . Kt cu ch Nm Nhng php cu to ch ca ch Nm vt qua khi php Lc th ca ch Hn ni ln s c gng to ch c th khc vi phng php dng cho ch Hn, mang mt tinh thn dn tc ring ch khng nht nht u phi rp theo khun kh ch Hn. Trc thp nin 70, nhng nh nghin cu ni ting nh Dng Qung Hm, Hong Xun Hn, Vng Lc, Chen Ching Ho, Vn Hu, Trn Vn Gip, o Duy Anh, Bu Cm, V Vn Knh v.v u cho rng ch Nm s dng 3 phng php to ch trong 6 phng php (Lc Th) to ch ca ch Hn, l ba php Gi t, Hnh thanh v Hi . Chen Chin Ho nm 1949 cng phn loi cu trc ch Nm theo ba loi l Gi t, Hnh thanh v Hi . Trong Chen cho rng c ba loi Gi t, hai loi Hnh thanh v a ra nm ch Hi . Trc , Dng Qung Hm nm 1942 cho rng c nm loi Gi t, hai loi Hnh thanh v a ra mt ch Hi . Sau ny, o Duy Anh nm 1975 gii thch chi tic hn nm loi Gi t, hai loi Hnh thanh v ch a ra su ch Hi . Cc cch phn chia trn c th ni l bao gm hu ht cc loi ch Nm. Tuy nhin trn thc t, cn c nhiu loi ch Nm c cch cu trc c bit th c ba php to ch ny i hi linh ng hn khi gii thch s cu to ca ch. V d nh loi ch Nm c bit mang m Vit c, hoc ch b vit gin nt hoc dng mt ch Nm khc lm thnh t biu m th ba phng php ny hi gp lng tng trong s gii thch s cu to ch. Sau thp nin 70, cc nh nghin cu Nm nhn thy nhng m hnh cu to ch Nm thi trc nu vn cha c hon ho gii thch nhiu trng hp c bit, nn i su hn nh l tm hiu s lin quan gia m Nm ca ton ch vi m Hn Vit ca thnh t biu th m c, hay l da trn nhng phng php ng m hc lch s, hoc l chia ch Nm ra thnh loi c m c v khng c m c. Trc ht, Nguyn Ti Cn nm 1976, a ra m hnh cu trc chia ch Nm ra by loi da trn cn bn ch mn Hn gm ch Hn Vit, gc Hn Vit, vit tt, gi t v loi ch Sng to trong gm ch hnh thanh, hi , vit tt, du nhy v m Vit c. Cng trnh nghin cu v m hnh ca Nguyn Ti Cn dn n cc cng trnh nghin cu khc sau ny vi nhng li nhn khc hn v chi tit hn. L vn Qun nm 1981 a ra m hnh gii thch mi trong ch Nm c chia ra thnh 14 loi, trong 6 loi thuc nhm ch da trn ch Hn hin c, cn 8 loi sau thuc nhm ch Sng to gm ch hnh thanh, hi , vit tt, du nhy v m Vit c. Sau ny Nguyn Ngc San da vo m c chia ch Nm ra 14 loi trong ch c 4 loi l trc tip mn m Hn Vit. Vi ba m hnh gii thch cu to ch Nm ca ba hc gi ni trn cho thy s a dng ca cu trc ch Nm khng nhng ch n thun gm ba loi Gi t, Hnh thanh v Hi cn bn thng thy m cn nhiu loi ch c cu trc c bit, ni ln s a dng v phong ph ca m Nm hn m ting Hn. D cc nh nghin cu ch Nm a ra nhiu li phn loi ch Nm kh hon chnh, nhng cho n by gi vn cha c c mt m hnh kt cu (Classification schemes) tht hon ho gii thch s kt cu ca ch Nm, nht l vn kt cu ca nhng ch Nm c tnh cht c bit nh 1) Ch Nm m Vit c s dng nhng k t ch Hn nh , , , , , , , ,,,, ,, v.v ghi m tit ph. Nhng k t ny c lc i cng trong mt ch vung khc nh ch V hoc l dng 2 ch Hn ring bit ghi mt m Nm nh trong Pht Thuyt i Bo Ph Mu n Trng Kinh, v d, dng hai ch i cng nhau ghi mt m V, hoc dng hai chi cng nhau ghi mt m Li, hoc dng hai ch i cng nhau ghi mt m . 2) Ch gi t Nm, l loi ch Nm c cu to bi mt ch Nm khc cng m hay tng t, song ngha ca n khc hn vi ngha vi ch Nm mn lm ch biu m, 3) Ch Nm t ch Hn vit gin nt, th d in hnh nht l ch Lm m phn nhiu cc nh Nm hc cho rng n l cch vit

  • gin nt ch Vi , ch Pht l gin nt ch Phn , 4) Ch Nm do ch hnh thanh vit gin nt, c th v ch hnh thanh ny c qu nhiu nt, kh nh, kh vit nn trong qu trnh hnh thnh ch vit mt phn ca ch b c gin lc i. V d: i , l ch hnh thanh, tp hp ca , nhng ch b lc b; ch Theo l tp hp ca b Tc vi ch Thiu gin nt b Ha; ch Ni l tp hp b Thy vi ch gin nt b Thc , 5) Ch Nm do nh hng ca ch cng i i, loi ny thng thng mang b th ging b th ca ch thng hay i i vi ch. V d: Ch Gn thng hay i i vi Xa , (xa gn) nn Gn c b th l Bi, ging nh b th ca Xa, trong khi b th Bi trong ch Gn khng gip thm ngha g cho Gn c. Do vy, l gii ti sao ch Nm c vit nh th ny nh th kia, iu ny cng gp phn khng nh trong vic gip ngi s dng nh mt ch, nhn nh ra m c, t in ch Nm cn phi c phng cch gii thch s kt cu ca ch Nm xem ch Nm ny trong trng hp ny c nh th ny, trng hp kia gii thch ra lm sao cho thch hp vi vn mch. Dn chng v sao lc bin son mt b T in Ch Nm, cc nh lm T in thng thng mong mun T in ca mnh c mt s lng ch Nm s, nn cng vic trc tin l gp nht, trch dn tt c cc ch Nm trong cc t in xut bn. Song lm nh th cng cha , v cc tc phm trn, khng c bin son trn c s lt sot tt c cc tc phm ch Nm ng thi, cho nn c nhiu ch Nm thi trc khng c ghi vo. V d ch Gi ,,,,,,, th cc t in ch Nm nu trn cng ch c mt vi ch. Nhiu lc li c nhng ch Nm ch thy ghi trong t in m khng thy dng trong cc tc phm Nm. Nu mt t in Nm ch ch trng ti vic lm sao thu nht c tht nhiu ch cho s lng ch c s m khng ch tm ti vn ch Nm ny c tht hay khng, xut hin trong tc phm Nm c no, cch dng trong vn cnh no, th gi tr thc dng ca cun t in ny b gim i rt nhiu. Do vic su tm c liu ch Nm cn hn ch v cha c h thng, cha ni n vic bao qut ton b, th nn theo Trn Vn Gip th c nhng trng hp ch Nm c ngh ra, suy ra hoc sng ch ra hoc khng thy trong vn bn no c. Do vy, t in ch Nm phi trung thnh vi vn bn, phi ghi li r rng xut x, phi c cch gip l gii tnh thch hp trong vn mch v gii thch kt cu (classification schemes) ca ch Nm trong nhng trng hp dng trong mt tc phm no . Nh tnh cht dung np d kin rng ln ca computer, mt t in ch Nm online c th thu lm v c kt cng nhiu cng tt cch dng ca mt ch Nm qua nhiu th d ng dng ch ny trong nhiu trng hp trong cc vn bn khc nhau. Mun cho t in Nm thu thp c mt cch y cc ch Nm thng dng v c bit c gi tr th cn phi lt sot tt c cc tc phm ch Nm c trong dn gian. D nhin, chng ta khng th lt sot c ht tt c cc tc phm Nm, nhng t nht l phi lt sot nhng tc phm ni ting, quan trng v tiu biu ca nhiu th loi, thuc nhiu thi k trong lch s; cng la chn nhng d bn tiu biu ca mt tc phm; cng nh a vo t in nhng vn bn Nm c m hng a phng khc nhau ca t nc. Mi ch Nm trong t in, vi mt hay nhiu m c, vi mt hay nhiu ngha, u phi c dn chng bng cc th d c th trch dn t cc ti liu, vn bn Nm khc nhau v c ghi ch xut x chnh xc, nu c, gm tn tc phm, thi im xut bn ca tc phm, tn nh xut bn, trang/t, cu v.v. Bng phng cch ny, t in ch Nm s c c nhng ch/t thun Vit c, s tm ra c rt nhiu m c cho mt ch Nm, v d, ch chng ta s tm ra c rt nhiu m c trong nhiu vn cnh khc nhau: lng, lng, lng, lng, lng, lng, lng, lng, rng, sang, sng; ch c cc

  • m sau: li, ly, ly, ly, l, l, l, ry, r, r, tr, tr; ch c cc m: lng, lng, lng, lng, lng, sng; ch c cc m Nm: dng, nh, dnh, ng, rnh, nh. Nhng bc trin khai v tin hnh cng vic tin hnh cng vic thit k cho mt d n t in Nm online, tt c cc yu t lin quan n vic qun l d n nh nhn lc, phn cng, quy trnh lm vic, cng c cn thit, k thut, tc phm tham kho, nguyn tc lm vic, c tnh cn c, gii hn bit, quy tc thng tin ni b, quy trnh lm vic (development process), quy nh k thut (functional specifications), phn Giao din cho ngi dng (Guide User Interface-GUI) v C s d liu (database) v.v u phi c vit vo nhng h s qun l d n (project plans) r rng, phi c ban ban tp cu xt, sa cha nhanh chng v ban hnh cho mi ngi hiu r tc thi, nht l phi bn lun thn trng nhng g cn phi c trong Quy nh k thut, C s d liu v phn Giao din cho ngi dng. V phn Giao din v C s d liu th t nht nhng yu t sau y phi c tha mn: 1) Cc phng php tra tm ch (bng b th, bng nt, bng m c, bng ch Unicode, bng m Unicode, bng tp hp thnh phn ch), 2) Cc d kin cn c cho mt ch Nm nh thng tin v b th, s m Unicode, m ni b, s nt, tng s nt, kt cu ca ch, cc m c v phn loi cc v d trch dn t cc vn bn, cng nh thng tin v cc vn bn nh tc gi, tn sch, nh xut bn, nm xut bn v.v. Ban bin tp v thit k phi nh th no ? Ban Bin Tp phi bao gm cc chuyn gia ngn ng Hn-Vit-Nm, cc nh nghin cu ng m hc lch s, cc chuyn vin k thut in ton v Internet, cc chuyn gia ch to phng ch v c nhng ngi mi tm hc ch Nm. cng vic tin hnh d dng v c hiu qu cn phi c nhng phng cch iu hnh thng tin ni b lm sao cho thng tin ni b c nhanh chng, hiu qu v thng sut. S pht trin ca k thut in ton cng kh nng truyn thng trn Internet cho php kt hp mt gm nhiu ngi c ng ti nhiu ni trn th gii, th nn, cn phi cn n nhng cng c thng dng iu hnh thng tin ni b nh Blog, Listservers v.v Quy trnh lm vic.

    Cng vic thc hin t in online nn c tin hnh v lp i lp li nhiu ln nh sau: 1. Thu thp cc ti liu, vn bn ch Nm t cc th vin trn th gii hoc t nhng t

    liu ca cc nh nghin cu. Chn lc mt s vn bn nng ct lp thnh mt "Th mc" dng lm c s lt sot tng ch Nm mt. a ch vo t in vi tinh thn tn trng ti a nguyn tc, cc th d c trch dn phi chn lc k cng v chnh xc.

    2. Thit lp Giao din. C th thit k mt Giao din ring cho Ban bin tp c nhng chc nng cn bn nh thng k, phn tch, tm kim, thay i, tra tm v.v.

    3. Thit lp v cp nht C s d liu Mi tng quan gia phn Giao din v C s d liu u phi c kim chng v phi

    ph hp vi nhng yu t yu cu trong cc h s qun l d n. 4. Ch to phng ch Nm theo ng tiu chun m quc t Unicode Standard v Microsoft

    Specifications for True Type Fonts cho mi ch Nm ca kt qu cng trnh lt sot ch t cc vn liu su tm c.

    5. Thit k v cp nht mt bn g cho cng vic g nhp ch c d dng v nhanh chng hn.

    6. G nhp ch vo database ca t in. Mi ch (mc t) phi c kim chng (testing) nhanh chng bo m tnh chnh xc.

  • Ngoi quy trnh thit k mt cun t in online trn y, cn phi c ch trng n s tng quan gia quy trnh thit k b phng ang s dng v mt cng c bn g ph gip thm cho cng vic g nhp ch vo C s d liu ca t in Nm online. Khng nh cc ngn ng chu khc nh Hn, Hn v Nht, cc ngn ng ny u c cc b m quc gia y , nn vic thit k phn mm cho my tnh t b tr ngi v vn phng ch. Ch Nm th tri li. Mc d c c khong 10,000 ch c Unicode m ha , song con s ny vn ch c xem nh l nh v c rt nhiu ch Nm cha c m Unicode cn nm nhan nhn trong cc vn bn Nm. Nu khng c mt b phng y th vic ng dng ch Nm trn mng, trn my tnh vn cn nhiu hn ch. V nu nh khng c mt cng c g nhp t in mt cch d dng v hiu qu th cng vic thit k t in online cng s gp nhiu tr ngi. Do vy, phi c nhng lin h mt thit v ng b gia ba quy trnh lm vic ny (C s d liu, Bn g v Phng ch). Cu trc C s d liu

    Xc nh cu trc c s d liu ngay t lc u l u tin hng u i vi cc d n lm t in ni chung v t in ch Nm ni ring. Vic phn chia cc d kin thnh tng trng (field) vi nhng quy c nht nh gip gim thiu cc sai st trong khi nhp d kin v cho php ngi dng tm kim (search), sp xp (sort) d kin theo mun mt cch nhanh chng. Ngoi ra, kho d liu cn l nn tng ca cc cng c phn tch, thng k, nghin cu, v.v. Ta cn c th lin kt kho d liu hin c vi cc kho d liu khc gim thiu vic nhp d kin. Hnh v di y l mt ngh cho mt cu trc kho d liu cn bn cho T in Ch Nm Trch Dn-mt da n T in Ch Nm online.

    Theo cu trc ny, ngi dng c th tm kim (search) v sp xp cc d kin theo bt k trng (field) no c mt trong cc bng (table) ny (ch, m Unicode, tp hp ch, m Hn Vit, m Nm, ph, m ph, vn liu, v.v.). Cc bng (table) c th c thm bt nhng yu t cn thit ph hp vi cc chc nng thng k hay tra cu sau ny ca t in. V d: Nu d nh T in ny c th cho php ngi dng tra tm ch Nm theo phng php 4 gc

  • (Four corner method) th trong phn cu trc TdNm chng ta c th thm vo cc d kin cn thit ca phng php 4 gc. Cc chc nng thng k phi c thit k theo chiu hng gip cho cc nh nghin cu c thm phng tin tra cu, cung cp cc d kin quan trng cho cng vic nghin cu ch Nm, v d: tm ng cnh ca ch Nm, hoc lm thng k tng hp cc dng ch Nm c cng mt m c, hoc so snh m ngha, phn tch, suy lun hu xc nhn hoc a ra nhng kin gii v cch cu to, din bin ca s s dng ch Nm qua cc thi k lch s, v.v. C s d liu ca t in ch Nm online cha ng ch Nm da trn cn bn lt sot cc vn bn, theo mt kha cnh no , chnh l mt cun t in ch Vit c. Giao din cho ngi dng (end-users interface)

    Ngoi vic cung cp cc chc nng c bn ca mt t in nh tm kim theo m, m Unicode, ch, tp hp ch, v.v. ta c th m rng Giao din cho ngi dng bng cch cung cp cc cng c (tools) nh thng k, phn tch, phin m, bn g, kho vn bn, v.v. m c s d liu cho php. Giao din cn phi gn gng, d dng, v tin li. Giao din cho ngi dng (chc nng c bn) ca T in Ch Nm Trch Dn c thit k nh sau:

    C th thit k thm mt Giao din dnh ring cho Ban Bin Tp cun T in c nhng chc nng cn bn nh thng k, phn tch, tm kim, thng tin ni b v.v. Mi tng quan gia phn Giao din v C s d liu u phi c kim chng v phi ph hp vi nhng yu t yu cu trong cc h s qun l d n. Vn qun l cht lng Chnh xc l iu i hi hng u m mt T in cn phi c. Gi tr ca mt cun T in s b gim i rt nhiu, nu ngi dng pht hin c nhiu sai lc, lm li hin din trong cun t in . S hin din ca cc sai lm ny s v cng tai hi, v s c nhiu ngi dng t in s b sai lm theo. Do vy, thit k t in Nm online cng ging nh thit k cc loi phn mm khc i hi s quan tm v ch trng c bit n cc quy trnh thit k, pht trin v kim chng phn mm, nht l quy trnh kim chng cc d liu a vo t in, khng c

  • st 1 li lm no tn ti. p dng cc phng php qun l cht lng ng dng hin hnh trong k thut thit k v kim chng phn mm t khi d n bt u tin hnh cho ti khi d n chm dt l iu cn thit v quan trng. Cht lng ca t in online ph thuc rt nhiu vi cht lng ca cc loi h s qun l d n (project plans) nh Quy trnh lm vic, Quy nh k thut (functional specifications), Giao din cho ngi dng (Guide User Interface-GUI) v C s d liu (database) v.v. Cc h s qun l d n ny phi c ban Bin tp bn lun v kim chng nhiu ln qua sut thi k tin hnh d n. Mt i ng lm cng vic kim chng v pht hin cc li lm lm vic tc khc ngay sau khi cc qu trnh thit k tin hnh xong. Nh vy, quy trnh thit k-kim chng (design-test cycles) s xy ra lin tc cho n lc d n chm dt. Cc cng c cn thit: d n c th tin hnh nhanh chng v c hiu qu cho vic thit k mt t in online, nn thit k ring cho d n nhng cng c cn thit, nhng ng thi nn nghin cu v tm kim nhng cng c (tools) sn c trn th trng. Nhng cng c tra cu sau y nn ng lit k vo danh sch cn dng nh: 1. Unicode Character Map Utility for Windows: http://www.babelstone.co.uk/Software/BabelMap.html2. UniHan database: http://www.unicode.org/charts/unihan.html3. A Nm Lookup Tool based on Unicode http://nomfoundation.org/unicode/lookups.php4. Nhng phn mm thng dng g nhp ch Hn 5. Bn g ch Hn/Nm (HanNomIME) ca Alexandre L v Trn Uyn Thi http://www.viethoc.org/hannom/bango_intro.php Kt Lun Vic bo tn v pht huy nhng gi tr vn ha truyn thng tim tng trong di sn ch Nm ngy cng c coi trng v cp bch. Cng ngh thng tin hin i c xem nh l mt phng cch hu hiu phc hi v bo tn di sn ny. Vai tr ca mt cun t in ch Nm online s dng trn mng Internet cng theo phi c vi mt din mo mi, phong thi mi v k thut mi p ng nhu cu . Mt t in ch Nm online phi c mt phng php tra tm nhanh chng v hiu qu cho php ngi dng c th tm kim mt ch Nm bng thnh phn cu to nn ch Nm m khng cn bit thm cc d kin no khc; phi c nhng phng cch gip ngi dng hiu c kt cu to nn ch khi dng trong nhiu vn cnh khc nhau; phi c bin son da trn c s lt sot, sao lc v trch dn chnh xc t cc vn bn Nm xa v ng thi c s d liu (database) ca t in phi thit k nh th no c th cung cp cc d kin dng phn tch, thng k cho cc nh nghin cu Nm; cng nh d chuyn i (migration) vo cc platform khc. Ti liu tham kho 1. Thiu Chu (1942). Hn-Vit T-in . Hanoi: uc Tu 2. Chen Ching-ho (1949). . Taipei, Taiwan: . 3. o Duy Anh (1975). Ch Nm: Ngun-gc, cu-to, din-bin. H Ni: Khoa Hc X Hi. 4. , Socit Asiatique, 1822 5. . A new practical Chinese-English dictionary. : . 6. (1988). , . , Tokyo, Japan. 7. Nguyn nh Ha (1992). Graphemic Borrowings from Chinese - The Case of Chu Nom - Vietnams Demotic Script. Taipei, Taiwan: Bulletin of the Institute of History and Philosophy, Volume 61, part 2.

    http://www.babelstone.co.uk/Software/BabelMap.htmlhttp://www.unicode.org/charts/unihan.htmlhttp://www.unicode.org/charts/unihan.htmlhttp://nomfoundation.org/unicode/lookups.phphttp://www.viethoc.org/hannom/bango_intro.php

  • 8. Nguyn Khc Kham (1974). Ch Nm or the Former Vietnamese Script and Its Past Contributions to Vietnamese Literature. Tokyo University, Japan: Area and Culture Series, volumn 24. 9. T in Ch Nm Trch Dn http://www.viethoc.org/hannom/tdnom_intro.php10. ng Th Kit, L Vn ng, Nguyn Hu Vinh, T in Hn Vit Thiu Chu online http://www.viethoc.org/hannom/tdtc_intro.php11. Trn Uyn Thi v Alexandre L, Bn g Hn Nm (HanNomIME) http://www.viethoc.org/hannom/bango_intro.php12. Trn Vn Gip (2002), Lc Kho Vn Ch Nm, L Vn ng thc hin vn bn,

    Ngy Nay Publishing, USA. 13. Nguyn Quang Hng (2002), Ch Nm trn ng hi nhp vi khu vc v th gii, Vin

    Nghin Cu Hn Nm, H Ni. 14. Nguyn T Nh (1998), Cc Phng Thc Biu m trong cu trc Ch Nm Vit, Nh

    Xut Bn Khoa Hc X Hi, H Ni. 15. Nguyn Hu Vinh (2005), Ch Nm v Tinh thn dn tc, Vin Vit Hc, California, USA

    http://www.viethoc.org/hannom/tdnom_intro.phphttp://www.viethoc.org/hannom/tdtc_intro.phphttp://www.viethoc.org/hannom/bango_intro.phphttp://www.unicode.org/