{"id":14038,"date":"2023-03-21T02:50:16","date_gmt":"2023-03-21T01:50:16","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/qalam-a-convention-for-morphological-arabic-latic-arabic-transliteration\/"},"modified":"2023-03-21T08:45:57","modified_gmt":"2023-03-21T07:45:57","slug":"qalam-a-convention-for-morphological-arabic-latic-arabic-transliteration","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/qalam-a-convention-for-morphological-arabic-latic-arabic-transliteration\/","title":{"rendered":"QALAM: A Convention For Morphological Arabic-Latic-Arabic Transliteration"},"content":{"rendered":"<p>==============================================================================<br \/>\nQalam: A Convention for Morphological<br \/>\nArabic-Latin-Arabic Transliteration<br \/>\n&#8212;<br \/>\nAbdelsalam Heddaya (heddaya@cs.bu.edu)<\/p>\n<p>with contributions from<\/p>\n<p>Walid Hamdy (hamdy@lids.mit.edu)<br \/>\nM. Hashem Sherif (mhs@homxa.att.com)<br \/>\n&#8212;<br \/>\nCreated: 1985.11<br \/>\nModified: 1986-1989 often<br \/>\nModified: 1990.01<br \/>\nModified: 1990.12.21<br \/>\nModified: 1990.12.31; accepted LAiLA upper case convention,<br \/>\nadded punctuation, ,<br \/>\nand<br \/>\nModified: 1991.01.23; added a couple of sentences.<br \/>\nModified: 1991.01.31; decided for<br \/>\nModified: 1991.08.22; cleaned up acknowledgements<br \/>\nModified: 1992.01.13; changed back to<br \/>\n&#8212;<br \/>\nDRAFT&#8212;DRAFT&#8212;DRAFT<br \/>\n&#8212;<\/p>\n<p>0. Introduction<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;<\/p>\n<p>Qalam is an Arabic-Latin-Arabic transliteration system between the<br \/>\nArabic script and the Latin script embodied in the ASCII (American<br \/>\nStandard Code for Information Interchange) character set. The goal of<br \/>\nthe Qalam system is to transliterate Arabic script for computer<br \/>\ncommunication by those literate in the language. The main<br \/>\nconsideration in the design of Qalam is suitability for<br \/>\ntransliteration, as well as reverse transliteration, both manually by<br \/>\nhumans and automatically by computers. Qalam also includes several<br \/>\nArabic script letters used to transliterate other languages *into*<br \/>\nArabic script. Finally, Qalam aims to serve all Arabic script<br \/>\nlanguages, such as Farsi, Urdu, and Ottoman.<\/p>\n<p>Qalam is a morphological system in the sense that Arabic script<br \/>\nwords are transliterated based on spelling and diacritics (the marks<br \/>\nthat represent vowels in Arabic), rather than on phonetics. This<br \/>\nmakes it easy to deduce the Arabic script word from its<br \/>\ntransliteration (i.e., to transliterate the word back into Arabic<br \/>\nscript). The pronounciation of words, however, can still be deduced<br \/>\nfrom the transliteration, because the (optional) inclusion of<br \/>\ndiacritic marks makes the transliterated word more pronouncable.<\/p>\n<p>We describe Qalam&#8217;s mapping between Arabic letters and diacritics<br \/>\nto ASCII characters. Each Arabic letter or diacritic maps into (and<br \/>\nback from) one or two ASCII characters. The choice is made in order<br \/>\nto approximate, as much as possible, the Arabic pronounciation, while<br \/>\nmaintaining the one-to-one morphological correspondence needed for<br \/>\nunambiguousness of reverse transliteration into Arabic script.<\/p>\n<p>Arabic script letters that do not correspond to Latin sounds are<br \/>\nrepresented with upper case letters or with two character sequences.<br \/>\nThus, Qalam uses upper-case ASCII characters to denote Arabic letters<br \/>\nthat are different from those denoted by the corresponding lower-case<br \/>\ncharacters. This convention deviates from the common practice of<br \/>\ninserting a dot beneath the letter or a dash above it.<\/p>\n<p>We give below the list of transliterations for Arabic letters and<br \/>\ndiacritics, followed by an example and a description of the rules of<br \/>\ntransliteration.<\/p>\n<p>1. Character Mappings:<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-<br \/>\n1.1. Letters:<br \/>\n&#8212;&#8212;&#8212;&#8212;-<\/p>\n<p>hamza &#8216;<br \/>\n&#8216;alef aa zayn z qaaf q<br \/>\nbaa&#8217; b syn s kaaf k<br \/>\ntaa&#8217; t shyn sh laam l<br \/>\nthaa&#8217; th Saad S mym m<br \/>\njym j Daad D nuwn n<br \/>\nHaa&#8217; H Taa&#8217; T haa&#8217; h<br \/>\nkhaa&#8217; kh Zaa&#8217; Z waaw w<br \/>\ndaal d `ayn ` yaa&#8217; y<br \/>\ndhaal dh ghayn gh<br \/>\nraa&#8217; r faa&#8217; f<\/p>\n<p>taa&#8217; marbuwTah t or h<br \/>\nhaa&#8217; marbuwTah h<\/p>\n<p>&#8216;alef maqSuwrah ae<br \/>\nhamzat alwaSl e<\/p>\n<p>1.2. Transliteration Letters:<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<\/p>\n<p>These are characters used in the Arabic script to represent or<br \/>\ntransliterate letters from other languages such as<br \/>\nEnglish, French, German, etc.<\/p>\n<p>Egyptian sound g (= Arabic script with bar<br \/>\nor dots, pronounced<br \/>\nor )<br \/>\nEnglish &#8220;v&#8221; sound v (= Arabic script with<br \/>\nthree dots)<br \/>\nEnglish &#8220;p&#8221; sound p (= Arabic script with<br \/>\nthree dots)<\/p>\n<p>1.3. Diacritics :<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<\/p>\n<p>fatHah a<br \/>\nkasrah i<br \/>\nDammah u<br \/>\nshaddah double previous letter<br \/>\nmaddah ~aa<br \/>\nsukuwn &#8211;<br \/>\ntanwyn N<\/p>\n<p>1.4. Punctuation:<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;-<\/p>\n<p>question mark ?<br \/>\ndouble quotes &lt;&gt;<br \/>\nsingle quotes<br \/>\n,<\/p>\n<p>2. Examples:<br \/>\n&#8212;&#8212;&#8212;&#8211;<\/p>\n<p>The Qalam transliteration of the first in the ,<br \/>\ncalled goes as follows:<\/p>\n<p>bismi ellaahi elraHmaani elraHym<\/p>\n<p>&#8216;alHamdu lillaahi rabbi el`aalamyn *<br \/>\nalraHmaani elraHym *<br \/>\nmaaliki yawmi eldyn *<br \/>\n&#8216;iyaaka na`budu wa&#8217;iyaaka nasta`yn *<br \/>\n&#8216;ihdinaa elSiraaTa elmustaqym *<br \/>\nSiraaTa alladhyna &#8216;an`amta `alayhim *<br \/>\nghayri elmaghDuwbi `alayhim *<br \/>\nwalaa alDaalyn *<\/p>\n<p>3. Qalam Rules and Conventions:<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-<\/p>\n<p>Transliterate a word by following its Arabic script spelling letter by<br \/>\nletter, as well as any available diacritics (i.e., or<br \/>\n), and substituting the specified Latin script. The only<br \/>\nfrequent exception is the in the definite article (i.e.,<br \/>\n), which is better to write as if it is a ,<br \/>\nor (, or ) as the case may be.<\/p>\n<p>Diacritics are optional unless they are necessary to disambiguate<br \/>\nthe original Arabic script spelling. For example, the verb<br \/>\nmay be written , because the ambiguity does not affect the<br \/>\noriginal Arabic script spelling. On the other hand,<\/p>\n<p>may stand<br \/>\nfor a as in the word or for a followed by a<br \/>\nas in , in which case the between the and<br \/>\nthe is necessary.<\/p>\n<p>The with a transliterates to if the<br \/>\nis above, and to if it is below. That is, it is treated as if it<br \/>\nis simply a with a or .<\/p>\n<p>The definite article (equivalent to &#8220;the&#8221; in English) should<br \/>\nnot be separated from the rest of the word by a hyphen; e.g.<br \/>\n, meaning &#8220;the sun.&#8221; Write the even if it is<br \/>\nsilent&#8211;. This is a case where literal transliteration is<br \/>\ngiven precedence over phonetic transliteration to make reverse<br \/>\ntransliteration easy.<\/p>\n<p>Observe word boundaries in the original Arabic, e.g.<br \/>\nis wrong, but is right.<\/p>\n<p>Arabic has no capitalization, and hence Arabic script<br \/>\ntransliterated by Qalam uses capitals to stand for letters that are<br \/>\ndifferent from those denoted by the corresponding lower case character.<\/p>\n<p>As a convention, we quote transliterated Arabic script text<br \/>\nembedded in another script with Arabic script quotation marks and vice<br \/>\nversa.<\/p>\n<p>4. Technical Discussion:<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;<\/p>\n<p>We would like to argue that Qalam is a superior code for communicating<br \/>\nArabic script text over data networks between heterogeneous computers.<br \/>\nQalam possesses the characteristics required of a good communication<br \/>\ncode: unambiguity, compactness, and simplicity of coding\/decoding.<\/p>\n<p>(((Compatibility, Human readability, Code efficiency. Existing<br \/>\ncodes.)))<\/p>\n<p>Qalam&#8217;s goals include supporting automatic transliteration by<br \/>\ncomputers, as well as manual transliteration for typing in Arabic<br \/>\nscript using Latin script available on ASCII terminals. This permits<br \/>\ncomputers that support the Arabic script directly to hide the<br \/>\ntransliterated text from the user. Thus, a personal computer user,<br \/>\nfor example, should be able to type in Arabic script a message, and<br \/>\nhave the machine transliterate it for submission to<br \/>\nsoc.culture.lebanon. Conversely, when this user receives an Arabic<br \/>\nscript message from soc.culture.lebanon, the computer would transliterate it<br \/>\nback into Arabic script for display. The above scenario should hold<br \/>\nequally true for text that mixes Latin and Arabic scripts.<\/p>\n<p>5. Bugs:<br \/>\n&#8212;&#8212;&#8211;<\/p>\n<p>The , should be distinct from the and<br \/>\nboth must differ from the . Qalam doesn&#8217;t provide for<br \/>\ntransliterating the written as a vertical bar shaped<br \/>\ndiacritic, as in archaic spellings of the . The only way to<br \/>\ndistinguish digraphs such as from the identically<br \/>\ntransliterated followed by , is to force the inclusion of<br \/>\na diacritic vowel between the two letters. Qalam needs a method to do<br \/>\nso without including the vowel, since it&#8217;s not always available in the<br \/>\noriginal Arabic script text.<\/p>\n<p>6. Acknowledgements:<br \/>\n&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<\/p>\n<p>Nayel el-Shafei provided the initial impetus for this work by<br \/>\nresearching the various transliteration systems in use in the US, and<br \/>\npublishing the results on egypt-net in July 1985. C.I. Browne<br \/>\n(cib%a@lanl.gov) provided, in August 1988, useful comments about the<br \/>\nplacement of &#8220;.&#8221; (no longer in use by Qalam) and pointed out that<br \/>\nwas missing in an earlier draft of Qalam. Ali Mili, of the<br \/>\nUniversity of Tunis, commented on an early version of Qalam.<\/p>\n<p>Stavros Macrakis pointed out the absence of a convention for and the old form of that appears as a vertical bar<br \/>\ndiacritic (e.g., in the ). The first problem has been<br \/>\ncorrected, but the second remains. In winter 1990\/91, a debate<br \/>\nsurfaced on USENET about transliterating Arabic text, one particular<br \/>\nproposal, called LAiLA, convinced us to use upper case Latin letters<br \/>\ninstead of special characters.<\/p>\n<p>References:<br \/>\n&#8212;&#8212;&#8212;-<\/p>\n<p>@article{Becker87,<br \/>\nAUTHOR = &#8220;J.D. Becker&#8221;,<br \/>\nTITLE = &#8220;Arabic word processing&#8221;,<br \/>\nJOURNAL = &#8220;Communications of the ACM&#8221;,<br \/>\nVOLUME = &#8220;30&#8221;,<br \/>\nNUMBER = &#8220;7&#8221;,<br \/>\nPAGES = &#8220;600&#8211;611&#8221;,<br \/>\nMONTH = &#8220;July&#8221;,<br \/>\nYEAR = &#8220;1987&#8221;}<\/p>\n<p>@article{Becker84,<br \/>\nAUTHOR = &#8220;J.D. Becker&#8221;,<br \/>\nTITLE = &#8220;Multilingual word processing&#8221;,<br \/>\nJOURNAL = &#8220;Scientific American&#8221;,<br \/>\nVOLUME = &#8220;251&#8221;,<br \/>\nNUMBER = &#8220;1&#8221;,<br \/>\nPAGES = &#8220;&#8221;,<br \/>\nMONTH = &#8220;July&#8221;,<br \/>\nYEAR = &#8220;1984&#8221;}<\/p>\n<p>==============================================================================<\/p>\n<div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-14038 jlk' href='javascript:void(0)' data-task='like' data-post_id='14038' data-nonce='9941108d62' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-14038 lc'>0<\/span><\/a><\/div><\/div> <div class='status-14038 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>============================================================================== Qalam: A Convention for Morphological Arabic-Latin-Arabic Transliteration &#8212; Abdelsalam Heddaya (heddaya@cs.bu.edu) with contributions from Walid Hamdy&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[27],"class_list":["post-14038","post","type-post","status-publish","format-standard","hentry","category-othernonsense","tag-english","wpcat-7-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/14038","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=14038"}],"version-history":[{"count":2,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/14038\/revisions"}],"predecessor-version":[{"id":14243,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/14038\/revisions\/14243"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=14038"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=14038"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=14038"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}