Internationalized domain names: definition, challenges, and status
A linguistic revolution is underway, which spares no written language. While the number of spoken languages is estimated at around 7,000, linguists identify around 200 written languages. For a few of them, the Internet offers an unprecedented stimulus, while most languages entered into a form of resistance. In this context, the domain name system serves as an indicator of the progress in developing multilingualism in the digital world. What is an internationalized or multilingual domain name? What are the issues and the foundations? These are the questions to which we will begin to answer in this article.
Initially, the Internet only knew “A-Z”, “0-9” and the hyphen “-“. This group of characters is called the “American Standard Code for Information Interchange” (ASCII). A domain name with at least one non-ASCII character is an internationalized domain name (IDN) (IETF, RFC 5890, para. 22.214.171.124). As early as the mid-1990s, at the dawn of the globalization of the Internet, Domain Name System (DNS) engineers began to develop the protocols necessary for the internationalization of domain names. The process is complicated since it relies, in part, on the transliteration of a sequence of non-ASCII characters into a sequence of ASCII characters, preceded by the prefix “xn--” (called “ACE prefix”). After a few attempts, a protocol developed by the Internet Engineering Task Force (IETF) was approved by the Internet Engineering Steering Group (IESG) (RFC 5890, ISSN 2070-1721). Firstly, internationalized domain names were made available, and secondly, top-level domains. Today, the technique allows domain names to be registered in the form idn.idn.
The “processes of globalization, which have been facilitated by the rapid development of information and communication technologies, afford unprecedented conditions for enhanced interaction between cultures, they also represent a challenge for cultural diversity, namely in view of risks of imbalances between rich and poor countries” (Preamble of the UNESCO Convention of 20 October 2005 on the protection and promotion of the diversity of cultural expressions: unesco.org). Signed by more than 140 states, this convention entered into force on 18 March 2007 (NB: the United States has not signed it). The UNESCO Universal Declaration of 2 November 2001 on Cultural Diversity (unesco.org) already included declarations and objectives in this direction. Finally, the works resulting from the World Summit for the Information Society systematically reinforce the urgent need to promote cultural diversity to heal the fracture that had set in during the first decade of the Internet. Since domain names are digital addresses, their internationalization was inevitable. In this context, IDNs can help preserve and promote diverse knowledge, sometimes available in the only language understandable by the learner. IDNs can therefore play a central role. “In the information society, language, in addition to being a medium of communication, plays a socio-economic role similar to that of money in industrial society. While money is used to acquire material goods, language is used to obtain knowledge and immaterial goods” (A. El Zaïm, Avant-propos in D. Osborn, Les langues africaines à l’ère du numérique. Défis et opportunités de l’informatisation des langues autochtones, Presses de l’université de Laval, 2011, p. 2).
The stakes are such that UNESCO actively engages for a multilingual and inclusive Internet, against the idea of a linguistic hegemony which, by definition, would be exclusive. In this regard, UNESCO has entered into partnerships with ICANN and Eurid (the European top-level domain registry: eurid.eu). The first aims to support the introduction of IDNs “to support the introduction of top level Internationalized Domain Names (“IDNs”) in order to offer new opportunities and benefits for the Internet users around the world by allowing them to establish and use domains in their native languages and scripts” ( article 1) and “collaborate to explore the potential to assist developing countries in enhancing capacities to actively participate in building an inclusive and multilingual Internet” (ibid.) (ICANN-UNESCO Memorandum of Understanding, 10 December 2009). On this occasion, the President of ICANN recalled that “Over half the internet users around the world don’t use a Latin-based script as their native language. IDNs are about making the Internet more global and accessible for everyone” (unesco.org, 10 December 2012). UNESCO and ICANN’s partnership was strengthened the following year by a joint letter of intent (Letter of Intent between UNESCO and ICANN, 16 September 2010). The second partnership, with Eurid, takes the form of the publication of a joint annual report to analyze the development of IDNs (idnworldreport.eu).
However, the issues are not only cultural and linguistic. They also raise problems of an economic and societal nature that impact access to knowledge and, therefore, to development, in such a way and to such an extent that we are getting closer to stakes of power and sovereignty.
The evolution of internationalized ccTLDs is carried out mainly through ICANN’s fast-track procedure on 16 November 2009 (icann.org). The following table shows the IDN ccTLDs created as part of this procedure. Currently, all of these IDNs have passed the delegation phase, except xn – mgb2ddes اليمن (for Yemen) and xn – 4dbrk0ce ישראל (for Israel), whose delegation phase is underway.
ISO 3166-1 Reference Country/Territory IDN DZ Algeria xn--lgbbat1ad8j = الجزائر (Arabic / Arabic) AM Armenia xn--y9a3aq = հայ (Armenian / Armenian) BH Bahrain xn--mgbcpq6gpa1a = البحرين (Arabic / Arabic) BD Bangladesh xn--54b7fta0cc = বাংলা (Bangla / Bangla) BY Belarus xn--90ais = бел (Belarusian, Russian / Cyrillic) BG Bulgaria xn--90ae = бг (Bulgarian / Cyrillic) CN China xn--fiqs8S = 中国 (Chinese / Simplified Chinese) CN China xn--fiqz9S = 中國 (Chinese / Traditional Chinese) EG Egypt xn--wgbh1c = مصر (Arabic / Arabic) EU European Union xn--e1a4c = ею (Bulgarian / Cyrillic) EU European Union xn--qxa6a = ευ (Greek / Greek) GE Georgia xn--node = გე (Georgian / Georgian (Mkhedruli)) GR Greece xn--qxam = ελ (Greek / Greek) HK Hong Kong xn--j6w193g = 香港 (Chinese / Simplified Chinese) IL Israel xn--4dbrk0ce = ישראל (Hebrew / Hebrew) IN India xn--h2brj9c = भारत (Hindi / Devanagari) IN India xn--mgbbh1a71e = بھارت (Urdu / Arabic) IN India xn--fpcrj9c3d = భారత్ (Telugu / Telugu) IN India xn--gecrj9c = ભારત (Gujarati / Gujarati) IN India xn--s9brj9c = ਭਾਰਤ (Punjabi / Gurmukhi) IN India xn--45brj9c = ভারত (Bengali / Bengali) IN India xn--xkc2dl3a5ee0h = இந்தியா (Tamil / Tamil) IN India xn--2scrj9c = ಭಾರತ (Kannada / Kannada) IN India xn--rvc1e0am3e = ഭാരതം (Malayalam / Malayalam) IN India xn--45br5cyl = ভাৰত (Assamese / Bengali) IN India xn--3hcrj9c = ଭାରତ (Oriya / Oriya) IN India xn--mgbbh1a = بارت (Kashmiri / Arabic) IN India xn--h2breg3eve = भारतम् (Sanskrit / Devanagari) IN India xn--h2brj9c8c = भारोत (Santali / Devanagari) IN India xn--mgbgu82a = ڀارت (Sindhi / Arabic) IR Iran xn--mgba3a4f16a = ایران (Persian / Arabic) IQ Iraq xn--mgbtx2b = عراق (Arabic / Arabic) JO Jordan xn--mgbayh7gpa = الاردن (Al-Ordon) KZ Kazakhstan xn--80ao21a = қаз (Kazakh / Cyrillic) KR Korea xn--3e0b707e = 한국 (Korean / Hangul) LA Laos xn--q7ce6a = ລາວ (Lao / Lao) MO Macau xn--mix891f = 澳門 (Chinese / Traditional Chinese) MK North Macedonia xn--d1alf = мкд (Macedonian / Cyrillic) MY Malaysia xn--mgbx4cd0ab = مليسيا. (Malay / Arabic) MR Mauritania xn--mgbah1a3hjkrd = موريتانيا (Arabic / Arabic) MN Mongolia xn--l1acc = мон (Mongolian / Cyrillic) MA Morocco xn--mgbc0a9azcg = المغرب (Arabic / Arabic) OM Oman xn--mgb9awbf = عمان (Arabic / Arabic) PK Pakistan xn--mgbai9azgqp6j = پاکستان (Urdu / Arabic) PS Palestine xn--ygbi2ammx = فلسطين (Arabic / Arabic) QA Qatar xn--wgbl6a = قطر (Qatar) RU Russia xn--p1ai = рф (Russian / Cyrillic) SA Saudi Arabia xn--mgberp4a5d4ar = السعودية (Arabic / Arabic) RS Serbia xn--90a3ac = срб (Serbian / Cyrillic) SG Singapore xn--yfro4i67o = 新加坡 (Chinese / Han) SG Singapore xn--clchc0ea0b2g2a9gcd = சிங்கப்பூர் (Tamil / Tamil) LK Sri Lanka xn--fzc2c9e2c = ලංකා (Sinhalese / Sinhala) LK Sri Lanka xn--xkc2al3hye2a = இலங்கை (Tanil / Tamil) SD Sudan xn--mgbpl2fh = سودان (Arabic / Arabic) SY Syria xn--ogbpf8fl = سورية (Arabic / Arabic) TW Taiwan xn--kpry57d = 台灣 (Chinese / Simplified Chinese) TW Taiwan xn--kprw13d = 台湾 (Chinese / Traditional Chinese) TH Thailand xn--o3cw4h = ไทย (Thai / Thai) TN Tunisia xn--pgbs0dh = تونس (Arabic / Arabic) UA Ukraine xn--j1amh = укр (Ukrainian / Cyrillic) AE United Arab Emirats xn--mgbaam7a8h = امارات (Arabic / Arabic) YE Yemen xn--mgb2ddes = اليمن (Arabic / Arabic)
Source : icann.org
An internationalized TLD is not necessarily the translation or transliteration of an ASCII ccTLD. Indeed, an internationalized TLD can also be a generic TLD, identical to a generic word. In 2012, when ICANN called for new gTLDs, it received over 100 requests for internationalized generic extensions. For example, .com exists in several other scripts such as .คอม (Thai), .कॉम (Hindi), .كوم (Arabic), and .ком (Russian).
An internationalized TLD can also reflect a brandTLD. As the English language dominates the business world, there are relatively few internationalized brandTLDs. However, some markets are large and/or strategic enough to persuade companies to acquire an internationalized brandTLD. In doing so, companies send their marketing message in the language of the target consumer. This commercial strategy is based on the raison d’être of brands: the ability to identify the origin of products. This is, for example, the case of the following TLDs:. 大众 汽车 (.Volkswagen, in Chinese),. 飞利浦 (Philips, in Chinese),. 谷 歌 (.Google, in Chinese),. 诺基亚 (Nokia, in Chinese ) and. 亚马逊 /. ア マ ゾ ン (.Amazon, respectively in Chinese and Japanese). Some companies, whose historical headquarters are in a country where the writing system is different from the Latin system, have acquired internationalized brandTLDs. This is particularly the case of the Chinese companies such as Weibo (. 微 博, in Chinese), PCCW (. 電訊 盈科, in Chinese) and CITIC Group (. 中信, in Chinese), Hong Kong companies such as Shangri-la Hotels and Resorts (.香格里拉, in Chinese) and Kerry Logistics (. 嘉 里), the Singaporean company Temasek (. 淡 马 锡, in Chinese), the Saudi company Aramco (.ارامكو) or the Korean company Samsung (. 삼성, in Korean ).
Finally, whether it is a gTLD or a ccTLD, each registry decides which scripts and languages will be made available to the public. For example, among the gTLDs, .com is available in several languages, including Chinese, Arabic, French, Spanish, Japanese, and Korean (iana.org, Repository of IDN Practices). Likewise, it is possible to register .pizza, .career, .coffee, .school or .science domain names in several languages.
As for brandTLDs, each company decides according to its commercial policy. Global companies have a vested interest in making domain names available to their regional teams in an intelligible language to the target consumer. For example, .accenture, .bing, .shell, .swatch, .tiffany and .windows brandTLDs can be registered in many languages (iana.org, Repository of IDN Practices).