Factsheet Idn Fast Track 12jun09 en
IDNs: Internationalized Domain Names

Did you know?

• Internationalized Domain Names (IDNs) are domain names with characters other than a, b, …, z; 0, 1, …, 9; and “-”.
• IDNs on the second and third levels exist in some generic top-level domains (gTLDs) and in some country code top-level domains (ccTLDs). TLD registries determine the choice of characters available under these TLDs.
• Some languages, like Arabic, are written from right to left. IDNs at the second level are not very useful for the communities that use these languages because users would need to change the typing direction and script in the middle of entering a domain name.
• IDN TLDs will be made available through two separate processes, initially through the New gTLD Program and the IDN ccTLD Fast Track Process.
• 60 percent of Internet users are non-English speakers, while the dominant language used on the Internet is English (see Global Reach at www.glreach.com).

One of the most significant innovations in the Internet since its inception is the introduction of Internationalized Domain Names (IDNs) in the Internet’s address space. You may be familiar with some of the larger top-level domains: dot-com, dot-net, and dot-org are just three examples. IDN top-level domain names will offer many new opportunities and benefits for Internet users around the world by allowing them to establish and use domains in their native languages and scripts.

ICANN, the Internet Corporation for Assigned Names and Numbers, is responsible for managing and coordinating the Domain Name System (DNS) to ensure that every address is unique and that all of us who use the Internet can find all valid Internet addresses. It does this by overseeing the distribution of unique IP (Internet Protocol) addresses and domain names. It also ensures that each domain name maps to the correct IP address.

One of ICANN’s top priorities is its IDN Program. This program is made up of several initiatives that involve a significant cross-section of the global Internet community and are aimed at providing a secure method for implementing internationalized top-level domains in the root zone. Through this program, users will be able to register and use domain names based on their local language scripts. This includes users of languages based on right-to-left scripts such as Arabic and users of languages based on non-alphabetic scripts such as Mandarin Chinese. End users will be able to more easily identify localized online content by using fully localized identifiers. One example is http://실례.테스트 (“example.test” in Korean Hangul script).

Most of the program’s objectives are related to:
• The secure and stable technical and operational deployment of IDNs in the Domain Name System (DNS) at the root level.
• Engaging the global Internet community in the policy processes that will guide the deployment, allocation and secure management of IDNs.

What are IDNs?

IDN is the short name for an Internationalized Domain Name. These domain names are represented by local language characters. Such domain names could contain characters with diacritical marks as required by many European languages, or characters from non-Latin scripts (for example, Arabic or Chinese).

Although many websites contain multilingual content, if you look closely at the site’s web address on your browser, you will find IDNs represented only on the second level or lower levels. This representation happens in some gTLDs and some ccTLDs, and each registry decides which characters are available for registration in domain names. A map showing the IDN implementation across TLDs is available at http://www.icann.org/en/maps/idntld.htm.

Soon users around the globe will have access to IDN TLDs made up of non-ASCII characters to the right of the dot. ICANN is carrying out three initiatives to bring this about. Two are being implemented and one is at the policy development stage. These three processes are described briefly in the graphic on page 2.

IDN usage in emails (or, more specifically, in the domain name part of the email address) is not yet available. The technical standard that will make this possible is being developed by the Internet Engineering Task Force, or IETF. Some implementations are now available for users to try out. More information and test environments can be found at http://cslab.kr/eai-test/index.php/Main_Page, http://global-email.info/intro.html, and http://www.ietf.org/html.charters/eai-charter.html.

Why are IDNs important?

For many reasons. The most important is the growing number of Internet users around the world for whom it is difficult to use ASCII characters. The fact is, the Internet is accessed by more people who do not use Latin languages and scripts than those who do. This means that it is difficult for them to recognize ASCII characters and reproduce them on keyboards or use software to enter website addresses in browsers. For example:

In Implementation: IDN ccTLD Fast Track Process
• To introduce a limited number of IDN TLDs in non-Latin scripts
• Requires strings to be meaningful representations of country or territory names
• Only for countries or territories represented in the ISO 3166-1 list

In Policy Development: IDN ccTLDs – Long Term
• Full policy that caters for all
• Follows the ccNSO policy development process
• Was initiated formally on 13 April 2009 by the ccNSO Council http://www.icann.org/en/announcements/announcement-13apr09-en.htm

Work is under way to revise the IDNA protocol, based on experience since the 2003 introduction. The issues with the current IDN model that led to the revision work are discussed in RFC 4690. The Internet-drafts with proposed revisions are available at http://icann.org/topics/idn/rfc.htm.

The next IETF meeting where the IDNA protocol revision will be discussed is scheduled for Stockholm, Sweden, on 26-31 July 2009. http://www.ietf.org

[...] version 2.1, is available at http://icann.org/topics/idn/implementation-guidelines.htm. Additional work is under way by an IDN TLD registry working group to amend the guidelines further to ensure that the guideline directions will be used deeper into the DNS hierarchy, particularly as a set of principles for implementing internationalized top-level labels.

For a more detailed history of technical development see http://www.isoc.org/pubpolpillar/docs/i18n-dns-chronology.pdf.

To achieve a fully internationalized Internet, we need the collaborative involvement of technical and linguistic experts, policy-makers, application developers, end users and other stakeholders. ICANN looks forward to continuing to facilitate open forums for all interested parties.

Additional information about the IDN Program can be found at http://icann.org/topics/idn/. For more information on how to participate in IDN activities, contact [email protected].

About ICANN

ICANN was formed in 1998 to coordinate the Internet’s unique identifiers around the world. Without that coordination we wouldn’t have one global Internet. It is a not-for-profit public-benefit corporation with participants from all over the world dedicated to keeping the Internet secure, stable and interoperable. It promotes competition and develops policy on the Internet’s unique identifiers. ICANN doesn’t control content on the Internet. It cannot stop spam and it doesn’t deal with access to the Internet. But through its coordination role of the Internet’s naming system, it does have an important impact on the expansion and evolution of the Internet. For more information please visit: www.icann.org.

© 2009 Internet Corporation for Assigned Names and Numbers

Brussels 6 Rond Point Schuman, Bt. 5 B-1040 Brussels Belgium T +32 2 234 7870 F +32 2 234 7848
Marina del Rey 4676 Admiralty Way, Suite 330 Marina del Rey, CA 90292 USA T +1 310 823 9358 F +1 310 823 8649
Washington, DC 1875 I Street NW, 5th Floor Washington, DC 20006, USA T +1 202 429 2407 F +1 202 429 2714
Sydney Level 2, 48 Hunter Street Sydney NSW 2000 Australia T +61 2 8236 7900 F +61 2 8236 7913

Internationalized Domain Name Glossary

Historically, domain names on the Internet were restricted to using ASCII characters (a–z, 0–9, and “-”). However, with the increasing use of the Internet in all regions and by diverse linguistic groups around the world, the need for multilingual domain names has become more intense. Various acronyms are used widely in communications around internationalizing the domain name space. Explanations for these are provided below to make this topic simpler to understand. The glossary here is based on a broader, more dynamic online glossary of IDN-related terms available at http://www.icann.org/en/topics/idn/idn-glossary.htm.

ASCII (American Standard Code for Information Interchange): ASCII is a common numerical code for computers and other devices that work with text. Computers can only understand numbers, so an ASCII code is the numerical representation of a character such as “a” or “@”. When used in relation to ASCII TLDs or ASCII domain names, this refers to the fact that before internationalization only the letters a–z, digits 0–9, and the hyphen “-” were allowed in domain names.

DNS (Domain Name System): The DNS makes using the Internet easier by allowing a familiar string of letters (the domain name) to be used instead of the arcane IP address. So instead of typing 207.151.159.3, you can type www.internic.net.

IDNA (Internationalizing Domain Names in Applications) protocol: IDNA is a protocol defined in RFC 3490 by the Internet Engineering Task Force (http://www.ietf.org) that makes it possible for applications to handle domain names with non-ASCII characters. IDNA converts domain name strings with non-ASCII characters to ASCII domain name labels that applications that use the DNS can accurately understand. Not all characters used in the world’s languages will be available for use in domain names. Hence IDNA is not able to convert all such characters into ASCII labels.

IDN (Internationalized Domain Name): IDNs are domain names represented by local language characters (non-ASCII characters). Such domain names could contain characters with diacritical marks as required by many European languages, or characters from non-Latin scripts (for example, Arabic or Chinese).

IDN SLDs: Usually a reference for domain names with local characters at the second level, while the top level remains in ASCII-only characters. For example: [παράδειγμα.test] (“example.test” in Greek).

Languages | Scripts | Alphabets: Languages are used by speech communities. Scripts are used to write the various languages and this is done by using the corresponding alphabets or alternative writing systems.

LDH (Letter, Digit, Hyphen): A subset of the ASCII characters that only contains letters a-z, digits 0-9 and the hyphen “-”, and the term “LDH code points” usually refers to this subset. Originally, domain name labels were restricted to this subset of characters.

Punycode: This is the sequence of ASCII characters all IDNs will be encoded into in order for the Domain Name System (DNS) to understand and manage the names. The intention is that domain name registrants and users will never see this encoded form of a domain name. The sole purpose is for the DNS to be able to resolve, for example, a web address containing local characters. The DNS is only capable of handling ASCII characters. For example, the punycode version of [उदाहरण.परीक्षा] (this is the Hindi, in Devanagari script, version of “example.test”) is: xn--p1b6ci4b4b3a.xn--11b5bs3a9aj6g.

The prefix for the punycode version of the domain names is always “xn--”. Hence this prefix is often reserved at the registry level to avoid confusion in registration of IDNs.

The Unicode Consortium is a nonprofit organization founded to develop, extend and promote use of the Unicode standard. For membership and more information, please visit http://www.unicode.org. Unicode is a commonly used single encoding scheme that provides a unique number for each character across a wide variety of languages and scripts. The Unicode Tables hold the code points for each local character identified. These tables continue to expand as more and more characters are digitized.
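The Punycode, IDNA and LDH glossary entries describe a conversion between local-script domain names and an ASCII-only form that the DNS can handle. The following is a minimal sketch of that round trip, assuming Python's built-in "idna" codec, which follows the 2003 version of the protocol (RFC 3490) rather than the revision under discussion. The helper names `to_ascii`, `to_unicode` and `is_ldh_label` are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical helper names; only the "idna" codec itself ships with Python.
ACE_PREFIX = "xn--"  # prefix that marks a Punycode-encoded label

# LDH: letters, digits and hyphen, the character subset the glossary describes.
_LDH = re.compile(r"[A-Za-z0-9-]+")


def is_ldh_label(label: str) -> bool:
    """Return True if the label consists only of LDH characters."""
    return _LDH.fullmatch(label) is not None


def to_ascii(name: str) -> str:
    """Encode each label of a domain name into its ASCII (Punycode) form."""
    return name.encode("idna").decode("ascii")


def to_unicode(name: str) -> str:
    """Decode an ASCII domain name back into local-script characters."""
    return name.encode("ascii").decode("idna")


if __name__ == "__main__":
    encoded = to_ascii("उदाहरण.परीक्षा")
    print(encoded)              # ASCII form that is stored in the DNS
    print(to_unicode(encoded))  # local-script form shown to users
    print(all(is_ldh_label(label) for label in encoded.split(".")))
```

Encoding the Hindi name from the Punycode entry should give back the ASCII string quoted there, and every encoded label contains only LDH characters, which is why the DNS itself needs no changes to carry it. The character check is deliberately limited to the subset named in the glossary and does not enforce other hostname rules such as label length.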