Very often, people who are not used to language
tagging hesitate before choosing a tag for a given
combination of language, region, script, etc. You can find the result
of these hesitations on the Web, with people tagging, for instance,
jp
(the proper
subtag for the japanese language is ja
,
jp
is for the country) or using subtags that are not
registered
because they did not find the valid
ones. The purpose of this small text is to explain how to find a
subtag registered in the Language Subtag Registry. There are
many ways to do so, of course, and you are free to report
better ways.
The first thing to try is probably to use Richard Ishida's
Language Subtag Registry Search. You can enter text which appears
in the Description or Comments field of the registry and the
corresponding subtags will be displayed. For instance, for japanese,
it will correctly report ja
(and the script
Jpan
, which indicates the mix of
A more powerful, but probably less user-friendly, method, is to use
the registry directly. Since its canonical form is more adapted to
computer programs than to humans (for instance,
Cans
.
Both Richard Ishida's Web service and the above method have a
limit: they only use information that is in the registry. If the relationship between common
names and the tags is not in the registry, you will not find it. For
instance, if you want to identify "en-GB
(not en-UK
) from
subtags in the registry. Similarly, if you want to
write texts in
You have to use external tools, but please check their results
against the registry, with the tools mentioned above. A good search
tool is
gsw
(
For languages (but not scripts or dialects), another very useful
source is Ethnologue, which is
managed by the http://www.ethnologue.com/show_language.asp?code=gsw
, where there is the
comment: "Called 'Schwyzerdütsch' in Switzerland, and 'Alsatian' in France". But be careful:
Ethnologue displays only 3-letters code, while the registry
uses 2-letters code whenever they are available. For instance, fr
, not fra
.
TODO: endonyms and exonyms