Test of all links
This commit is contained in:
parent
85f4c9e1e5
commit
c4589a4d45
4
Makefile
4
Makefile
@ -16,9 +16,9 @@ all: ${ALLHTML}
|
||||
--output $@ ${STYLESHEET} $< && xmllint --noout --valid $@
|
||||
|
||||
install: all
|
||||
cp -a ../SQL/* .
|
||||
cp -a ../GaBuZoMeu/SQL/*/*.sql .
|
||||
touch ${GOOGLEVERIF}
|
||||
rsync -q -a ${ALLHTML} ${GOOGLEVERIF} ${IMAGES} ltru.css registries test-suites PostgreSQL SQLite ${WEBSERVER}
|
||||
rsync -q -a ${ALLHTML} ${GOOGLEVERIF} ${IMAGES} ltru.css registries test-suites ${WEBSERVER}
|
||||
|
||||
clean:
|
||||
rm -f *.html
|
||||
|
4
TODO
4
TODO
@ -1,4 +1,6 @@
|
||||
* Check all the links
|
||||
* Export database schemas (link to GaBuZoMeu?)
|
||||
* Configure Web site
|
||||
* Quote on the main page must be under all three columns
|
||||
* HTTPS
|
||||
* HTTPS for all links
|
||||
* lsr.atom (requires some porting)
|
@ -10,8 +10,7 @@
|
||||
href="http://www.w3.org/International/articles/language-tags/">excellent
|
||||
article from the W3C</a> about language tags</li>
|
||||
<li>Also from the <wikipedia>W3C</wikipedia>, a <a href="http://www.w3.org/International/tutorials/language-decl/">tutorial "Declaring Language in XHTML and HTML"</a></li>
|
||||
<li><a href="http://www.inter-locale.com">Other resources</a></li>
|
||||
<li><a href="http://people.w3.org/rishida/utils/subtags/">Search subtags online in the registry</a> (some explanations are provided, but this tool is more useful if you know the concepts behind the registry)</li>
|
||||
<li><a href="https://r12a.github.io/app-subtags/index.html">Search subtags online in the registry</a> (some explanations are provided, but this tool is more useful if you know the concepts behind the registry)</li>
|
||||
<li>Browse <a href="http://unicode.org/cldr/utility/languageid.jsp">language tags</a> and see their meaning</li>
|
||||
</ul>
|
||||
</div>
|
||||
@ -19,8 +18,6 @@ article from the W3C</a> about language tags</li>
|
||||
<h2>For software developers</h2>
|
||||
<ul>
|
||||
<li>The subtag registry (version <em><lsr-version/></em>) in <a href="registries.html">various formats</a></li>
|
||||
<li><a href="http://unicode.org/cldr/data/tools/java/org/unicode/cldr/util/data/langtagRegex.txt">Mark Davis' parser (ICU)</a>, written as a <wikipedia name="Regular expression">regexp</wikipedia> suitable for <wikipedia>Perl</wikipedia> or <wikipedia>Java</wikipedia></li>
|
||||
<li><a href="http://www.dpawson.co.uk/java/rfc4646.html">Dave Pawson's implementation</a> in <wikipedia name="Java (programming language)">Java</wikipedia></li>
|
||||
<li>Martin Dürst's implementation in <wikipedia name="Ruby (programming language)">Ruby</wikipedia> is available as a <wikipedia name="RubyGems">Gem</wikipedia> named "langtag" at <wikipedia>RubyForge</wikipedia></li>
|
||||
<li><a href="http://www.bortzmeyer.org/gabuzomeu-parsing-language-tags.html">Stéphane Bortzmeyer's implementation</a> in <wikipedia name="Haskell (programming language)">Haskell</wikipedia>.</li>
|
||||
<li><a href="philips-regexp.html">Addison Phillips' code</a>, as a <wikipedia name="Regular expression">regexp</wikipedia> for <wikipedia name="Java (programming language)">Java</wikipedia></li>
|
||||
@ -34,7 +31,7 @@ in <wikipedia>XSLT</wikipedia> (requires a XSLT 2.0 processor) and the
|
||||
<div class="menu3">
|
||||
<h2>For standard authors</h2>
|
||||
<ul>
|
||||
<li>The current standard: <a href="http://www.rfc-editor.org/rfc/rfc5646.txt">RFC 5646</a> and <a href="http://www.rfc-editor.org/rfc/rfc4647.txt">RFC 4647</a></li>
|
||||
<li>The current standard: <a href="https://www.rfc-editor.org/info/rfc5646">RFC 5646</a> and <a href="https://www.rfc-editor.org/info/rfc4647">RFC 4647</a></li>
|
||||
<li>How to <a href="register-new-subtag.html">register a new
|
||||
subtag</a> (for instance, for a variant of a language),</li>
|
||||
<li>A <a href="registries/lsr.atom">syndication feed</a> (<wikipedia name="Atom (standard)">Atom</wikipedia> format) of the changes in the registry</li>
|
||||
|
@ -39,7 +39,6 @@
|
||||
<hr class="before-footer"/>
|
||||
<p class="footer"><a href="index.html">Home</a>. Web site maintained by
|
||||
<code><a href="mailto:webmaster@langtag.net">webmaster@langtag.net</a></code>.
|
||||
Hosted by <a href="http://www.afnic.fr/">AFNIC</a>.
|
||||
</p>
|
||||
</body>
|
||||
</html>
|
||||
|
@ -1,4 +1,4 @@
|
||||
<?xml version="1.0" encoding="us-ascii"?>
|
||||
<?xml version="1.0" encoding="utf-8"?>
|
||||
<page title="How to register a new subtag">
|
||||
<p>The <em><a href="registries.html">language subtag registry</a></em>
|
||||
includes many <a href="whatare.html">subtags</a> identifying countries, languages or variants
|
||||
@ -6,7 +6,7 @@ such as local dialects. Your favorite language and/or variant is
|
||||
probably already there. But, if it is not, you can ask for the
|
||||
registration of a new subtag. This text explains how, but the
|
||||
complete and authoritative explanation is in <a
|
||||
href="http://www.rfc-editor.org/rfc/rfc5646.txt">RFC 5646</a>,
|
||||
href="https://www.rfc-editor.org/info/rfc5646">RFC 5646</a>,
|
||||
specially its section 3.5, <a href="http://tools.ietf.org/html/rfc5646#section-3.5">Registration Procedure for Subtags</a>". It is
|
||||
recommended to read at least this section.</p>
|
||||
<p>Before you start, a warning: the process takes time, documentation
|
||||
@ -30,7 +30,7 @@ you want the IETF to make another choice). We then concentrate on
|
||||
</ul>
|
||||
<p>The process is the following (it is a simplified version; did I
|
||||
tell you to read the full story in <a href="http://tools.ietf.org/html/rfc5646#section-3.5">section 3.5</a> of <a
|
||||
href="http://www.rfc-editor.org/rfc/rfc5646.txt">RFC 5646</a>?):</p>
|
||||
href="https://www.rfc-editor.org/info/rfc5646">RFC 5646</a>?):</p>
|
||||
<ol>
|
||||
<li>Collect background information, typically references to published
|
||||
descriptions of the language or dialect. A Wikipedia page is possible but
|
||||
@ -79,7 +79,7 @@ fields are typically not used for a variant such as
|
||||
Suppress-Script.</li>
|
||||
<li>Send it to the mailing list <code><a
|
||||
href="mailto:ietf-languages@iana.org">ietf-languages@iana.org</a></code>
|
||||
(you may choose to <a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages">subscribe to the mailing list</a> before, to get an
|
||||
(you may choose to <a href="https://www.ietf.org/mailman/listinfo/ietf-languages">subscribe to the mailing list</a> before, to get an
|
||||
idea of the people and discussions, and to be sure to have the
|
||||
complete thread).</li>
|
||||
<li>Reply to questions, address objections, be prepared to modify your
|
||||
@ -130,7 +130,7 @@ clearly state that it is a variant of Middle French.</p>
|
||||
<p>Let's see a second example from the same author, with the added
|
||||
difficulty that we use <wikipedia>XML</wikipedia>-like encoding for
|
||||
the composed characters (see section 3.1 of RFC 5646). This specificies the early modern French, as
|
||||
described by the <wikipedia name="Academie francaise">french academy</wikipedia>:</p>
|
||||
described by the <wikipedia name="Académie française">french academy</wikipedia>:</p>
|
||||
<pre>
|
||||
LANGUAGE SUBTAG REGISTRATION FORM
|
||||
1. Name of requester: C. E. Whitehead
|
||||
|
@ -1,7 +1,7 @@
|
||||
<page title="The registry in various formats">
|
||||
|
||||
<p>Files available here were automatically produced from the official
|
||||
registry maintained by <wikipedia>IANA</wikipedia>. The current version of
|
||||
registry maintained by <wikipedia name="https://en.wikipedia.org/wiki/Internet_Assigned_Numbers_Authority">IANA</wikipedia>. The current version of
|
||||
the registry is <em><lsr-version/></em>.</p>
|
||||
|
||||
<ul>
|
||||
@ -20,7 +20,8 @@ second):
|
||||
</li>
|
||||
<li>As <wikipedia>XML</wikipedia> :
|
||||
<ul>
|
||||
<li>According to <a href="registries/ltru.rnc">this schema</a> (written in <wikipedia>RelaxNG</wikipedia>): <a href="registries/language-subtag-registry.xml">registry in XML</a>, Stéphane Bortzmeyer's version</li>
|
||||
<li>According to <a href="registries/ltru.rnc">this schema</a>
|
||||
(written in <wikipedia name="RELAX NG">RelaxNG</wikipedia>): <a href="registries/language-subtag-registry.xml">registry in XML</a>, Stéphane Bortzmeyer's version</li>
|
||||
</ul>
|
||||
</li>
|
||||
<li>As <wikipedia>SQL</wikipedia>. Since SQL is not really portable, there are
|
||||
|
@ -6,8 +6,8 @@ is always glad to receive bug reports, fixes and improvments.</p>
|
||||
(computing)">patches</wikipedia> is to retrieve the source and work on
|
||||
them.</p>
|
||||
<p>You can retrieve the source of the pages for this Web site with
|
||||
<wikipedia name="Subversion (software)">Subversion</wikipedia> at URL
|
||||
<code>https://svn.langtag.net/langtag/</code>. These sources are in
|
||||
<wikipedia name="Git">git</wikipedia> at URL
|
||||
<code>https://forge.chapril.org/bortzmeyer/Web-LangTag</code>. These sources are in
|
||||
<wikipedia>XML</wikipedia>, using a small superset of
|
||||
<wikipedia>XHTML</wikipedia> in a <code><page></code> element.</p>
|
||||
<p>The recommended method
|
||||
@ -15,4 +15,4 @@ to request a change to the Web site is to send the <wikipedia
|
||||
name="Patch (computing)">patches</wikipedia> in <code>diff -u</code>
|
||||
format. If you cannot, send the modified XML page, and please try to
|
||||
convince your XML editor to do as little reformatting as possible.</p>
|
||||
</page>
|
||||
</page>
|
||||
|
10
whatare.xml
10
whatare.xml
@ -13,17 +13,17 @@ written and spoken in <wikipedia>Australia</wikipedia>,</li>
|
||||
<li><code>az-Latn-IR</code>, <wikipedia name="Azerbaijani language">Azeri</wikipedia> language,
|
||||
written in the <wikipedia name="Latin alphabet">Latin</wikipedia> script, as used in <wikipedia>Iran</wikipedia>.</li>
|
||||
</ul>
|
||||
<p>They are specified in <wikipedia>IETF</wikipedia>
|
||||
<p>They are specified in <wikipedia name="Internet Engineering Task Force">IETF</wikipedia>
|
||||
<wikipedia name="Request for Comments">RFC</wikipedia> <em>5646</em> (<a
|
||||
href="http://www.rfc-editor.org/rfc/rfc5646.txt">available
|
||||
href="https://www.rfc-editor.org/info/rfc5646">available
|
||||
online</a>).</p>
|
||||
<p>Language tags are made of <em>subtags</em> separated by
|
||||
hyphens. The list of possible subtags is mostly directly copied from
|
||||
various <wikipedia>ISO</wikipedia> standards such as <wikipedia name="ISO
|
||||
various <wikipedia name="International Organization for Standardization">ISO</wikipedia> standards such as <wikipedia name="ISO
|
||||
639">ISO 639</wikipedia>.</p>
|
||||
<p>They are used in many formats and protocols for instance in
|
||||
<wikipedia>XML</wikipedia> (through the <code>xml:lang</code>
|
||||
attribute) and in <wikipedia>HTTP</wikipedia> (the browser can
|
||||
indicate to the <wikipedia>Web</wikipedia> server what language the
|
||||
indicate to the <wikipedia name="World Wide Web">Web</wikipedia> server what language the
|
||||
user prefers, should the Web server have several versions).</p>
|
||||
</page>
|
||||
</page>
|
||||
|
@ -43,9 +43,8 @@ is 'l' in <wikipedia name="English language">English</wikipedia> but
|
||||
choosen depending on the language used.</p>
|
||||
<p>Language tagging also allow filters to keep only some documents,
|
||||
those written in a language that the user understands. At the present
|
||||
time, most <wikipedia>search engines</wikipedia>, like
|
||||
<wikipedia>Google</wikipedia>, use <a
|
||||
href="http://www.macchiato.com/slides/unicode_at_google.ppt">heuristics</a>
|
||||
time, most <wikipedia name="Search engine">search engines</wikipedia>, like
|
||||
<wikipedia name="Google Search">Google</wikipedia>, use heuristics
|
||||
to find out the language of a Web page. While it works fine to tell
|
||||
apart <wikipedia name="German language">German</wikipedia> from
|
||||
<wikipedia name="Japanese language">Japanese</wikipedia>, it is much
|
||||
|
Loading…
Reference in New Issue
Block a user