Test of all links

This commit is contained in:
Stephane Bortzmeyer 2023-06-11 16:17:27 +02:00
parent 85f4c9e1e5
commit c4589a4d45
9 changed files with 25 additions and 27 deletions

View File

@ -16,9 +16,9 @@ all: ${ALLHTML}
--output $@ ${STYLESHEET} $< && xmllint --noout --valid $@
install: all
cp -a ../SQL/* .
cp -a ../GaBuZoMeu/SQL/*/*.sql .
touch ${GOOGLEVERIF}
rsync -q -a ${ALLHTML} ${GOOGLEVERIF} ${IMAGES} ltru.css registries test-suites PostgreSQL SQLite ${WEBSERVER}
rsync -q -a ${ALLHTML} ${GOOGLEVERIF} ${IMAGES} ltru.css registries test-suites ${WEBSERVER}
clean:
rm -f *.html

4
TODO
View File

@ -1,4 +1,6 @@
* Check all the links
* Export database schemas (link to GaBuZoMeu?)
* Configure Web site
* Quote on the main page must be under all three columns
* HTTPS
* HTTPS for all links
* lsr.atom (requires some porting)

View File

@ -10,8 +10,7 @@
href="http://www.w3.org/International/articles/language-tags/">excellent
article from the W3C</a> about language tags</li>
<li>Also from the <wikipedia>W3C</wikipedia>, a <a href="http://www.w3.org/International/tutorials/language-decl/">tutorial "Declaring Language in XHTML and HTML"</a></li>
<li><a href="http://www.inter-locale.com">Other resources</a></li>
<li><a href="http://people.w3.org/rishida/utils/subtags/">Search subtags online in the registry</a> (some explanations are provided, but this tool is more useful if you know the concepts behind the registry)</li>
<li><a href="https://r12a.github.io/app-subtags/index.html">Search subtags online in the registry</a> (some explanations are provided, but this tool is more useful if you know the concepts behind the registry)</li>
<li>Browse <a href="http://unicode.org/cldr/utility/languageid.jsp">language tags</a> and see their meaning</li>
</ul>
</div>
@ -19,8 +18,6 @@ article from the W3C</a> about language tags</li>
<h2>For software developers</h2>
<ul>
<li>The subtag registry (version <em><lsr-version/></em>) in <a href="registries.html">various formats</a></li>
<li><a href="http://unicode.org/cldr/data/tools/java/org/unicode/cldr/util/data/langtagRegex.txt">Mark Davis' parser (ICU)</a>, written as a <wikipedia name="Regular expression">regexp</wikipedia> suitable for <wikipedia>Perl</wikipedia> or <wikipedia>Java</wikipedia></li>
<li><a href="http://www.dpawson.co.uk/java/rfc4646.html">Dave Pawson's implementation</a> in <wikipedia name="Java (programming language)">Java</wikipedia></li>
<li>Martin D&#xFC;rst's implementation in <wikipedia name="Ruby (programming language)">Ruby</wikipedia> is available as a <wikipedia name="RubyGems">Gem</wikipedia> named "langtag" at <wikipedia>RubyForge</wikipedia></li>
<li><a href="http://www.bortzmeyer.org/gabuzomeu-parsing-language-tags.html">St&#xE9;phane Bortzmeyer's implementation</a> in <wikipedia name="Haskell (programming language)">Haskell</wikipedia>.</li>
<li><a href="philips-regexp.html">Addison Phillips' code</a>, as a <wikipedia name="Regular expression">regexp</wikipedia> for <wikipedia name="Java (programming language)">Java</wikipedia></li>
@ -34,7 +31,7 @@ in <wikipedia>XSLT</wikipedia> (requires a XSLT 2.0 processor) and the
<div class="menu3">
<h2>For standard authors</h2>
<ul>
<li>The current standard: <a href="http://www.rfc-editor.org/rfc/rfc5646.txt">RFC 5646</a> and <a href="http://www.rfc-editor.org/rfc/rfc4647.txt">RFC 4647</a></li>
<li>The current standard: <a href="https://www.rfc-editor.org/info/rfc5646">RFC 5646</a> and <a href="https://www.rfc-editor.org/info/rfc4647">RFC 4647</a></li>
<li>How to <a href="register-new-subtag.html">register a new
subtag</a> (for instance, for a variant of a language),</li>
<li>A <a href="registries/lsr.atom">syndication feed</a> (<wikipedia name="Atom (standard)">Atom</wikipedia> format) of the changes in the registry</li>

View File

@ -39,7 +39,6 @@
<hr class="before-footer"/>
<p class="footer"><a href="index.html">Home</a>. Web site maintained by
<code><a href="mailto:webmaster@langtag.net">webmaster@langtag.net</a></code>.
Hosted by <a href="http://www.afnic.fr/">AFNIC</a>.
</p>
</body>
</html>

View File

@ -1,4 +1,4 @@
<?xml version="1.0" encoding="us-ascii"?>
<?xml version="1.0" encoding="utf-8"?>
<page title="How to register a new subtag">
<p>The <em><a href="registries.html">language subtag registry</a></em>
includes many <a href="whatare.html">subtags</a> identifying countries, languages or variants
@ -6,7 +6,7 @@ such as local dialects. Your favorite language and/or variant is
probably already there. But, if it is not, you can ask for the
registration of a new subtag. This text explains how, but the
complete and authoritative explanation is in <a
href="http://www.rfc-editor.org/rfc/rfc5646.txt">RFC 5646</a>,
href="https://www.rfc-editor.org/info/rfc5646">RFC 5646</a>,
specially its section 3.5, <a href="http://tools.ietf.org/html/rfc5646#section-3.5">Registration Procedure for Subtags</a>". It is
recommended to read at least this section.</p>
<p>Before you start, a warning: the process takes time, documentation
@ -30,7 +30,7 @@ you want the IETF to make another choice). We then concentrate on
</ul>
<p>The process is the following (it is a simplified version; did I
tell you to read the full story in <a href="http://tools.ietf.org/html/rfc5646#section-3.5">section 3.5</a> of <a
href="http://www.rfc-editor.org/rfc/rfc5646.txt">RFC 5646</a>?):</p>
href="https://www.rfc-editor.org/info/rfc5646">RFC 5646</a>?):</p>
<ol>
<li>Collect background information, typically references to published
descriptions of the language or dialect. A Wikipedia page is possible but
@ -79,7 +79,7 @@ fields are typically not used for a variant such as
Suppress-Script.</li>
<li>Send it to the mailing list <code><a
href="mailto:ietf-languages@iana.org">ietf-languages@iana.org</a></code>
(you may choose to <a href="http://www.alvestrand.no/mailman/listinfo/ietf-languages">subscribe to the mailing list</a> before, to get an
(you may choose to <a href="https://www.ietf.org/mailman/listinfo/ietf-languages">subscribe to the mailing list</a> before, to get an
idea of the people and discussions, and to be sure to have the
complete thread).</li>
<li>Reply to questions, address objections, be prepared to modify your
@ -130,7 +130,7 @@ clearly state that it is a variant of Middle French.</p>
<p>Let's see a second example from the same author, with the added
difficulty that we use <wikipedia>XML</wikipedia>-like encoding for
the composed characters (see section 3.1 of RFC 5646). This specificies the early modern French, as
described by the <wikipedia name="Academie francaise">french academy</wikipedia>:</p>
described by the <wikipedia name="Académie française">french academy</wikipedia>:</p>
<pre>
LANGUAGE SUBTAG REGISTRATION FORM
1. Name of requester: C. E. Whitehead

View File

@ -1,7 +1,7 @@
<page title="The registry in various formats">
<p>Files available here were automatically produced from the official
registry maintained by <wikipedia>IANA</wikipedia>. The current version of
registry maintained by <wikipedia name="https://en.wikipedia.org/wiki/Internet_Assigned_Numbers_Authority">IANA</wikipedia>. The current version of
the registry is <em><lsr-version/></em>.</p>
<ul>
@ -20,7 +20,8 @@ second):
</li>
<li>As <wikipedia>XML</wikipedia> :
<ul>
<li>According to <a href="registries/ltru.rnc">this schema</a> (written in <wikipedia>RelaxNG</wikipedia>): <a href="registries/language-subtag-registry.xml">registry in XML</a>, St&#xE9;phane Bortzmeyer's version</li>
<li>According to <a href="registries/ltru.rnc">this schema</a>
(written in <wikipedia name="RELAX NG">RelaxNG</wikipedia>): <a href="registries/language-subtag-registry.xml">registry in XML</a>, St&#xE9;phane Bortzmeyer's version</li>
</ul>
</li>
<li>As <wikipedia>SQL</wikipedia>. Since SQL is not really portable, there are

View File

@ -6,8 +6,8 @@ is always glad to receive bug reports, fixes and improvments.</p>
(computing)">patches</wikipedia> is to retrieve the source and work on
them.</p>
<p>You can retrieve the source of the pages for this Web site with
<wikipedia name="Subversion (software)">Subversion</wikipedia> at URL
<code>https://svn.langtag.net/langtag/</code>. These sources are in
<wikipedia name="Git">git</wikipedia> at URL
<code>https://forge.chapril.org/bortzmeyer/Web-LangTag</code>. These sources are in
<wikipedia>XML</wikipedia>, using a small superset of
<wikipedia>XHTML</wikipedia> in a <code>&lt;page&gt;</code> element.</p>
<p>The recommended method
@ -15,4 +15,4 @@ to request a change to the Web site is to send the <wikipedia
name="Patch (computing)">patches</wikipedia> in <code>diff -u</code>
format. If you cannot, send the modified XML page, and please try to
convince your XML editor to do as little reformatting as possible.</p>
</page>
</page>

View File

@ -13,17 +13,17 @@ written and spoken in <wikipedia>Australia</wikipedia>,</li>
<li><code>az-Latn-IR</code>, <wikipedia name="Azerbaijani language">Azeri</wikipedia> language,
written in the <wikipedia name="Latin alphabet">Latin</wikipedia> script, as used in <wikipedia>Iran</wikipedia>.</li>
</ul>
<p>They are specified in <wikipedia>IETF</wikipedia>
<p>They are specified in <wikipedia name="Internet Engineering Task Force">IETF</wikipedia>
<wikipedia name="Request for Comments">RFC</wikipedia> <em>5646</em> (<a
href="http://www.rfc-editor.org/rfc/rfc5646.txt">available
href="https://www.rfc-editor.org/info/rfc5646">available
online</a>).</p>
<p>Language tags are made of <em>subtags</em> separated by
hyphens. The list of possible subtags is mostly directly copied from
various <wikipedia>ISO</wikipedia> standards such as <wikipedia name="ISO
various <wikipedia name="International Organization for Standardization">ISO</wikipedia> standards such as <wikipedia name="ISO
639">ISO 639</wikipedia>.</p>
<p>They are used in many formats and protocols for instance in
<wikipedia>XML</wikipedia> (through the <code>xml:lang</code>
attribute) and in <wikipedia>HTTP</wikipedia> (the browser can
indicate to the <wikipedia>Web</wikipedia> server what language the
indicate to the <wikipedia name="World Wide Web">Web</wikipedia> server what language the
user prefers, should the Web server have several versions).</p>
</page>
</page>

View File

@ -43,9 +43,8 @@ is 'l' in <wikipedia name="English language">English</wikipedia> but
choosen depending on the language used.</p>
<p>Language tagging also allow filters to keep only some documents,
those written in a language that the user understands. At the present
time, most <wikipedia>search engines</wikipedia>, like
<wikipedia>Google</wikipedia>, use <a
href="http://www.macchiato.com/slides/unicode_at_google.ppt">heuristics</a>
time, most <wikipedia name="Search engine">search engines</wikipedia>, like
<wikipedia name="Google Search">Google</wikipedia>, use heuristics
to find out the language of a Web page. While it works fine to tell
apart <wikipedia name="German language">German</wikipedia> from
<wikipedia name="Japanese language">Japanese</wikipedia>, it is much