Web-LangTag/philips-regexp.xml

15 lines
918 B
XML

<page title="Addison Phillips Java regexp for language tags">
<p>Here is a <wikipedia>regular expression</wikipedia> to parse the <em>future</em> versions of
<a href="index.html">language tags</a>. Suitable for the syntax of the RFC 5646. Written by Addison Phillips, <code>addison - at - amazon.com</code> for the <wikipedia name="Java (programming language)">Java programming language</wikipedia>.</p>
<pre>
static final String langtag_ex =
"(\\A[xX]([\\x2d]\\p{Alnum}{1,8})*\\z)"
+ "|(((\\A\\p{Alpha}{2,8}(?=\\x2d|\\z)){1}"
+ "(([\\x2d]\\p{Alpha}{3})(?=\\x2d|\\z)){0,3}"
+ "([\\x2d]\\p{Alpha}{4}(?=\\x2d|\\z))?"
+ "([\\x2d](\\p{Alpha}{2}|\\d{3})(?=\\x2d|\\z))?"
+ "([\\x2d](\\d\\p{Alnum}{3}|\\p{Alnum}{5,8})(?=\\x2d|\\z))*)"
+ "(([\\x2d]([a-wyzA-WYZ](?=\\x2d))([\\x2d](\\p{Alnum}{2,8})+)*))*"
+ "([\\x2d][xX]([\\x2d]\\p{Alnum}{1,8})*)?)\\z";
</pre>
</page>