Maven 1

maven 1.1-beta-1 chokes on XML Entities for non-US characters in project.xml

Details

  • Type: Bug
  • Status: Closed
  • Priority: Major
  • Resolution: Duplicate
  • Affects Version/s: 1.1-beta-1
  • Fix Version/s: None
  • Component/s: model
  • Labels: None
  • Number of attachments: 0

Description

To keep project.xml readable across operating systems and parsers (even when the site plugin turns it into HTML), we have used entities for non-US characters in project.xml. The XML parser used in Maven 1.1 chokes on references to these entities (but not on the entity declarations). This is very unfortunate, as using entities for abbreviations and symbols is perfectly legal XML.

Example: won't work with 1.1:

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE project [
<!ENTITY OSlash "ø">
<!ENTITY CapitalOSlash "Ø">
]>
<project>
<pomVersion>3</pomVersion>
...
<developers>
<developer>
<name>Marit Finne J&OSlash;rgensen</name>
<id>mfj</id>
</developer>
</developers>
....
</project>

Example: workaround for 1.1, which has cross-system compatibility issues:

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE project [
<!ENTITY OSlash "ø">
<!ENTITY CapitalOSlash "Ø">
]>
<project>
<pomVersion>3</pomVersion>
...
<developers>
<developer>
<name>Marit Finne Jørgensen</name>
<id>mfj</id>
</developer>
</developers>
....
</project>

The XML parser chokes on the usage of the XML entity (the &OSlash; reference inside 'J&OSlash;rgensen'), not on its definition.
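
For comparison, here is a minimal sketch of how a standards-conforming parser is expected to treat the failing example. It uses the JAXP parser bundled with the JDK, not the parser Maven 1.1 ships with, so it only illustrates that the document itself is legal XML. The class name EntityDemo and the method parseDeveloperName are hypothetical names made up for this illustration, and the entity value is written as a numeric character reference (an assumption, to keep the sample independent of file encoding).

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical demo class; not part of Maven.
class EntityDemo {
    // Trimmed-down project.xml from the report. The entity value uses a
    // numeric character reference (assumption) to avoid encoding issues.
    static final String SAMPLE =
        "<?xml version=\"1.0\"?>\n"
      + "<!DOCTYPE project [\n"
      + "<!ENTITY OSlash \"&#248;\">\n"
      + "]>\n"
      + "<project>\n"
      + "<developers><developer>\n"
      + "<name>Marit Finne J&OSlash;rgensen</name>\n"
      + "<id>mfj</id>\n"
      + "</developer></developers>\n"
      + "</project>\n";

    // Parses the XML with the JDK's default DOM parser and returns the text
    // of the first <name> element, with entity references expanded.
    static String parseDeveloperName(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            // Expansion is the default; set explicitly to make the intent clear.
            factory.setExpandEntityReferences(true);
            DocumentBuilder builder = factory.newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(xml)));
            return doc.getElementsByTagName("name").item(0).getTextContent();
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse sample XML", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(parseDeveloperName(SAMPLE));
    }
}
```

With a conforming parser the &OSlash; reference should be replaced by the declared character in the returned name; the parser in 1.1-beta-1 fails at that reference instead.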

Activity

Trygve Laugstol added a comment -

All the standard HTML entities should be supported, so use &oslash; for 'ø'.

Brett Porter added a comment -

We do need to support a Xerces-based parser, though, for backwards compatibility...

