Post #355,104
3/7/12 10:44:01 AM
|
XML question on "disapble-output-escaping"
I've got some XML that has special characters in it -- "<" and ">" primarily, though also some "&". Right now these are displayed as escaped entities on the web page -- "<" and ">" for example.
I can "fix" this by putting disable-output-escaping="yes" on the nodes, but lots of what I'm reading says that's the wrong fix; that if the special characters were escaped properly in the XML then they would render without having to special-case the nodes.
Anybody dealt with this and have a good answer?
--
Drew
|
Post #355,113
3/7/12 2:58:55 PM
3/7/12 7:53:03 PM
|
Re: XML question on "disapble-output-escaping"
http://www.w3schools.../ref_entities.asp
Try < for < and > for > inside the XML?
Edit: I'm a dumbass.
-Mike
"They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety."
- Benjamin Franklin, 1759 Historical Review of Pennsylvania
Edited by mvitale
March 7, 2012, 07:53:03 PM EST
|
Post #355,116
3/7/12 5:43:22 PM
|
Uhh ... not getting what you're saying
--
Drew
|
Post #355,117
3/7/12 6:34:09 PM
|
He's having the inverse problem ;-)
I suspect the first entity of both sets started life as an & escaped sequence. Submitting the post converted the second, and now both are rendered in final form by the browser.
|
Post #355,118
3/7/12 7:53:15 PM
|
Check again. Sorry.
-Mike
"They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety."
- Benjamin Franklin, 1759 Historical Review of Pennsylvania
|
Post #355,123
3/7/12 9:38:49 PM
|
That's (kind of) what I'm thinking
If individual nodes need to be special-cased, they should have been encoded/escaped differently before passing to the XSL.
So when you're storing XML in a database (not just a string that happens to have tags, but column type XML) shouldn't the html entities be encoded? The source data I'm looking at has nodes in the DB like:
<nodename>The number 5 is > the number 4</nodename>
And after passing through the XSL, it renders as >
--
Drew
|
Post #355,124
3/7/12 9:52:04 PM
|
Not nearly that simple.
A lot of XML ingestors require entities to be declared in the DTD. We have that problem with one of our external feeds; we ingest it with a Java library that is quite fussy, but they generate the XML with simple string concatenation. :-/
If the code generating the XML is using the same DTD as the code transforming it, you may be in the lucky position of just having to declare the entiries and everything Will Just Work. Of course, the odds of it being that simple are not high.
Wade.
|
Post #355,125
3/7/12 10:13:59 PM
|
Yeah, not at all ... well, maybe it is
We're assembling data from multiple sources -- and by multiple, I mean thousands. But we're building the XML, so it's not like we have to worry about different encoding schemes from the sources.
But leaving aside the particulars of XSL transforms, shouldn't the entities be escaped before saving? Seeing XML with greater-than symbols inside node content just feels wrong.
--
Drew
|
Post #355,126
3/7/12 11:27:49 PM
|
Ah...
A greater-than sign is technically legal in XML. But it annoys and upsets most programmers. :-)
IMO, the place to escape problem characters in XML is where the data is turned into XML nodes. Not before. Not after.
Wade.
|
Post #355,127
3/7/12 11:57:48 PM
|
It's legal? Well crap, this argument just got a lot harder
I'm the one saying we should escape it before it goes into the database, which matches your "when it becomes a node". But that means they have to change the import process, which is deeply magical and finicky. (Don't get me started. [No really ... don't.])
--
Drew
|
Post #355,128
3/8/12 4:26:33 AM
|
"Into the database"?
It's stored as XML in a database? [raised eyebrow] Well, I have seen wierder things...
Wade.
|
Post #355,131
3/8/12 9:27:36 AM
|
Umm ... yeah
We have tons of XML in the database. And I think some people might be starting to see why that's maybe not such a good idea.
--
Drew
|
Post #355,133
3/8/12 9:56:35 AM
|
? Its been a few years since I went to xml dtd school
but storing strings in a database is kind of missing the point of dtd's and formatting
just my 2 cents
Any opinions expressed by me are mine alone, posted from my home computer, on my own time as a free American and do not reflect the opinions of any person or company that I have had professional relations with in the past 55 years. meep
|
Post #355,137
3/8/12 10:32:59 AM
3/8/12 10:33:16 AM
|
It's not a string
XML is a real datatype. You can do XPATH queries against it. (Which breaks your brain, by the way. And don't get me started on namespaces.) I've even seen databases that are entirely XML based.
--
Drew
Edited by drook
March 8, 2012, 10:33:16 AM EST
|
Post #355,140
3/8/12 12:07:02 PM
|
not my point
storing xml in a database is storing strings from a database point of view as opposed to using xml as designed
Any opinions expressed by me are mine alone, posted from my home computer, on my own time as a free American and do not reflect the opinions of any person or company that I have had professional relations with in the past 55 years. meep
|
Post #355,141
3/8/12 12:50:17 PM
|
Well ...
The way we use it, it's a way of stuffing multiple data structures into one field instead of having multiple child tables.
We have multiple products. Each product generates a "result", which is a collection of data. The data structure depends on the product. Rather than have a table for each product, we simply store every result as an XML value in the Result column of the Order table.
Besides saving us the incredible overhead of an extra join to the appropriate result table, this also allows us to change the data structure for a product without having to change the table. Simply write a new XML format into the Result column and you're done.
Don't point out that when you do that our XSLs to render reports has to be updated to recognize the two data structures and render each appropriately. Please also don't point out that since you can't create indices on nodes within the XML field you can't generate summaries of product outcomes. (eg: What percentage of product X yielded results in county Y?)
--
Drew
|
Post #355,142
3/8/12 3:41:14 PM
|
I guess that's better than binary blobs.
And I am *so* glad the previous developers at work never thought of that.
I'm still cleaning up from someone's fixation with TEXT fields. And then there's still a few FLOATs lurking around carrying currency. And Unix timestamps instead of DATETIME. Or in one case, instead of just DATE.
Wade.
|
Post #355,129
3/8/12 7:51:11 AM
|
Based on the first line and the rest of the thread
It sounds like you have multiple XSLs in sequence. The first one to touch the special characters turns them into properly escaped entities, but then a subsequent transform blindly escapes every ampersand it comes across.
If that is correct, then storing them as escaped sequences in the DB will not make a difference either.
In any case, I'd take a look at the transforms next.
|
Post #355,132
3/8/12 9:28:56 AM
|
That's a good point
I'm pretty sure everything is going through multiple levels of transforms before getting to the final "formatting" ones that output the HTML. Hmm ... more research to do.
--
Drew
|