5 replies [Last post]
jsabarese
jsabarese's picture
Offline
Enthusiast
Positively 4th Street
Last seen: 10 years 22 weeks ago
Positively 4th Street
Timezone: GMT-4
Joined: 2005-02-05
Posts: 404
Points: 7

In my ongoing quest to find the most comprehensive, "one-size fits-all" resource for NCR's and/ or HTML Entities, i've managed to gather somewhat of an obsessively diverse collection of resources on the subject.

(many of these have proved to lead to learning about other cool stuff along the way -- if you're spongelike in your browsing, you might get something out of it regardless of your previous knowledge of NCR's 'n their ilk)

From what i've found so far, I18N Guy's site seems to be the most complete list on a single page. still, something missing from one or the other (or in my blind spot)
Enjoy!

FYI: Perhaps the most thoroughly Educational of the resources i'll list today is FileFormat dot Info , which suggests, for example, that this forum will display only "Decimal" type Entity encodings, but not "Hexidecimal" or "Named", at least not for those that i provided as examples in the list above! Now, don't tell your momma i never gave you nuthin'!

[/]
Tags:
Chris..S
Chris..S's picture
Offline
Moderator
Last seen: 10 years 1 week ago
Timezone: GMT+1
Joined: 2005-02-22
Posts: 6078
Points: 173

Character Entity; Numeric Character Reference; entities

I have heard browser support can be problematic outside the basic set. Personally, I'd advocate moving to utf-8 if you need to deal a lot with special characters, glyphs, symbols, scripts, runes, etc, etc.

jsabarese
jsabarese's picture
Offline
Enthusiast
Positively 4th Street
Last seen: 10 years 22 weeks ago
Positively 4th Street
Timezone: GMT-4
Joined: 2005-02-05
Posts: 404
Points: 7

Character Entity; Numeric Character Reference; entities

Chris..S wrote:
I have heard browser support can be problematic outside the basic set. Personally, I'd advocate moving to utf-8
indeed... on both points!
crazy stuff for sure... not at all cinchy... as far as i'm concerned, but yes-- UTF-8 would seem the logical approach, however, beware if you're a PostgreSQL 8.0 user on Windows that there are some special considerations.

my guess is that "someday" UTF-8 will be the [only] way to go, and we won't have these issues (such as "why does this php / mysql forum read only the decimal encoding of some HTML entites [at least what i tested quite briefly], yet it doesn't the hex or named-- yet, W3C didn't validate my decimal encodings, in favor of Hex! ugh! \:D/

C.S: note clarification [only]

Chris..S
Chris..S's picture
Offline
Moderator
Last seen: 10 years 1 week ago
Timezone: GMT+1
Joined: 2005-02-22
Posts: 6078
Points: 173

Character Entity; Numeric Character Reference; entities

Some day?

utf-8 has been around since 1992. Its been supported by common webbrowsers for a long time. IE4 understands it, so that's 1997.

The surprising thing for me is that people who use languages other than english haven't been pushing harder for it - but I guess they would need to convert their documents.

jsabarese
jsabarese's picture
Offline
Enthusiast
Positively 4th Street
Last seen: 10 years 22 weeks ago
Positively 4th Street
Timezone: GMT-4
Joined: 2005-02-05
Posts: 404
Points: 7

Character Entity; Numeric Character Reference; entities

Chris..S wrote:
Some day?... I guess they would need to convert their documents.

looks like you answered your own question there, eh?

Chris..S
Chris..S's picture
Offline
Moderator
Last seen: 10 years 1 week ago
Timezone: GMT+1
Joined: 2005-02-22
Posts: 6078
Points: 173

Character Entity; Numeric Character Reference; entities

I guess - but they are the ones who will benefit. The only character I use with any regularity that is not common to iso-8859 and utf-8 is "£". And I would think you never have cause to use one in normal day-to-day stuff.