Answered Unicode In Database Not Displaying Properly on View?

Mitchelln11 · Jul 31, 2020

I am using the National Park Service REST API to pull park info which I am saving to a SQL database. It saves with the unicode value. One of the parks in Hawai'i is Haleakalā National Park, which is fine, only whenever I want it to display, it shows exactly that unicode and not the actual symbol. ( ā ) It is supposed to be an a with a slash over the top. I believe it's Polynesian. My site is already set to UTF-8, but it doesn't work from the database. If I copy and paste the @#257; anywhere else on a view, it shows up properly.

Any thoughts? Thanks.

Skydiver · Jul 31, 2020

I've never heard of a database that uses HTML/XML entities to Unicode characters. That is a first for me.

Are you sure that the data that went into the database was not HTML or XML encoded?

Attach a debugger, look at the string that comes back from the database query. If the value was truly a 16-bit Unicode code point, then Visual Studio inspectors (local and autos) will display the string with the a. Now if the data stored in the database had that HTML encoded value ("ā"), then the inspectors would show those entity encoding digits.

Mitchelln11 · Jul 31, 2020

Skydiver said:
I've never heard of a database that uses HTML/XML entities to Unicode characters. That is a first for me.

Are you sure that the data that went into the database was not HTML or XML encoded?

Attach a debugger, look at the string that comes back from the database query. If the value was truly a 16-bit Unicode code point, then Visual Studio inspectors (local and autos) will display the string with the a. Now if the data stored in the database had that HTML encoded value ("ā"), then the inspectors would show those entity encoding digits.

Looking back at Postman, and even that says

C#:

"name": "Haleakal&#257;"

Skydiver · Jul 31, 2020

If that's the case, then you need to convert the HTML entity into an actual Unicode character before putting it into your database.

HttpUtility.HtmlDecode Method (System.Web)

Converts a string that has been HTML-encoded for HTTP transmission into a decoded string. To encode or decode values outside of a web application, use the WebUtility class.

docs.microsoft.com

Skydiver · Jul 31, 2020

That's really sad that the API does that though. I've only started skimming the API documentation, but it looks like the API returns JSON. JSON supports Unicode strings. I don't know why they HTML encoded that character since there is no need to.

JohnH · Aug 1, 2020

There's also WebUtility Class (System.Net) that can do this.

Mitchelln11 · Aug 2, 2020

Skydiver said:
That's really sad that the API does that though. I've only started skimming the API documentation, but it looks like the API returns JSON. JSON supports Unicode strings. I don't know why they HTML encoded that character since there is no need to.

Could that possibly be a typo? In other parts of the API, the symbols are directly in there. It's just the name that's like that.

Also, yes, decoding did the job, so thank you for this!

C#:

park.ParkName = HttpUtility.HtmlDecode(individualPark.FullName);

JohnH · Aug 2, 2020

Thread split, start new threads for new topics.

why .Update duplicating a record?

Do you know why .Update would be duplicating a record? I am running the REST call to save info to my Parks database, but before I save it, I am checking the park code to see if it's already in the database. Is this even the right code? It does hit the else statement, content just doesn't get...

csharpforums.net

Skydiver · Aug 2, 2020

More likely a data conversion error combined with a proofreading error. The NPS paid someone to do the data conversion from their original data format, and the person reviewing the data/person in charge of quality assurance just missed it.

My company had a similar issue when it contracted out converting all employee's resumes from the old data format that HR kept stuff in (it was essentially flat US CodePage 437) to go into a more modern Oracle database. Unfortunately, HR again short-sightedly picked a specific code page, instead of going Unicode. Anyway during the conversion process, various people's names, company names, bullet symbols, etc. got mangled. Unfortunately, they were blaming us, the web team for not correctly displaying the data because they kept asserting that they followed the modern Unicode standard. I had to prove to them that their data was not Unicode.

Answered Unicode In Database Not Displaying Properly on View?

Mitchelln11

Active member

Skydiver

Mitchelln11

Active member

Skydiver

HttpUtility.HtmlDecode Method (System.Web)

Skydiver

JohnH

C# Forum Moderator

Mitchelln11

Active member

JohnH

C# Forum Moderator

why .Update duplicating a record?

Skydiver

Similar threads

Share this page

Latest posts