Do strange characters like “�” or “ö” show up in your application? Handling non-English characters in application code, files and databases can be a challenge, to say the least. Whether it’s German umlauts, Cyrillic letters, Asian glyphs or emojis, it’s always a mess in an international application. In this session you will see why that is and how character handling evolved in computing. You will also see how to handle characters in applications and databases less painfully. And don’t worry if EBCDIC, BOM or ISO-8859-7 are Greek to you and your Unicode is a bit rusty: we’ll have a look at them too!

Comments

Excellent talk: both informative and entertaining! Learned several things I had no idea about regarding how Unicode is stored and interpreted, especially regarding MySQL handling.
Cheers, Andreas!

Sean Wallis at 13:57 on 10 Jun 2017

A really engaging talk. Very informative with excellent slides full of useful information. The content and advice given will really help our team. Thanks Andreas

A very good and insightful talk that gave a nice overview with appropriate deeper dives into encoding and character sets in a good presentation style.

Wim Godden at 15:31 on 10 Jun 2017

Really good talk! Insightful!

Liam Wiltshire at 15:50 on 10 Jun 2017

A really great talk - well paced, informative and with good humour.

Mark Railton at 16:31 on 10 Jun 2017

Fantastic talk that gives an overview of character encoding and how it affects the applications that we write. Also gave some great tips on how to best implement and make sure it works for our systems.

Tim Stamp at 20:54 on 10 Jun 2017

???????????
.
.
L̨̛̰̩̭̬̺͖̔̎̈́́̔͌o̭͉̰͓̤͉̣̗̓͊̂̄̈̑͢v̶̛̯͎͕̱̘̪̤͗̅͌͟͟͡ē̶̛̲̭̖̙̮̜̞͓͓̩̄́͘ḑ̸̳̬̭̦̭̲̪̾́̾́̂̅̇̚͜ t̷̠̹̘͎͊̽̃͒̏̇͜͠h̡̝̦̩̜̣͎͍͖̿̆̌̑͋ȇ̻̰̖̠͉̖̺̈́͆̃͋̎̀͞͝ t̻̹̮̯̱̥̞̲̂́̉̃̈̀̋̐͘͞a̵̢̭̯͚͉̥͕͈͗̊̓̄͡͡ĺ̴̨̡̛̙̪͙̹͚̅̈̚̚͝͝ͅk̟̬͉̱͈̯͙͍̘̒̌͒̅̈́̀̌̔͘̕,̯̞̻̭̜͔̇̊̒̚͠ g̶̨̠̜̟̗̖̀̉̓͊̋o̴͕̩̬̰̱͒̂͑̇̑̃͌o̡̤̳̗̒͗̇̀͗̃ͅd͖̻̺͓̠͈̙̫͙̉͌̑͊̅̐́ j̶̭̝̲̼̽̎̃͆͌̏̓̈́́͜͟ơ̷͉̻̜̖͖̫͑̔͋͡͞b̷͇̲̰͓̘͎̰͆͌̍̐́̑̕͟͞!̵̣̪̭̙̩͉̇̏̃̋̀͟
.
.
?
??????

Luna Chen at 22:31 on 10 Jun 2017

Amazing speech, Andreas! Learned a lot! Very useful tips on handling special characters in databases! Thank you so much!

Chris Emerson at 22:40 on 10 Jun 2017

This was really interesting - a complex topic made straightforward with great explanations, some tips for supporting Unicode properly in applications (and MySQL!) and more. I look forward to putting it into practice!

Peter Fisher at 23:22 on 10 Jun 2017

Great talk. The really handy stuff for me was the MySQL stuff.
Very enjoyable

Very informative and entertaining talk on what turns out to be a massive subject.

It's always great when speakers not only show you why you are doing things incorrectly but also how to fix things.

Lee Boynton at 19:23 on 11 Jun 2017

A very informative talk on a subject I imagine a lot of people simply gloss over or avoid. It was easy enough to follow for the most part.

It was good to find out exactly how confusing MySQL is when it comes to Unicode. I hadn't realised how many places the encoding needs to be specified in, or why each one needs to be set.

Thanks Andreas!

Gary Jones at 00:29 on 12 Jun 2017

Really enjoyable talk. Very educational, clearing up misconceptions and teaching me things I've always wondered about!

Naomi Gotts at 19:45 on 12 Jun 2017

Really, really good talk. Well explained and presented - my understanding of character encoding is sooo much better!

Mark Dain at 20:56 on 12 Jun 2017

Brilliant talk, thank you!

My only suggestion would be perhaps mentioning some of the problems with counting glyphs, e.g. mb_strlen("??") will result in 2 (at least as of PHP 7). In Swift 4, however, this should result in 1, as I think it uses Unicode 9 rules. What's a good tactic for accurately counting characters when emojis are involved?
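
On the counting question above, here is a minimal Python sketch (Python rather than PHP or Swift, purely for illustration) of why the numbers can differ. `len()` on a Python string counts code points, much as `mb_strlen` does for UTF-8 input, while one user-perceived character (a grapheme cluster) may span several code points. The helper names `count_graphemes_approx`, `is_skin_tone_modifier` and `utf16_units` are hypothetical, and the grapheme count is a deliberate simplification: it only glues combining marks, zero-width joiners and skin-tone modifiers to the preceding character. It is not a full implementation of the Unicode text segmentation rules (flags built from regional indicators, for instance, are not handled). The sample strings are illustrative and are not the ones from the comment.

```python
import unicodedata

ZWJ = "\u200d"  # zero-width joiner: glues several emoji into one visible symbol


def is_skin_tone_modifier(ch: str) -> bool:
    """True for the five emoji skin-tone modifiers U+1F3FB..U+1F3FF."""
    return 0x1F3FB <= ord(ch) <= 0x1F3FF


def utf16_units(text: str) -> int:
    """Number of UTF-16 code units (what JavaScript's .length reports)."""
    return len(text.encode("utf-16-le")) // 2


def count_graphemes_approx(text: str) -> int:
    """Rough count of user-perceived characters.

    Assumption: a simplified subset of the segmentation rules is enough
    for the samples below. Use a proper library for real input.
    """
    count = 0
    joined = False  # True when the previous code point was a ZWJ
    for ch in text:
        if ch == ZWJ:
            joined = True
            continue
        # Combining marks (categories Mn, Mc, Me) and skin-tone modifiers
        # attach to the preceding character instead of starting a new one.
        if unicodedata.category(ch).startswith("M") or is_skin_tone_modifier(ch):
            continue
        if joined:
            # This character was joined to the previous cluster by a ZWJ.
            joined = False
            continue
        count += 1
    return count


samples = {
    "e + combining acute": "e\u0301",
    "thumbs up + skin tone": "\U0001F44D\U0001F3FD",
    "family (ZWJ sequence)": "\U0001F468\u200D\U0001F469\u200D\U0001F467",
}

for label, text in samples.items():
    print(
        f"{label}: code points={len(text)}, "
        f"UTF-16 units={utf16_units(text)}, "
        f"UTF-8 bytes={len(text.encode('utf-8'))}, "
        f"approx. graphemes={count_graphemes_approx(text)}"
    )
```

For real-world input, a library that implements the Unicode text segmentation rules is the safer choice; in PHP, the intl extension's `grapheme_strlen` is one such option.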

Great talk. Entertaining and useful.

Neil Nand at 23:02 on 15 Jun 2017

Great talk, for me it was the best one of the conference. Learnt a lot, and a fair amount of it is applicable to what I do & will be doing.

Mike Simonson at 09:51 on 20 Jun 2017

Very good talk.