 As I record this video, I respectfully acknowledge that I'm standing on the unceded traditional territory of the Comox First Nation. And this video is about Unicode and some of the things that Jay does with Unicode that are interesting and Jay actually can display them. So let's start off with looking at an emoji. And this emoji, of course, as it turns out, is not Unicode. It's literal, and it's going to a UTF-8 encoding, which has four characters. So it's got that four character encoding to be literal. But at the same time, if I go to this one, it will look identical, it will look identical. But in this case, it's Unicode. But when I click on this, it shows me the Unicode encoding for straight Unicode 553735756832. So it's using those two numbers to provide a Unicode encoding, and those two numbers are called surrogates. Because in this version of Unicode, you can't represent every single Unicode code point. The way they do it is they split up a number or a section of it into surrogates. And if you use those two numbers together, that's how you get it to Unicode 4. So let's take a look at Unicode 4. And this one here looks the same on the text, but it's Unicode 4. And in Unicode 4, you don't need the surrogates. So your encode point is 128512. Now this is where Jay differs a little bit from strict Unicode. To strict Unicode, if you fed surrogate to Unicode 4, it would return an error. But in Jay, to be friendly, all these look the same, of course. But to Jay to be friendly, when you go to Unicode 4, you've got 5537, you're back to your surrogates. So it will actually display surrogates in Unicode 4 when, in fact, it probably shouldn't. Let's take a look. I think it was B is the other surrogate. Yeah. So you can see the difference in color, too. This is a much lighter yellow, which tells you immediately it's Unicode. And it's wider, so you know it's a surrogate. It's interesting that C, which is Unicode 4, and you see it's narrower because it's a single number, and D, which is Unicode 4, but it's a surrogate pair, we actually do C match D, even though they're both Unicode 4, they actually do not match because one's representing a surrogate and the other one is a straight code point. So that is kind of one of the things that can get kind of tricky with Unicode. So when you play with Unicode, there's a huge amount of power with it. You can come up with absolutely weird characters like this. As it turns out, that's an aboriginal character for Canadian First Nations, and I don't know exactly what it means, but it's the way that their language is represented. You can of course do all the different languages of the world, or many of the different languages of the world, and they're different characters. So there's 5574 as the code point for that one, and that wraps it up for Unicode. You can distinguish between Unicode and Unicode 4, and you can distinguish between surrogate pairs which are required for Unicode and surrogate pairs which are enabled for Unicode 4 but are not required. As I said, Unicode is its own special world, and if you decide to do a lot with Unicode, there's an awful lot to learn. But this is how Jig represents it.