Blog

> Unicode, Character Sets, and Data Types

April 24, 2020

Overview

Project Information
Unicode, Character Sets, and Data Types Research

Progress

  • Tried to better understand character sets and data types
  • Found random useful sites

Useful Links

Reflection

Took another random deep dive into understanding character encoding and binary data. I've been mocked a few times already this year for not understanding this stuff, so now's good motivation. I need to figure out how to do these things to decode a binary file for my research.

Here is a very sassy and savage article on character encoding:
The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)

Some research notes:

Bytes, ASCII, Unicode, Big/Little Endian

UTF-8

Java Datatypes

Format Characters

Reading files in Python

Handling binary data

Ascii Table