LearnOpenGL

My notes from learning OpenGL.

Encoding

Glossary

Word	Meaning	Example
Codepoint	Numerical value that maps to a specific character	\u0000
Codepage	Collection of codepoints	GBK; GB2312; Unicode;
Encoding	How codepoints are represented as bytes	UTF-8; UTF-16; ANSI;

Common Regulations

The following table shows which encoding a string uses, depending on where it is physically stored:

Physical Location	Encoding
Network	UTF-8
Disk	UTF-8
Memory(Windows)	UTF-16
Memory(Linux)	UTF-8

Note: These encodings are NOT mandated by any standard! They are conventions that apply only to MY program!

The following table shows which encoding a string uses, depending on its storage class:

Class	Encoding
std::string	UTF-8
std::wstring(Windows)	UTF-16
std::wstring(Linux)	UTF-32
std::u8string	UTF-8
std::u16string	UTF-16
std::u32string	UTF-32

Note: These encodings are NOT mandated by any standard! They are conventions that apply only to MY program!

Note: These classes know nothing about encodings. They store raw code units, so they CAN'T handle variable-length encodings on their own: size() and indexing count code units, not characters.
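To illustrate the note above, here is a minimal sketch showing that std::string counts bytes rather than characters. The helper name count_codepoints is hypothetical (it is not part of this repository), and it assumes the input is already valid UTF-8.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: counts codepoints in a UTF-8 string by skipping
// continuation bytes (bit pattern 10xxxxxx). Assumes valid UTF-8 input.
std::size_t count_codepoints(const std::string& utf8) {
    std::size_t count = 0;
    for (unsigned char byte : utf8) {
        if ((byte & 0xC0) != 0x80) {
            ++count;  // lead byte or ASCII byte starts a new codepoint
        }
    }
    return count;
}
```

For a string holding two CJK characters encoded as UTF-8, size() reports 6 while count_codepoints reports 2.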

Locale

Before using locale-dependent C APIs, we need to set the correct locale so that the correct codepage and encoding are used. The following code does this:

setlocale(LC_ALL, ".UTF-8");

After that, locale-dependent functions such as mbstowcs and wcstombs will return properly converted strings.

Note: Remember to set the source and execution encoding for the compiler as well; for example, use /utf-8 for MSVC.
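As a rough sketch of the conversion described above, the following sets a UTF-8 locale and then widens a UTF-8 string with mbstowcs. The helper names set_utf8_locale and widen are hypothetical. The fallback list of locale names is an assumption: as far as I know ".UTF-8" is accepted by the Windows UCRT, while other platforms usually want a name such as "C.UTF-8" or "en_US.UTF-8". Passing a null destination to mbstowcs to query the length is supported by POSIX and MSVC but is not guaranteed by ISO C.

```cpp
#include <clocale>
#include <cstddef>
#include <cstdlib>
#include <initializer_list>
#include <string>

// Hypothetical helper: tries a few common UTF-8 locale names and
// returns true as soon as the C runtime accepts one of them.
bool set_utf8_locale() {
    for (const char* name : {".UTF-8", "C.UTF-8", "en_US.UTF-8"}) {
        if (std::setlocale(LC_ALL, name) != nullptr) {
            return true;
        }
    }
    return false;
}

// Hypothetical helper: converts a multibyte string to a wide string
// using the current C locale. Returns an empty string on failure.
std::wstring widen(const std::string& narrow) {
    // First pass: ask how many wide characters are needed.
    std::size_t length = std::mbstowcs(nullptr, narrow.c_str(), 0);
    if (length == static_cast<std::size_t>(-1)) {
        return std::wstring();  // invalid sequence for the current locale
    }
    std::wstring wide(length, L'\0');
    // Second pass: perform the actual conversion.
    std::mbstowcs(&wide[0], narrow.c_str(), length);
    return wide;
}
```

If set_utf8_locale() succeeds, widen applied to the two-byte UTF-8 sequence for "é" yields a single wide character; without a UTF-8 locale the same call may fail or produce one wide character per byte.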

std::locale::global

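The following call is not needed for the C APIs above. They read the C locale set by setlocale, not the C++ global locale, which defaults to the classic "C" locale. std::locale::global only matters for C++ facilities, such as streams constructed after the call.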

std::locale::global(std::locale(".UTF-8"));

Common Character Sets

ANSI

ANSI (in Windows terminology) is a codepage-based encoding method that defines how codepoints are converted to bytes and back. The same bytes denote different characters under different codepages, so to identify a character from its codepoint you must know which codepage is in use.

Language specific codepages:

Codepage	Language
GB2312	Simplified Chinese
GBK	Simplified Chinese
GB18030	Simplified Chinese
Big5	Traditional Chinese
S-JIS	Japanese

ANSI codepages can be different on different computers, or can be changed for a single computer, leading to data corruption. For the most consistent results, applications should use Unicode.

Unicode

While ANSI applies the same encoding method to many different codepages, Unicode has a single codepage but several different encodings.

Encodings:

Encoding	Length	Range	Bytes per Codepoint
UTF-8	Variable	Full Unicode Codepage	1 ~ 4
UCS-2	Fixed	\u0000 ~ \uFFFF	2
UTF-16	Variable	Full Unicode Codepage	2 / 4
UTF-32(UCS-4)	Fixed	Full Unicode Codepage	4
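The sizes in the table can be checked with a small encoder. The sketch below is illustrative only: encode_utf8 and encode_utf16 are hypothetical names, not functions from this repository, and both return an empty string for surrogate values and for values above U+10FFFF.

```cpp
#include <cstdint>
#include <string>

// Hypothetical helper: encodes one codepoint as UTF-8 (1 to 4 bytes).
std::string encode_utf8(std::uint32_t cp) {
    std::string out;
    if (cp <= 0x7F) {
        out += static_cast<char>(cp);
    } else if (cp <= 0x7FF) {
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp <= 0xFFFF) {
        if (cp >= 0xD800 && cp <= 0xDFFF) return out;  // surrogates are not characters
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp <= 0x10FFFF) {
        out += static_cast<char>(0xF0 | (cp >> 18));
        out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    }
    return out;
}

// Hypothetical helper: encodes one codepoint as UTF-16 (1 or 2 code units).
std::u16string encode_utf16(std::uint32_t cp) {
    std::u16string out;
    if (cp <= 0xFFFF) {
        if (cp >= 0xD800 && cp <= 0xDFFF) return out;  // surrogates are not characters
        out += static_cast<char16_t>(cp);
    } else if (cp <= 0x10FFFF) {
        cp -= 0x10000;  // 20 bits remain, split across a surrogate pair
        out += static_cast<char16_t>(0xD800 | (cp >> 10));
        out += static_cast<char16_t>(0xDC00 | (cp & 0x3FF));
    }
    return out;
}
```

Codepoints up to \uFFFF fit in one UTF-16 code unit, which is exactly the range UCS-2 can represent; anything above needs a surrogate pair, which is why UTF-16 is variable length and UCS-2 is not.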

Sequence

Shader

sequenceDiagram
    participant F as File
    participant O as Object
    participant P as Program
    participant S as Shader

    %% Init------------------------------------
    Note over S: Shader Init
    S -->>+F: ResourceLocation

    par Async
        F ->>+ O: Compile/Use Cache
        O -->> F: Cache
        deactivate F
    end

    O ->> P: Link
    deactivate O

    P ->>+ S: ProgramId

    Note over S: Use Shader

    %% Reloaded--------------------------------
    par EventHandler
        Note over F: Evt: ResourceReloaded
        activate F

        F ->>+ O: Compile
        O -->> F: Cache
        deactivate F

        O ->> P: Link
        deactivate O
    end

    P ->> S: ProgramId
    S -->> S: DeleteOld
    Note over S: Evt: ShaderReloaded
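The reload half of the diagram above could be sketched roughly as follows. Everything here is hypothetical and not taken from the repository: ShaderBackend stands in for the compile and link steps (in a real build it would wrap calls such as glCompileShader, glLinkProgram and glDeleteProgram), and keeping the old program when the new one fails to build is my assumption, since the diagram does not show a failure path.

```cpp
#include <functional>
#include <string>
#include <utility>

// Hypothetical backend hooks so the sketch runs without an OpenGL context.
struct ShaderBackend {
    // Compiles and links the source; returns a program id, or 0 on failure.
    std::function<unsigned(const std::string& source)> compile_and_link;
    // Releases a previously returned program id.
    std::function<void(unsigned program)> delete_program;
};

class Shader {
public:
    Shader(ShaderBackend backend, const std::string& source)
        : backend_(std::move(backend)),
          program_(backend_.compile_and_link(source)) {}

    ~Shader() {
        if (program_ != 0) backend_.delete_program(program_);
    }

    Shader(const Shader&) = delete;
    Shader& operator=(const Shader&) = delete;

    // Handler for the ResourceReloaded event: build the new program first,
    // then swap it in and delete the old one (the DeleteOld step).
    // Returns true if the swap happened.
    bool on_resource_reloaded(const std::string& new_source) {
        unsigned fresh = backend_.compile_and_link(new_source);
        if (fresh == 0) return false;  // assumption: keep old program on failure
        unsigned old = program_;
        program_ = fresh;
        if (old != 0) backend_.delete_program(old);
        return true;
    }

    unsigned program_id() const { return program_; }

private:
    ShaderBackend backend_;  // declared first so it is initialized first
    unsigned program_;
};
```

Building the replacement before deleting the old program means the shader stays usable during and after a failed reload.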

Resource

sequenceDiagram
    participant R as ResourceLocation
    participant M as ResourceManager
    participant E as EmbeddedProvider
    participant F as FileProvider

    R ->> M: ResourceKey
    M ->> M: FindProviderByDomain

    M -->> E: ResourceKey
    E ->> E: Offset, Size
    E ->> M: FileStream

    M -->> F: ResourceKey
    F ->> M: FileStream

    M ->> R: FileStream
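The lookup in the diagram above might look something like the sketch below. All types and members are hypothetical stand-ins, not the repository's actual classes: ResourceKey is assumed to be a domain plus a path, the embedded data is assumed to be one blob indexed by (offset, size), and std::istream stands in for FileStream.

```cpp
#include <cstddef>
#include <fstream>
#include <istream>
#include <map>
#include <memory>
#include <sstream>
#include <string>
#include <utility>

struct ResourceKey {
    std::string domain;  // selects the provider
    std::string path;    // resource name within that provider
};

struct Provider {
    virtual ~Provider() = default;
    // Returns a stream for the resource, or nullptr if it does not exist.
    virtual std::unique_ptr<std::istream> open(const std::string& path) = 0;
};

// Maps a path to (offset, size) inside the embedded blob.
using EmbeddedIndex = std::map<std::string, std::pair<std::size_t, std::size_t>>;

class EmbeddedProvider : public Provider {
public:
    EmbeddedProvider(std::string blob, EmbeddedIndex index)
        : blob_(std::move(blob)), index_(std::move(index)) {}

    std::unique_ptr<std::istream> open(const std::string& path) override {
        auto it = index_.find(path);
        if (it == index_.end()) return nullptr;
        return std::make_unique<std::istringstream>(
            blob_.substr(it->second.first, it->second.second));
    }

private:
    std::string blob_;
    EmbeddedIndex index_;
};

class FileProvider : public Provider {
public:
    explicit FileProvider(std::string root) : root_(std::move(root)) {}

    std::unique_ptr<std::istream> open(const std::string& path) override {
        auto file = std::make_unique<std::ifstream>(root_ + "/" + path,
                                                    std::ios::binary);
        if (!file->is_open()) return nullptr;
        return std::unique_ptr<std::istream>(std::move(file));
    }

private:
    std::string root_;
};

class ResourceManager {
public:
    void register_provider(const std::string& domain,
                           std::shared_ptr<Provider> provider) {
        providers_[domain] = std::move(provider);
    }

    // FindProviderByDomain, then forward the key's path to that provider.
    std::unique_ptr<std::istream> open(const ResourceKey& key) const {
        auto it = providers_.find(key.domain);
        if (it == providers_.end()) return nullptr;
        return it->second->open(key.path);
    }

private:
    std::map<std::string, std::shared_ptr<Provider>> providers_;
};
```

Callers only see the manager and a stream, so whether a shader comes from the embedded blob or from disk is decided purely by the key's domain.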

Reference

https://learnopengl-cn.github.io/intro/
