Reinforcement Learning Framework for 2048

Description

The puzzle game 2048 offers a deeply engaging and strategically complex challenge despite its minimalist design and simple rule set. Thanks to its addictive nature, it has reached over 23 million players since its release in 2014.

2048 has also attracted strong interest among researchers, both as a challenge to develop the best-performing algorithm and as an excellent platform for benchmarking Reinforcement Learning techniques.

The core objectives of this project were to examine the mathematical properties of 2048, to implement a highly optimised simulation framework, and to develop a well-performing algorithm by combining various Reinforcement Learning approaches with Computational Mathematics and experimentally perfected novel contributions.

Installation and Setup

Clone the repository to your local machine with:

git clone https://github.com/Cence2002/2048_CPP.git

Navigate to the root directory of the repository:

cd 2048_CPP

Create a new build directory and navigate to it:

mkdir build && cd build

Run CMake to generate the build files:

cmake ..

Build the project:

make

Run the built executable to start the program:

./2048

Alternatively, run the other executable to perform validity tests:

./tests

To clean the build files, run:

make clean

Customisation

The testing and training configurations can be customised at the top of main.cpp, where these settings are exposed as constexpr variables to avoid sacrificing performance.
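As an illustration of why constexpr settings cost nothing at run time, here is a minimal sketch. Every name and value in it (the config namespace, kTraining, kGames, kLearningRate, update) is a hypothetical placeholder chosen for this example; the actual variables in main.cpp may be named and structured differently.

```cpp
#include <cstdint>

// Hypothetical settings block; the real names and values in main.cpp may differ.
namespace config {
constexpr bool kTraining = true;           // train (true) or only evaluate (false)
constexpr std::uint64_t kGames = 100'000;  // number of games to play
constexpr float kLearningRate = 0.1f;      // step size for weight updates
}  // namespace config

// Hypothetical helper: kTraining is known at compile time, so the branch
// below is resolved by the compiler and the untaken side produces no
// machine code. No flag is checked while the program runs.
constexpr float update(float weight, float error) {
    if constexpr (config::kTraining) {
        return weight + config::kLearningRate * error;
    } else {
        return weight;  // evaluation only: weights stay fixed
    }
}

// Settings can also be validated at compile time.
static_assert(config::kGames > 0, "at least one game must be played");
static_assert(config::kLearningRate > 0.0f, "learning rate must be positive");
```

With this pattern, changing a setting means editing the constant and rebuilding with make, rather than passing a command-line flag.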

Results

Combining all value approximation enhancement strategies with two novel additions, PNP and BFCS, our method achieves:

An average score over 600,000, which is over 95% of the current state of the art (an average of 625,000)
A thinking time of only 1-2 milliseconds per move, which is under 1% of that of the current state of the art (0.4 seconds per move)

These results make our solution one of the most efficient algorithms for 2048, and arguably the most sophisticated decision-making strategy to date.

Motivation

This project was completed as a prerequisite for Part II of the Computer Science Tripos (2024) at the University of Cambridge.

License

The project is released under the standard MIT License, which is included in the repository as the LICENSE file.

