Artificial Intelligence Techniques for Playing Concurrent Games of 2048

Stanford CS 221: Artificial Intelligence -- Final Project (Autumn 2017-18)

Alex Wang, Robin Cheong, Vince Ranganathan

Updated 12/21/18

Poster available at: http://web.stanford.edu/class/cs221/2018/restricted/posters/robinc20/poster.pdf

Paper available at: https://drive.google.com/file/d/1Nlr24oJz7EglIGuhd2YSlQPAiO_6jK8Y/view?usp=sharing

The aim of this project is to develop an algorithm to play n concurrent games of 2048, where a single swipe is applied across all n boards. We are interested in understanding how the strategy an agent learns in playing one game needs to change in order to play several of these games concurrently, since oftentimes the strategy for solving one instance of a problem does not generalize well to solving multiple instances of the problem.
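To make the "single swipe across all n boards" mechanic concrete, here is a minimal sketch of the deterministic part of a move. It is not the repository's `gameutil.py`: the names `merge_row_left`, `swipe`, and `swipe_all` are hypothetical, boards are assumed to be 4x4 lists with 0 marking an empty cell, and the random tile that 2048 adds after a move is left out.

```python
from typing import List

Board = List[List[int]]  # square grid, 0 marks an empty cell
DIRECTIONS = ("left", "right", "up", "down")


def merge_row_left(row: List[int]) -> List[int]:
    """Slide one row to the left, merging each pair of equal neighbours once."""
    tiles = [t for t in row if t != 0]
    merged = []
    i = 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            merged.append(tiles[i] * 2)
            i += 2  # both tiles are consumed by the merge
        else:
            merged.append(tiles[i])
            i += 1
    return merged + [0] * (len(row) - len(merged))


def swipe(board: Board, direction: str) -> Board:
    """Return a new board after swiping in the given direction."""
    if direction not in DIRECTIONS:
        raise ValueError(f"unknown direction: {direction!r}")
    if direction == "left":
        return [merge_row_left(r) for r in board]
    if direction == "right":
        return [merge_row_left(r[::-1])[::-1] for r in board]
    # Transpose so columns become rows, swipe horizontally, transpose back.
    transposed = [list(c) for c in zip(*board)]
    inner = "left" if direction == "up" else "right"
    return [list(c) for c in zip(*swipe(transposed, inner))]


def swipe_all(boards: List[Board], direction: str) -> List[Board]:
    """Apply the same swipe to every board, as in the concurrent game."""
    return [swipe(b, direction) for b in boards]
```

A move that helps one board can leave another unchanged or worse off, which is why a single-board strategy may not carry over directly.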

We model the problem as a Markov decision process and implement a modified version of Expectimax (described in detail in the paper above) that balances decision quality against computation time.
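For readers new to the technique, the following is a plain depth-limited Expectimax over several boards. It is not the modified algorithm from the paper. Assumptions made for illustration: the evaluation is the total number of empty cells, a new tile (2 with probability 0.9, 4 with probability 0.1) appears only on boards that the swipe changed, and chance outcomes are enumerated exhaustively, which is only practical for small boards and shallow depths. All function names are hypothetical.

```python
from itertools import product

MOVES = ("left", "right", "up", "down")


def merge_left(row):
    """Slide a row (tuple) left, merging equal neighbours once."""
    tiles = [t for t in row if t]
    out, i = [], 0
    while i < len(tiles):
        if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
            out.append(2 * tiles[i])
            i += 2
        else:
            out.append(tiles[i])
            i += 1
    return tuple(out + [0] * (len(row) - len(out)))


def move(board, direction):
    """Deterministic part of a swipe on one board (tuple of tuples)."""
    if direction == "left":
        return tuple(merge_left(r) for r in board)
    if direction == "right":
        return tuple(merge_left(r[::-1])[::-1] for r in board)
    cols = tuple(zip(*board))  # transpose, swipe horizontally, transpose back
    moved = move(cols, "left" if direction == "up" else "right")
    return tuple(zip(*moved))


def spawns(board):
    """All (probability, board) outcomes of adding one random tile."""
    cells = [(r, c) for r, row in enumerate(board)
             for c, v in enumerate(row) if v == 0]
    if not cells:
        return [(1.0, board)]
    outcomes = []
    for r, c in cells:
        for tile, p in ((2, 0.9), (4, 0.1)):
            grid = [list(row) for row in board]
            grid[r][c] = tile
            outcomes.append((p / len(cells), tuple(map(tuple, grid))))
    return outcomes


def evaluate(boards):
    """Assumed heuristic: total number of empty cells over all boards."""
    return float(sum(row.count(0) for b in boards for row in b))


def expectimax(boards, depth):
    """Return (expected value, best move); the move is None at a leaf."""
    boards = tuple(boards)
    if depth == 0:
        return evaluate(boards), None
    best_value, best_move = None, None
    for m in MOVES:
        moved = tuple(move(b, m) for b in boards)
        if moved == boards:
            continue  # the swipe changes nothing, so it is not a legal move
        # Assumption: only boards changed by the swipe receive a new tile.
        per_board = [spawns(new) if new != old else [(1.0, new)]
                     for new, old in zip(moved, boards)]
        value = 0.0
        for combo in product(*per_board):  # joint chance outcomes
            prob = 1.0
            for p, _ in combo:
                prob *= p
            child = tuple(b for _, b in combo)
            value += prob * expectimax(child, depth - 1)[0]
        if best_value is None or value > best_value:
            best_value, best_move = value, m
    if best_move is None:  # no legal move on any board
        return evaluate(boards), None
    return best_value, best_move
```

The joint chance node grows as the product of the per-board outcomes, which illustrates why search depth and the number of boards trade off against running time.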

To run Expectimax on multiple boards, cd to the Expectimax folder and run game.py with flags:

-d = Depth for Expectimax, default=2, type=int

-b = Number of boards to play on, default=2, type=int

-g = Number of full games to play, default=1, type=int

-m = Which weighting strategy to use, default='simple', choices=('direness', 'simple', 'weighted', 'max')

-f = Use fill (1) or sampling (0) for Expectimax, default=1, type=int

-n = File name to use to store the data
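The flags above map naturally onto Python's `argparse`. The sketch below shows one way such a parser could be declared; it is reconstructed from the list above rather than copied from `game.py`, so the actual script may differ. The `-n` default of `None` and the function name `build_parser` are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented flags (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        description="Play concurrent games of 2048 with Expectimax")
    parser.add_argument("-d", type=int, default=2,
                        help="Depth for Expectimax")
    parser.add_argument("-b", type=int, default=2,
                        help="Number of boards to play on")
    parser.add_argument("-g", type=int, default=1,
                        help="Number of full games to play")
    parser.add_argument("-m", default="simple",
                        choices=("direness", "simple", "weighted", "max"),
                        help="Which weighting strategy to use")
    parser.add_argument("-f", type=int, default=1,
                        help="Use fill (1) or sampling (0) for Expectimax")
    # Assumption: the README gives no default or type for -n.
    parser.add_argument("-n", default=None,
                        help="File name to use to store the data")
    return parser


# Equivalent to running: python3 game.py -d 3 -b 4 -m weighted
args = build_parser().parse_args(["-d", "3", "-b", "4", "-m", "weighted"])
```

With only short options declared, the parsed values are available as `args.d`, `args.b`, and so on; flags that are not passed keep the defaults listed above.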

All data used for the graphs and tables is stored in Expectimax/data in Python 3 pickle format. data_visualizer.py can be modified (internally) to output the tables or graphs shown in the final report.
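To inspect one of the stored files directly, something like the following should work. This is a minimal sketch: `load_results` is a hypothetical helper, the example path is a placeholder, and the structure of the loaded object is not documented here, so print its type before relying on any particular layout.

```python
import pickle
from pathlib import Path


def load_results(path):
    """Load one pickled data file and return whatever object it contains."""
    with Path(path).open("rb") as f:
        return pickle.load(f)


# Example usage (the file name below is a placeholder, not a real file):
# data = load_results("Expectimax/data/my_run")
# print(type(data))
```

As with any pickle file, only load data from a source you trust, since unpickling can execute arbitrary code.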

Run puzzle.py to see our code run on a GUI with 4 boards!

gameutil.py stores all of our board manipulation logic.

player.py stores all of the logic for the actual Expectimax algorithm.

Multi_Game_2048 stores all the files we used for playing multiple games of 2048 in an RL setting, following OpenAI Gym's format.

DQ_learning.py stores the code for Deep Q-Learning -- run the script from the command line using python3.

