Hello world.

elphinkuo · Jul 3, 2018 · 8597a0f · 8597a0f
1 parent 4ae90de
commit 8597a0f
Show file tree

Hide file tree

Showing 40 changed files with 8,628 additions and 2 deletions.
diff --git a/README.md b/README.md
@@ -1,2 +1,111 @@
-# neural-symbolic-machines
-Neural Symbolic Machines is a framework to integrate neural networks and symbolic representations using reinforcement learning, with applications in program synthesis and semantic parsing.
+# Introduction
+
+## Neural Symbolic Machines
+<img src="https://github.com/crazydonkey200/neural-symbolic-machines/blob/master/images/nsm.png" width="100%">
+
+Neural Symbolic Machines (NSM) is a framework to integrate neural networks and symbolic representaions using reinforcement learning. 
+
+## Applications
+<img src="https://github.com/crazydonkey200/neural-symbolic-machines/blob/master/images/ap.png" width="100%">
+
+The framework can be used to learn semantic parsing and program synthesis from weak supervision (e.g., question-answer pairs), which is easier to collect and more flexible than full supervision (e.g., question-program pairs). Applications include virtual assistant, natural language interface to database, human-robot interaction, etc. It has been used to <a href="https://arxiv.org/abs/1611.00020">learn semantic parsers on Freebase<a> and <a href="MAPO">natural language interfaces to database tables<a>. 
+
+## Memory Augmented Policy Optimization
+<img src="https://github.com/crazydonkey200/neural-symbolic-machines/blob/master/images/mapo.png" width="100%">
+
+We use <a href="MAPO link">Memory Augmented Policy Optimization (MAPO)</a> to train NSM. It is a novel policy optimization formulation that incorporates a memory buffer of promising trajectories to reduce the variance of policy gradient estimates for deterministic environments with discrete actions. We also apply systematic exploration to improve exploration and marginal likelihood constraint to accelerate and stabilize training. 
+
+## Distributed Actor-Learner Architecture
+<img src="https://github.com/crazydonkey200/neural-symbolic-machines/blob/master/images/al.png" width="100%">
+
+Our implementation uses a distributed actor-learner architecture that utilizes multiple CPUs and GPUs for scalable training, similar to the one introduced in <a href="https://arxiv.org/abs/1802.01561">the IMPALA paper from DeepMind</a>. 
+
+# Dependencies
+
+- Python 2.7
+- TensorFlow>=1.7
+- Other required packages are summarized in `requirements.txt`.
+
+# Quick start
+
+## Setup AWS instance
+Start a g3.8xlarge instance with “Deep Learning AMI (Ubuntu) Version 10.0” image. (The experiments are conducted using this type of instance and image, you will need to adjust the configurations in scripts to run on other instances.)
+
+Open port (for example, 6000-6010) in the security group for tensorboard. Instructions:
+https://docs.aws.amazon.com/dlami/latest/devguide/tutorial-tensorboard.html
+
+ssh into the instance.
+
+## Download the data and install the dependencies 
+```
+mkdir ~/projects
+cd ~/projects/
+git clone https://github.com/crazydonkey200/neural-symbolic-machines.git 
+
+cd ~/projects/neural-symbolic-machines/
+./aws_setup.sh
+```
+
+## Running experiments and monitor with tensorboard
+
+### Start WikiTable experiment
+```
+screen -S wtq
+source activate tensorflow_p27
+cd ~/projects/neural-symbolic-machines/table/wtq/
+./run.sh mapo mapo
+```
+The experiment trains the model for 30k steps. It takes about 3hrs to finish.
+
+
+### Start tensorboard to monitor WikiTable experiment
+```
+screen -S tb
+source activate tensorflow_p27
+cd  ~/projects/data/wikitable/
+tensorboard --logdir=output
+```
+To see the tensorboard, in the browser, go to 
+[your AWS public DNS]:6006
+`avg_return_1` is the main metric (accuracy). 
+
+
+### Start WikiSQL experiment.  
+```
+screen -S ws
+source activate tensorflow_p27
+cd ~/projects/neural-symbolic-machines/table/wikisql/
+./run.sh mapo mapo
+```
+The experiment trains the model for 15k steps. It takes about 8hrs to finish.
+
+### Start tensorboard to monitor WikiSQL experiment
+```
+screen -S tb
+source activate tensorflow_p27
+cd  ~/projects/data/wikisql/
+tensorboard --logdir=output
+```
+To see the tensorboard, in the browser, go to 
+[your AWS public DNS]:6006
+`avg_return_1` is the main metric (accuracy). 
+
+## Example outputs
+
+Example learning curves for WikiTable (left) and WikiSQL (right) experiments (0.9 smoothing):
+<img src="https://github.com/crazydonkey200/neural-symbolic-machines/blob/master/images/wikitable_curve.png" width="50%"><img src="https://github.com/crazydonkey200/neural-symbolic-machines/blob/master/images/wikisql_curve.png" width="50%">
+
+
+## Citation
+If you use the code in your research, please cite:
+
+    @inproceedings{liang2017neural,
+      title={Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision},
+      author={Liang, Chen and Berant, Jonathan and Le, Quoc and Forbus, Kenneth D and Lao, Ni},
+      booktitle={Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
+      volume={1},
+      pages={23--33},
+      year={2017}
+    }
+
+[TODO] Need to add link to MAPO paper. 
diff --git a/aws_setup.sh b/aws_setup.sh
@@ -0,0 +1,10 @@
+source activate tensorflow_p27
+pip install -r requirements.txt
+mkdir ~/projects/data/
+cd ~/projects/data/
+cp ~/projects/neural-symbolic-machines/table/wtq/wikitable.zip ~/projects/data/wikitable.zip
+unzip wikitable.zip
+cp ~/projects/neural-symbolic-machines/table/wikisql/wikisql.zip ~/projects/data/wikisql.zip
+unzip wikisql.zip
+cd ~/projects/neural-symbolic-machines/
+python setup.py develop
diff --git a/images/al.png b/images/al.png
diff --git a/images/ap.png b/images/ap.png
diff --git a/images/mapo.png b/images/mapo.png
diff --git a/images/nsm.png b/images/nsm.png
diff --git a/images/wikisql_curve.png b/images/wikisql_curve.png
diff --git a/images/wikitable_curve.png b/images/wikitable_curve.png
diff --git a/nsm/__init__.py b/nsm/__init__.py