# **Neural Turing Machine** (NTM) &
# **Differentiable Neural Computer** (DNC) with
# **pytorch** & **visdom**
*******


* Sample on-line plotting while training (avg loss) / testing (write/read weights & memory) DNC on the repeat-copy task:
<img src="/assets/dnc_repeat_copy_train.png" width="205"/> <img src="/assets/dnc_repeat_copy_test.png" width="600"/>


* Sample log output while training DNC on the repeat-copy task (we currently use ```WARNING``` as the logging level to suppress the ```INFO``` printouts from visdom):
```bash
[WARNING ] (MainProcess) <===================================>
[WARNING ] (MainProcess) bash$: python -m visdom.server
[WARNING ] (MainProcess) http://localhost:8097/env/daim_17051000
[WARNING ] (MainProcess) <===================================> Agent:
[WARNING ] (MainProcess) <-----------------------------======> Env:
[WARNING ] (MainProcess) Creating {repeat-copy | } w/ Seed: 123
[WARNING ] (MainProcess) Word {length}: {4}
[WARNING ] (MainProcess) Words # {min, max}: {1, 2}
[WARNING ] (MainProcess) Repeats {min, max}: {1, 2}
[WARNING ] (MainProcess) <-----------------------------======> Circuit: {Controller, Accessor}
[WARNING ] (MainProcess) <--------------------------------===> Controller:
[WARNING ] (MainProcess) LSTMController (
  (in_2_hid): LSTMCell(70, 64, bias=1)
)
[WARNING ] (MainProcess) <--------------------------------===> Accessor: {WriteHead, ReadHead, Memory}
[WARNING ] (MainProcess) <-----------------------------------> WriteHeads: {1 heads}
[WARNING ] (MainProcess) DynamicWriteHead (
  (hid_2_key): Linear (64 -> 16)
  (hid_2_beta): Linear (64 -> 1)
  (hid_2_alloc_gate): Linear (64 -> 1)
  (hid_2_write_gate): Linear (64 -> 1)
  (hid_2_erase): Linear (64 -> 16)
  (hid_2_add): Linear (64 -> 16)
)
[WARNING ] (MainProcess) <-----------------------------------> ReadHeads: {4 heads}
[WARNING ] (MainProcess) DynamicReadHead (
  (hid_2_key): Linear (64 -> 64)
  (hid_2_beta): Linear (64 -> 4)
  (hid_2_free_gate): Linear (64 -> 4)
  (hid_2_read_mode): Linear (64 -> 12)
)
[WARNING ] (MainProcess) <-----------------------------------> Memory: {16(batch_size) x 16(mem_hei) x 16(mem_wid)}
[WARNING ] (MainProcess) <-----------------------------======> Circuit: {Overall Architecture}
[WARNING ] (MainProcess) DNCCircuit (
  (controller): LSTMController (
    (in_2_hid): LSTMCell(70, 64, bias=1)
  )
  (accessor): DynamicAccessor (
    (write_heads): DynamicWriteHead (
      (hid_2_key): Linear (64 -> 16)
      (hid_2_beta): Linear (64 -> 1)
      (hid_2_alloc_gate): Linear (64 -> 1)
      (hid_2_write_gate): Linear (64 -> 1)
      (hid_2_erase): Linear (64 -> 16)
      (hid_2_add): Linear (64 -> 16)
    )
    (read_heads): DynamicReadHead (
      (hid_2_key): Linear (64 -> 64)
      (hid_2_beta): Linear (64 -> 4)
      (hid_2_free_gate): Linear (64 -> 4)
      (hid_2_read_mode): Linear (64 -> 12)
    )
  )
  (hid_to_out): Linear (128 -> 5)
)
[WARNING ] (MainProcess) No Pretrained Model. Will Train From Scratch.
[WARNING ] (MainProcess) <===================================> Training ...
[WARNING ] (MainProcess) Reporting @ Step: 500 | Elapsed Time: 30.609361887
[WARNING ] (MainProcess) Training Stats: avg_loss: 0.014866309287
[WARNING ] (MainProcess) Evaluating @ Step: 500
[WARNING ] (MainProcess) Evaluation Took: 1.6457400322
[WARNING ] (MainProcess) Iteration: 500; loss_avg: 0.0140423600748
[WARNING ] (MainProcess) Saving Model @ Step: 500: /home/zhang/ws/17_ws/pytorch-dnc/models/daim_17051000.pth ...
[WARNING ] (MainProcess) Saved Model @ Step: 500: /home/zhang/ws/17_ws/pytorch-dnc/models/daim_17051000.pth.
[WARNING ] (MainProcess) Resume Training @ Step: 500
...
```
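To get the same behaviour in a script of your own, raising the logging level to ```WARNING``` with the standard ```logging``` module is enough to hide visdom's ```INFO``` chatter. A minimal sketch (illustrative, not this repo's exact logger setup):

```python
import logging

# Illustrative only: log at WARNING so visdom's INFO printouts are suppressed,
# while our own WARNING-level messages (like the ones above) still show up.
logging.basicConfig(level=logging.WARNING,
                    format="[%(levelname)-8s] (%(processName)s) %(message)s")
logger = logging.getLogger(__name__)
logger.warning("<===================================> Training ...")
```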
*******


## What is included?
This repo currently contains the following algorithms:

- Neural Turing Machines (NTM) [[1]](https://arxiv.org/abs/1410.5401)
- Differentiable Neural Computers (DNC) [[2]](http://www.nature.com/nature/journal/v538/n7626/full/nature20101.html)

Tasks (a toy repeat-copy sample is sketched right after this list):
- copy
- repeat-copy

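For readers new to these tasks, here is a toy repeat-copy sample generator. It uses one common encoding and the parameter ranges from the log above; the repo's actual input/target format is defined by its ```Env``` classes:

```python
import numpy as np

# Toy repeat-copy sample (illustrative encoding, not necessarily this repo's exact format):
# the input is a short sequence of random binary words plus a repeat count,
# and the target is that sequence repeated the requested number of times.
rng = np.random.RandomState(123)

word_len    = 4                      # Word {length}: {4}
num_words   = rng.randint(1, 3)      # Words # {min, max}: {1, 2}
num_repeats = rng.randint(1, 3)      # Repeats {min, max}: {1, 2}

words  = rng.randint(0, 2, size=(num_words, word_len))    # the sequence to memorise
target = np.tile(words, (num_repeats, 1))                  # ... repeated num_repeats times

print(words)
print(num_repeats)
print(target)
```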
## Code structure & Naming conventions
NOTE: we follow the same code structure as [pytorch-rl](https://github.com/jingweiz/pytorch-rl) so that code can be easily transplanted between the two repos.
* ```./utils/factory.py```
> We suggest users start from ```./utils/factory.py```,
  where all the integrated ```Env```, ```Circuit``` and ```Agent``` classes are collected into ```Dict```'s.
  All of the core classes are implemented in ```./core/```.
  The factory pattern in ```./utils/factory.py``` keeps the code clean:
  no matter which type of ```Circuit``` you want to train,
  or which type of ```Env``` you want to train it on,
  you only need to modify a few parameters in ```./utils/options.py```,
  and ```./main.py``` does the rest (NOTE: ```./main.py``` itself never needs to be modified);
  a minimal sketch of this pattern is given right after this list.
* naming conventions
> To keep the code clean and readable, variables are named with the following suffixes:
> * ```*_vb```: ```torch.autograd.Variable```'s or a list of such objects
> * ```*_ts```: ```torch.Tensor```'s or a list of such objects
> * otherwise: normal python datatypes

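As referenced above, here is a minimal sketch of that factory pattern. The registry names (```EnvDict```, ```CircuitDict```, ```AgentDict```) and the stand-in classes below are purely illustrative; the real registries live in ```./utils/factory.py``` and the real classes in ```./core/```:

```python
# Purely illustrative sketch of the factory pattern used in this repo
# (names and signatures are made up; see ./utils/factory.py for the real thing).

class RepeatCopyEnv:            # stand-in for an Env class implemented in ./core/
    pass

class DNCCircuit:               # stand-in for a Circuit class implemented in ./core/
    pass

class SLAgent:                  # stand-in for an Agent class implemented in ./core/
    def __init__(self, env_prototype, circuit_prototype):
        self.env = env_prototype()
        self.circuit = circuit_prototype()

# registries mapping option strings to classes, analogous to the Dict's in factory.py
EnvDict     = {"repeat-copy": RepeatCopyEnv}
CircuitDict = {"dnc":         DNCCircuit}
AgentDict   = {"sl":          SLAgent}

# what a generic main.py can then do, regardless of which entries were chosen:
env_type, circuit_type, agent_type = "repeat-copy", "dnc", "sl"   # would come from options.py
agent = AgentDict[agent_type](EnvDict[env_type], CircuitDict[circuit_type])
print(type(agent.env).__name__, type(agent.circuit).__name__)
```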
|
## Dependencies
- Python 2.7
- [PyTorch](http://pytorch.org/)
- [Visdom](https://github.com/facebookresearch/visdom)
*******

## How to run:
You only need to modify some parameters in ```./utils/options.py``` to train a new configuration.

* Configure your training in ```./utils/options.py``` (an illustrative sketch of such a configuration follows this list):
> * ```line 12```: add an entry into ```CONFIGS``` to define your training (```agent_type```, ```env_type```, ```game```, ```circuit_type```)
> * ```line 28```: choose the entry you just added
> * ```line 24-25```: fill in your machine/cluster ID (```MACHINE```) and timestamp (```TIMESTAMP```) to define your training signature (```MACHINE_TIMESTAMP```);
  the model file and the log file of this training will be saved under this signature (```./models/MACHINE_TIMESTAMP.pth``` & ```./logs/MACHINE_TIMESTAMP.log``` respectively).
  The visdom visualization will also be displayed under this signature (first start the visdom server by typing ```python -m visdom.server &``` in bash, then open ```http://localhost:8097/env/MACHINE_TIMESTAMP``` in your browser).
> * ```line 28```: to train a model, set ```mode=1``` (training visualization will be under ```http://localhost:8097/env/MACHINE_TIMESTAMP```); to test the model of the current training, simply set ```mode=2``` (testing visualization will be under ```http://localhost:8097/env/MACHINE_TIMESTAMP_test```).
|
* Run:
> ```python main.py```

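For orientation, here is a hypothetical illustration of what the configuration step boils down to. The list layout, field order and most values below are made up for the example (only ```copy```/```repeat-copy```, ```ntm```/```dnc``` and the ```daim_17051000``` signature appear elsewhere in this README); the authoritative definitions are in ```./utils/options.py```:

```python
# Hypothetical sketch of the configuration step (not the actual ./utils/options.py):
# one CONFIGS entry per experiment, selected by index, plus a training signature.

CONFIGS = [
    # [agent_type, env_type,      game, circuit_type]   <- field order is illustrative
    ["sl",         "copy",        "",   "ntm"],          # entry 0
    ["sl",         "repeat-copy", "",   "dnc"],          # entry 1
]
CONFIG = 1                  # "choose the entry you just added"
MODE   = 1                  # 1: train | 2: test

MACHINE   = "daim"          # your machine/cluster ID
TIMESTAMP = "17051000"      # your timestamp
SIGNATURE = MACHINE + "_" + TIMESTAMP            # -> "daim_17051000"

print("./models/" + SIGNATURE + ".pth")          # where the model is saved
print("./logs/"   + SIGNATURE + ".log")          # where the log goes
print("http://localhost:8097/env/" + SIGNATURE)  # visdom env when mode=1
```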
*******


## Implementation Notes:
The difference between ```NTM``` & ```DNC``` is stated as follows in the
```DNC``` [2] paper:
> Comparison with the neural Turing machine. The neural Turing machine (NTM) was
the predecessor to the DNC described in this work. It used a similar
architecture of neural network controller with read–write access to a memory
matrix, but differed in the access mechanism used to interface with the memory.
In the NTM, content-based addressing was combined with location-based addressing
to allow the network to iterate through memory locations in order of their
indices (for example, location n followed by n+1 and so on). This allowed the
network to store and retrieve temporal sequences in contiguous blocks of memory.
However, there were several drawbacks. First, the NTM has no mechanism to ensure
that blocks of allocated memory do not overlap and interfere—a basic problem of
computer memory management. Interference is not an issue for the dynamic memory
allocation used by DNCs, which provides single free locations at a time,
irrespective of index, and therefore does not require contiguous blocks. Second,
the NTM has no way of freeing locations that have already been written to and,
hence, no way of reusing memory when processing long sequences. This problem is
addressed in DNCs by the free gates used for de-allocation. Third, sequential
information is preserved only as long as the NTM continues to iterate through
consecutive locations; as soon as the write head jumps to a different part of
the memory (using content-based addressing) the order of writes before and after
the jump cannot be recovered by the read head. The temporal link matrix used by
DNCs does not suffer from this problem because it tracks the order in which
writes were made.

We therefore made an effort to put the two together in one combined codebase.
The classes implemented have the following hierarchy:
* Agent
  * Env
  * Circuit
    * Controller
    * Accessor
      * WriteHead
      * ReadHead
      * Memory

The part where ```NTM``` & ```DNC``` differ is the ```Accessor```: in the
code, ```NTM``` uses the ```StaticAccessor``` (maybe not the most appropriate
name, but we use it to keep the code consistent) and ```DNC``` uses the
```DynamicAccessor```. Both ```Accessor``` classes use ```_content_focus()```
and ```_location_focus()``` (again, not an ideal name for ```DNC```, but kept
for consistency). ```_content_focus()``` is the same for both classes, whereas
```_location_focus()``` for ```DNC``` is considerably more involved: it
additionally uses ```dynamic allocation``` for write and the ```temporal link```
for read. These focus (or attention) mechanisms are implemented in the
```Head``` classes; each focus outputs a ```weight``` vector for its ```head```
(write/read), and those ```weight``` vectors are then used in ```_access()```
to interact with the ```external memory```.

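To make this concrete, below is a rough PyTorch-style sketch of the content focus, the DNC-flavoured location focuses and a read access. It follows the standard NTM/DNC equations rather than this repo's exact ```Head``` code; shapes are batch-free and all names are illustrative:

```python
import torch
import torch.nn.functional as F

def content_focus(memory, key, beta):
    """Content focus: cosine similarity between the key and every memory row,
    sharpened by the strength beta and normalised into a weight vector."""
    # memory: (N, W), key: (W,), beta: scalar
    sim = F.cosine_similarity(memory, key.unsqueeze(0).expand_as(memory), dim=1)  # (N,)
    return F.softmax(beta * sim, dim=0)                                           # (N,)

def dnc_write_location_focus(content_w, alloc_w, alloc_gate, write_gate):
    """DNC write focus: blend the content weights with dynamic-allocation weights."""
    return write_gate * (alloc_gate * alloc_w + (1.0 - alloc_gate) * content_w)

def dnc_read_location_focus(content_w, link, prev_read_w, read_mode):
    """DNC read focus: mix backward / content / forward modes via the temporal link matrix."""
    forward_w  = link @ prev_read_w        # follow the order in which writes were made
    backward_w = link.t() @ prev_read_w    # ... or walk that order backwards
    return read_mode[0] * backward_w + read_mode[1] * content_w + read_mode[2] * forward_w

def read_access(memory, read_w):
    """_access() for a read head boils down to a weighted sum over the memory rows."""
    return read_w @ memory                 # (W,)

# tiny usage example with random tensors (N locations of width W, as in mem_hei x mem_wid)
N, W = 16, 16
memory = torch.rand(N, W)
w_c = content_focus(memory, key=torch.rand(W), beta=torch.tensor(5.0))
print(read_access(memory, w_c).shape)      # torch.Size([16])
```
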
## A side note:
The structure of ```Env``` might look strange, as this class was originally
designed for ```reinforcement learning``` settings as in
[pytorch-rl](https://github.com/jingweiz/pytorch-rl); here we use it to
provide datasets for ```supervised learning```, so ```reward```,
```action``` and ```terminal``` are always left blank in this repo.
*******


## Repos we referred to during the development of this repo:
* [deepmind/dnc](https://github.com/deepmind/dnc)
* [ypxie/pytorch-NeuCom](https://github.com/ypxie/pytorch-NeuCom)
* [bzcheeseman/pytorch-EMM](https://github.com/bzcheeseman/pytorch-EMM)
* [DoctorTeeth/diffmem](https://github.com/DoctorTeeth/diffmem)
* [kaishengtai/torch-ntm](https://github.com/kaishengtai/torch-ntm)
* [Mostafa-Samir/DNC-tensorflow](https://github.com/Mostafa-Samir/DNC-tensorflow)