
Some partial supports for READ-only inplaced operations (Re-based) #1103

Closed
wants to merge 47 commits

Conversation

zzsfornlp
Contributor

This is a rebased version of PR #929 on top of the current master branch. Sorry, I have not been following the new commits and changes between that PR and the current point, so I may need some help checking whether there are any logical conflicts. Thanks!

Here are some notes about this PR:

  1. Since NoBackprop and FlipGradient need special gradient manipulation, they keep their own backward memory (see the sketch after this list).
  2. For auto-batch mode, batching is currently simply forbidden for those nodes. First, the memory-allocation part of auto-batching seems a little complex to me, so I did not change much there for fear of breaking something; second, in auto-batching mode, memory may matter less anyway.
  3. (TODO) Saving memory for WRITE nodes may need a better mechanism.
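To illustrate the idea, here is a minimal standalone sketch; this is NOT DyNet's actual classes, and `Tensor` and all function names below are invented for illustration. The point is that a READ-only in-place node's forward pass is the identity, so its output can alias the input's buffer, and only the gradient rule differs per node:

```cpp
// Minimal standalone sketch -- NOT DyNet's real classes; "Tensor" and all
// function names here are invented for illustration only.
#include <cassert>
#include <iostream>
#include <vector>

struct Tensor { std::vector<float>* v; };  // a view over some buffer

// READ-only in-place forward: the output simply aliases the input's
// buffer, so no forward memory is allocated and nothing is copied.
Tensor inplace_forward(const Tensor& x) { return Tensor{x.v}; }

// Only the gradient rule differs between the two nodes.
void nobackprop_backward(std::vector<float>& dEdx) {
  // NoBackprop blocks the gradient entirely (illustrative; real code
  // could just skip the accumulation).
  for (float& g : dEdx) g = 0.f;
}

void flipgradient_backward(const std::vector<float>& dEdf,
                           std::vector<float>& dEdx) {
  // FlipGradient accumulates the negated incoming gradient.
  for (std::size_t i = 0; i < dEdx.size(); ++i) dEdx[i] -= dEdf[i];
}

int main() {
  std::vector<float> buf = {1.f, 2.f};
  Tensor x{&buf};
  Tensor y = inplace_forward(x);
  assert(y.v == x.v);  // output aliases input: zero extra forward memory
  std::vector<float> dEdf = {0.5f, 0.5f}, dEdx = {0.f, 0.f};
  flipgradient_backward(dEdf, dEdx);
  std::cout << "dEdx[0] after flip: " << dEdx[0] << "\n";  // prints -0.5
}
```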

zzsfornlp and others added 30 commits November 26, 2017 21:28
Sharing memories for Reshape(f/b), NoBackprop(f), FlipGradient(f).
* Fixed pernicious bug in autobatching

* Reverted to async and added stream
* Add NoneType assert for list arguments in _dynet.pyx.

* remove additional assert
* fix same_dims check for cmult fwd

* improved same_dims checks

* Removed error in affine transform arg check

* Some refactoring of broadcasting

* Better profiling

* Updates for cwise

* Updated broadcasts

* Fixed autobatch profile for csum
* Added more information in case of memory allocation error

* Added two more lines for clarification

* Added deviceID in cudaErrorMemoryAllocation error message
Trivial fix to make doxygen url work
some sbt versions ignored the options if they were in the wrong place
* Add pool memory info for profiling.

* update

* Add profile info when out of memory.

* move show_pool_mem_info out of CG class and add it into low-level control code.
akoehn and others added 17 commits December 19, 2017 10:59
* make {Nary, Unidirectional}TreeLSTMBuilder non-abstract again

Commit bf29c18 added set_h_impl to
RNNBuilder as an abstract method, and the TreeLSTMBuilders became
abstract. This was fixed for the BidirectionalTreeLSTMBuilder but not
the other two. This commit moves the stub throwing a runtime error
from BidirectionalTreeLSTMBuilder to the TreeLSTMBuilder super class.

* TreeLSTMs: add nodes in arbitrary order; swig bindings

Main feature:
TreeLSTMs previously had to be constructed with the tree's nodes added
in index order. This is impractical when the nodes are already
sequentially ordered but the child nodes do not come first, e.g. when
working with dependency trees. If set_num_elements is called, a fixed
number of node slots is reserved and nodes can be added in any order,
as long as each node's children are added before it (a usage sketch
follows this commit message).

The old behavior still works as usual.

Also:
 - Add documentation to treelstm C++ code
 - add swig bindings
 - add scala wrapper for uni- and bidirectional tree lstm
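As a rough illustration of the reservation scheme described above, here is a minimal standalone sketch. This is NOT DyNet's TreeLSTM API: only the name `set_num_elements` comes from the commit; `TreeStates` and `add_node` are made up for illustration.

```cpp
// Minimal standalone sketch -- NOT DyNet's TreeLSTM API. Only the name
// set_num_elements comes from the commit; TreeStates/add_node are made up.
#include <cassert>
#include <optional>
#include <vector>

struct TreeStates {
  std::vector<std::optional<float>> h;  // one reserved state slot per node
  void set_num_elements(std::size_t n) { h.assign(n, std::nullopt); }
  void add_node(std::size_t id, const std::vector<std::size_t>& children,
                float x) {
    float sum = x;
    for (std::size_t c : children) {
      assert(h[c].has_value() && "children must be added first");
      sum += *h[c];
    }
    h[id] = sum;  // stand-in for the actual LSTM cell update
  }
};

int main() {
  TreeStates t;
  t.set_num_elements(3);        // reserve slots up front
  // Dependency-tree style: the root has id 0, but leaves are added first.
  t.add_node(1, {}, 1.f);
  t.add_node(2, {}, 2.f);
  t.add_node(0, {1, 2}, 0.5f);  // fine: both children already exist
  assert(*t.h[0] == 3.5f);
}
```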

* scala uni- and bidirectional tree lstm wrapper
* added complex structures page to docs

* renamed complex structures page
* Removed devices.h from .h files

* Made C++ compile

* Fix python

* Fixed examples and tests

* Fixed compile on Linux and CUDA

* Fixed bug on Windows?

* Cleaned headers of examples
* Run Travis CI on release tags

* Create sdist before build to avoid unnecessary files

* Fix replacement in .appveyor.yml
* Enhanced implementation of the --dynet-gpu option on the Python end

* fix copy list bug
@neubig
Contributor

neubig commented Dec 22, 2017

OK, I finally got around to taking a look. First, thanks again for contributing this!

I made a few changes to ensure that things don't break with autobatching. The biggest one is that the behavior of the nodes themselves is not modified; instead, forward or backward is simply skipped in the executor (sketched below). This is useful because the executor is now solely in charge of handling inplacing properly, and if inplacing is not supported, things work as they did before.

I'm going to do a few more tests and also make sure that the memory profiling code is working properly, then probably merge this.
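A minimal sketch of that executor-side dispatch; this is NOT DyNet's executor, and every type and name below is invented to illustrate the idea:

```cpp
// Minimal standalone sketch -- NOT DyNet's executor; every type and name
// here is invented to illustrate the dispatch described above.
#include <cassert>
#include <vector>

enum class InplaceType { NONE, READ };  // hypothetical per-node flag

struct Node {
  InplaceType inplace = InplaceType::NONE;
  int arg = -1;  // index of the single input node; -1 means the graph input
};

// The nodes are untouched; the executor alone decides whether to call
// forward() or to just alias the argument's buffer and skip the call.
std::vector<std::vector<float>*> run_forward(
    const std::vector<Node>& nodes, std::vector<float>& input,
    std::vector<std::vector<float>>& storage) {
  std::vector<std::vector<float>*> values(nodes.size());
  for (std::size_t i = 0; i < nodes.size(); ++i) {
    std::vector<float>* in =
        nodes[i].arg < 0 ? &input : values[nodes[i].arg];
    if (nodes[i].inplace == InplaceType::READ) {
      values[i] = in;  // share memory, skip the node's forward() entirely
    } else {
      storage.emplace_back(*in);                 // allocate fresh output
      for (float& v : storage.back()) v *= 2.f;  // stand-in for forward()
      values[i] = &storage.back();
    }
  }
  return values;
}

int main() {
  std::vector<float> x = {1.f, 2.f};
  std::vector<std::vector<float>> storage;
  storage.reserve(8);  // keep pointers into storage stable for this sketch
  std::vector<Node> nodes = {{InplaceType::READ, -1},  // e.g. NoBackprop
                             {InplaceType::NONE, 0}};  // a normal node
  auto vals = run_forward(nodes, x, storage);
  assert(vals[0] == &x);         // the READ node aliased the graph input
  assert((*vals[1])[0] == 2.f);  // the normal node computed a new buffer
}
```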

@zzsfornlp
Contributor Author

Thanks a lot for your help!

@neubig neubig closed this Jan 11, 2018
@neubig
Contributor

neubig commented Jan 11, 2018

#1156 was merged!
