
Improve python wrapper #311

Merged: 28 commits, May 20, 2014

Conversation

shelhamer
Member

This work in progress seeks to polish the Python wrapper, improve its documentation, and include more detailed examples.

  • standardize python mean to binaryproto mean 47ec9ac
  • input preprocessing for Caffe (mean subtraction, scaling, channel order) 8da2a32, 96cd02d
  • automagic input and output blobs (no matter the number) 56ca978
  • batch inputs to net 1b23680
  • friendly forward/backward to prepare blobs/diffs and unpack output 0e5a5cf, ac5e6fa
  • handle ndarrays throughout, instead of lists 31907b5
  • properly separate responsibilities between the pycaffe wrapper, the imagenet wrapper example (wrapper.py) and detection example (detector.py)
  • rewrite examples for completeness to show how to format inputs, compute the forward pass, and read off outputs and extracted features
  • raise exceptions instead of crashing the library. Done in part by 76c2554 thanks to @longjon; adopted for blob arg checking in 9d4324e
  • document, with feeling: docstrings written. TODO: an introductory guide and discussion of blob wrapping / synced memory.
  • fix handling of GPU memory: GPU blob access seems alright for now (provided Forward or Backward are called, since they trigger syncing); see #311 (comment)
  • save example notebooks on durian for timing reference
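Several checklist items above (batch inputs, handling ndarrays throughout) boil down to packing per-input arrays into a single batch blob. A numpy-only sketch, with a hypothetical helper name, of what that packing looks like:

```python
import numpy as np

def pack_batch(inputs):
    """Stack a list of K x H x W ndarrays into one N x K x H x W batch
    blob, as the wrapper does when batching inputs to the net.
    (Illustrative numpy-only sketch; the helper name is hypothetical.)"""
    return np.ascontiguousarray(
        np.stack([np.asarray(x, dtype=np.float32) for x in inputs]))

batch = pack_batch([np.zeros((3, 8, 8)), np.ones((3, 8, 8))])
print(batch.shape)  # (2, 3, 8, 8)
```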

@longjon
Contributor

longjon commented Apr 10, 2014

Let's also make sure that everything that should have a docstring has one, once the interface is stable.

@shelhamer
Member Author

@longjon re: GPU memory, it seems it's not actually an issue. Although mutable_cpu_data() is always used, further calls to Caffe actions like Forward() will sync properly. I checked by zeroing out params in GPU mode; the change was seen in the forward pass.

I don't know what problem I had before. This seems fine, unless I'm tired and somehow missing the issue.

@longjon
Contributor

longjon commented Apr 10, 2014

Yes, I can confirm that, e.g., this works in GPU mode:

net.params['fc8'][0].data[...] = 0
net.ForwardPrefilled()   # or your preferred forward call

and this fits my model of what SyncedMemory is doing. However, note that referential transparency is subtly violated:

p = net.params['fc8'][0].data
net.ForwardPrefilled()
p[...] = 0
net.ForwardPrefilled()   # uses unchanged parameters

which makes me a bit uncomfortable. However, getting blobs has the same issue. In fact, in this code data behaves like a value:

bl = net.blobs['fc8'].data
net.ForwardPrefilled()
# bl does not contain up-to-date fc8, and writing to it has no effect

but in this code like a reference:

bl = net.blobs['fc8'].data
net.ForwardPrefilled()
bl2 = net.blobs['fc8'].data
# bl now contains the same data as bl2, and writing to it works

I think it's fine and probably the best compromise to keep things like they are, but to maximize sanity one should consider data to be a reference that becomes invalid whenever forward or backward are called (and this is true even for params, which don't change on forward/backward).
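The "reference that becomes invalid on forward" behavior can be mimicked with a numpy-only toy (all names hypothetical; caffe itself is not involved): the owner reallocates its backing array on forward, so previously fetched views go stale, much like the SyncedMemory caveat above.

```python
import numpy as np

class Blob:
    """Toy stand-in for a wrapped blob (hypothetical, numpy-only):
    forward() moves the data into fresh memory, so any previously
    fetched .data reference stops tracking the blob."""
    def __init__(self, shape):
        self._data = np.zeros(shape, dtype=np.float32)

    @property
    def data(self):
        return self._data

    def forward(self):
        # simulate a forward pass that rewrites the blob in new memory
        self._data = self._data + 1.0  # new allocation; old views go stale

b = Blob((2,))
old = b.data   # reference to the pre-forward buffer
b.forward()
print(old)     # still zeros: the old reference is stale
print(b.data)  # ones: re-fetching after forward sees fresh data
```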

@shelhamer shelhamer mentioned this pull request Apr 13, 2014
@sergeyk sergeyk added this to the 1.0 milestone Apr 22, 2014
@shelhamer shelhamer changed the title Python wrapper polish Python wrapper improvements Apr 27, 2014
@shelhamer shelhamer changed the title Python wrapper improvements [WIP] Python wrapper improvements May 1, 2014
ilsvrc_2012_mean.npy has dims K x H x W.

Code written for the old D x D x K mean needs to be rewritten!
Do forward pass by prefilled or packaging input + output blobs and
returning a {output blob name: output list} dict.
Preserve the non-batch dimensions of blob arrays, even for singletons.

The forward() and backward() helpers take lists of ndarrays instead of a
single ndarray per blob, and lists of ndarrays are likewise returned.

Note that for output the blob array could actually be returned as a
single ndarray instead of a list.
@shelhamer
Member Author

@longjon Please review my proposed changes to the caffe.Net interface. @sergeyk Let me know what you think too.

Rewriting the rest of pycaffe (imagenet and detector wrappers + examples) according to the new interface will follow.

@@ -1,6 +1,6 @@
 // Copyright 2014 BVLC and contributors.
 // pycaffe provides a wrapper of the caffe::Net class as well as some
-// caffe::Caffe functions so that one could easily call it from Python.
+// caffe::Caffe functions so that one could easily call it from python.
Contributor
Really? See python.org, Python wiki page, etc. Just sayin'.

Member Author
Ok, fair enough. Capital is fine and used throughout now.

@longjon
Contributor

longjon commented May 15, 2014

The standard for Python is always four-space indents (PEP 8, or the Google style guide).

# Input preprocessing
Net.mean = {} # image mean (ndarray, input dimensional or broadcastable)
Net.input_scale = {} # for a model that expects data = input * input_scale
Net.channel_swap = {} # for RGB -> BGR and the like
Contributor
Should the axis permutation also be part of this?

Member Author
H x W x K is the scikit-image standard, so I'm comfortable leaving it out. It'd be easy to add on the model of the existing options if it turns out to be annoying.

Contributor
Yep, H x W x K is pretty much the standard for images; I was thinking of non-image data. But I agree we don't need to be eager to add things that aren't used.
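The preprocessing these options configure can be sketched in numpy (hypothetical helper; the real wrapper's steps and their order may differ): take an H x W x K float image, reorder channels, transpose to Caffe's K x H x W layout, subtract the mean, and scale.

```python
import numpy as np

def preprocess(image, mean, input_scale, channel_swap):
    """Sketch of input preprocessing (hypothetical helper, numpy-only):
    H x W x K float image -> K x H x W Caffe input."""
    data = np.array(image, dtype=np.float32)      # copy; don't touch input
    data = data[:, :, list(channel_swap)]         # e.g. RGB -> BGR: (2, 1, 0)
    data = data.transpose((2, 0, 1))              # H x W x K -> K x H x W
    data -= mean                                  # mean broadcastable to K x H x W
    data *= input_scale                           # model expects data = input * scale
    return data

img = np.ones((4, 4, 3), dtype=np.float32)
out = preprocess(img, mean=0.5, input_scale=255.0, channel_swap=(2, 1, 0))
print(out.shape)  # (3, 4, 4); every value is (1 - 0.5) * 255 = 127.5
```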

@longjon
Contributor

longjon commented May 15, 2014

Some time ago I suggested (which I am now recalling and still partial to) moving the blob copying in forward from C++ to Python. This would mean that Net.forward could be simplified to (just sketching here)

for k, v in kwargs.iteritems():
    self.blobs[k].data[...] = np.asarray(v)
net._forward_prefilled()
# now deal with output

where CaffeNet.Forward goes away and CaffeNet.ForwardPrefilled gets renamed CaffeNet._forward_prefilled and need not be called by the user.

The advantages of doing it this way:

  • replace a bunch of C++ shenanigans with a few lines of Python
  • noncontiguous or non-float32 data are only copied once, whereas now they are copied twice
  • the extra checks and C++ exceptions simply go away, the exception for wrongly-sized input is automatic

What do you think?
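The proposal above can be mocked end to end in numpy (all classes and names here are stubs, not the real caffe API): the Python side owns the copy into the input blobs, then a private prefilled pass runs, so the C++ copy path can go away.

```python
import numpy as np

class MockBlob:
    def __init__(self, shape):
        self.data = np.zeros(shape, dtype=np.float32)

class MockNet:
    """Numpy-only stub of the proposed interface (hypothetical names):
    forward() copies keyword blob args in Python, then runs the
    prefilled pass and packages the outputs."""
    def __init__(self):
        self.blobs = {'data': MockBlob((1, 3, 2, 2)), 'prob': MockBlob((1, 4))}
        self.outputs = ['prob']

    def _forward_prefilled(self):
        # stand-in for the renamed CaffeNet._forward_prefilled
        self.blobs['prob'].data[...] = self.blobs['data'].data.sum()

    def forward(self, **kwargs):
        for k, v in kwargs.items():
            # a wrongly-sized input raises automatically via broadcasting
            self.blobs[k].data[...] = np.asarray(v)
        self._forward_prefilled()
        return {k: self.blobs[k].data for k in self.outputs}

net = MockNet()
out = net.forward(data=np.ones((1, 3, 2, 2)))
print(out['prob'][0, 0])  # 12.0, the sum of the 1 x 3 x 2 x 2 ones input
```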

@longjon
Contributor

longjon commented May 15, 2014

Final note for now, regarding the blobs= mechanism. I see four ways this could be accomplished:

  • how it is now, specifying the blobs you want in addition to the output blobs as an argument
  • return the output blobs if you don't ask for anything specifically, otherwise return exactly what you asked for
  • always return output blobs only, the user can dig in net.blobs for anything else she wants
  • provide a mechanism (to=?) to stop computation at some layer, and always return the last computed blobs

The fourth (and optionally the second?) saves time if the user wants some intermediate level features and not later ones. Of course, that option can be implemented in addition to one of the others.

And three related comments:

  • the returned blobs have the caveats described above re: GPU mode (if we keep this let's document it)
  • it might be nice to be scolded if passing in a blob that will be clobbered in computation
  • I can't decide if the convenience justifies the grating-ness of blobs= + **kwargs. At least one can still run nets with blobs named blobs by filling in net.blobs oneself.

@shelhamer
Member Author

> return the output blobs if you don't ask for anything specifically, otherwise return exactly what you asked for

This is equally reasonable, and one can ask for outputs + additional blobs by blobs=self.outputs + ['conv2', 'pool5'] or the like. For now I'm keeping it as it is, since the output blobs are always computed so they might as well be packaged in.

> user can dig in net.blobs for anything else she wants

While true, it feels a bit awkward, and if we're going to have a convenience method let's make it convenient.

> provide a mechanism (to=?) to stop computation at some layer, and always return the last computed blobs
> [...]
> saves time if the user wants some intermediate level features and not later ones

This is a good idea... for the future. For now the usual workaround of defining the decapitated net and running it is fine (ok, more like "barely tolerable").

> the returned blobs have the caveats described above re: GPU mode (if we keep this let's document it)

I'm not sure how to fix it right now, so let's document it. We should make docs pages for the Python and MATLAB wrappers in general, with details like this.

> it might be nice to be scolded if passing in a blob that will be clobbered in computation

Right now you're yelled at if you don't pass all the named input blobs. I'm happy with that (feel free to add further checks). Now it's going to yell whenever the args don't match the input blobs.

> I can't decide if the convenience justifies the grating-ness of blobs= + **kwargs. At least one can still run nets with blobs named blobs by filling in net.blobs oneself.

I can relate; the collusion of named keywords + **kwargs feels like sin. But it's nice and I've made my peace.
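The packaging behavior kept here, output blobs always included plus any extras requested via blobs=, can be sketched with a hypothetical helper (numpy-only, not the actual pycaffe code):

```python
import numpy as np

def select_blobs(all_blobs, outputs, blobs=None):
    """Sketch of the blobs= packaging discussed above (hypothetical
    helper): always include the output blobs, plus any extra blobs
    requested, as a {name: ndarray} dict."""
    wanted = outputs + [b for b in (blobs or []) if b not in outputs]
    return {k: all_blobs[k] for k in wanted}

all_blobs = {'conv2': np.zeros(1), 'pool5': np.ones(1), 'prob': np.full(1, 2.0)}
out = select_blobs(all_blobs, outputs=['prob'], blobs=['conv2'])
print(sorted(out))  # ['conv2', 'prob']
```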

@shelhamer
Member Author

moving the blob copying in forward from C++ to Python

This is an excellent idea. I'll do this, barring unforeseen issues.

Take blob args and give blob returns as single ndarrays instead of lists
of arrays.

Assign the net blobs and diffs as needed on the python side, which
reduces copies and simplifies the C++ side of the wrapper.

Thanks @longjon for the suggestion.
...and refer to inputs as inputs and not images since general vectors
and matrices are perfectly fine.
Don't run scripts in the module dir to avoid import collisions between
io and caffe.io.
For compositionality and expectations.
@shelhamer
Member Author

Ready to brew with Python.

@shelhamer shelhamer changed the title [WIP] Python wrapper improvements Improve python wrapper May 20, 2014
shelhamer added a commit that referenced this pull request May 20, 2014
@shelhamer shelhamer merged commit f048bea into BVLC:dev May 20, 2014
@shelhamer shelhamer deleted the python-fixes branch May 20, 2014 19:20
@shelhamer shelhamer restored the python-fixes branch May 20, 2014 19:21
@shelhamer shelhamer deleted the python-fixes branch May 20, 2014 19:22
mitmul pushed a commit to mitmul/caffe that referenced this pull request Sep 30, 2014