Adapt the data interface from CXXNET authored by @tqchen & @antinucleon #407

kloudkl · 2014-05-12T06:53:45Z

One of the flexible features of CXXNET is that its data interface is easy to extend and relatively independent of the preprocessing steps. This design is a natural partial solution to #148.

The commit bcf933b translates the CXXNET code literally into Caffe terminology. There are copyright and license concerns that must be addressed. I would like to reach out to @tqchen and @antinucleon to get their permission to adapt their code. I believe the Apache License Version 2.0, under which CXXNET is licensed, is compatible with the BSD 2-Clause license, under which Caffe is licensed.

The API is ready for code review. The concrete implementations will follow when the API is stabilized.

Why has class semaphore disappeared? http://www.boost.org/doc/libs/1_31_0/libs/thread/doc/faq.html#question10
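For readers unfamiliar with the pattern behind commit 76c76f5, here is a minimal sketch of a counting semaphore built from a mutex and a condition variable. It uses the C++11 `std` types rather than `boost::mutex` and `boost::condition_variable` so that it compiles on its own. The class name `CountingSemaphore` and its methods are hypothetical and are not taken from this PR or from CXXNET.

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>

// Hypothetical counting semaphore; a sketch of the general pattern only.
class CountingSemaphore {
 public:
  explicit CountingSemaphore(int initial) : count_(initial) {
    assert(initial >= 0);
  }

  // Release one unit and wake a single waiter, if any.
  void Post() {
    {
      std::lock_guard<std::mutex> lock(mutex_);
      ++count_;
    }
    cv_.notify_one();
  }

  // Block until a unit is available, then take it.
  void Wait() {
    std::unique_lock<std::mutex> lock(mutex_);
    cv_.wait(lock, [this] { return count_ > 0; });
    --count_;
  }

  // Take a unit only if one is available right now.
  bool TryWait() {
    std::lock_guard<std::mutex> lock(mutex_);
    if (count_ == 0) return false;
    --count_;
    return true;
  }

 private:
  std::mutex mutex_;
  std::condition_variable cv_;
  int count_;
};
```

The predicate passed to `cv_.wait` guards against spurious wakeups, which is the main detail a hand-rolled version tends to miss.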

Yangqing · 2014-05-13T16:14:43Z

I will leave it to Evan and Sergey to decide... Personally, I don't like adding more complication to things. We already seem to have a good solution in Sergey's data layers, so this seems a little redundant and overcomplicated.

shelhamer · 2014-05-26T02:57:12Z

Sergey and I will review this after the NIPS deadline 06/06 at the latest. A re-design of data layers for inheritance and modularity is worthy of consideration, so we will take a look at your proposal!

shelhamer · 2014-06-08T20:59:29Z

My stance on the data layers is that it is best to separate the data source from the data transformations, and even to separate the data and the labels. For instance, one might use the same image collection for different tasks with different ground truth such as image classification, detection, or segmentation.

Separating out the transformations should be a simplification, since data layers will be reduced to their IO.

@Yangqing Sergey and I will talk this one over. Of course splitting of the data layers in general is an idea apart from this concrete proposal. Do you see taking apart the data layers as over-complicated in itself? Thanks for pointing this PR out to us.

kloudkl · 2014-06-09T05:58:16Z

I will further simplify the implementation for easier extension and add tests for the existing functionality this week.

Yangqing · 2014-06-09T22:44:00Z

My thought on having the data layer separated is to have the following:

(1) The data layer will have a member variable "next_batch_", which is a vector of Blobs that holds the next batch. We use a vector of blobs because a data layer may want to produce multiple output blobs.

(2) The data layer has an abstract function GetNextBatch() that will be implemented by specific methods to fill prefetched data into next_batch_.

(3) The data layer has a Forward() function that basically copies next_batch_ to its outputs.

(4) The GetNextBatch() will be called in the child thread, just like what we are doing for the current data layer.

For one to write a custom data layer, s/he simply needs to initialize things properly in the constructor, and then implement the GetNextBatch() function to fill in the values for the next batch.
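To make points (1) to (4) concrete, here is a rough sketch under stated assumptions, not Caffe's actual implementation. `Blob` is a stand-in typedef for `caffe::Blob`, `std::thread` replaces whatever threading Caffe uses, and `BaseDataLayer`, `Start()`, `Stop()` and `CounterDataLayer` are hypothetical names; only `next_batch_`, `GetNextBatch()` and `Forward()` come from the plan above.

```cpp
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Stand-in for caffe::Blob; a real Blob also carries shape and device memory.
using Blob = std::vector<float>;

class BaseDataLayer {
 public:
  explicit BaseDataLayer(std::size_t num_outputs) : next_batch_(num_outputs) {}
  virtual ~BaseDataLayer() { Stop(); }

  // Begin prefetching the first batch. Call once after construction.
  void Start() { LaunchPrefetch(); }

  // Wait for any running prefetch. Subclasses call this in their destructor
  // so the worker never touches a partially destroyed object.
  void Stop() {
    if (worker_.joinable()) worker_.join();
  }

  // (3) Copy the prefetched batch to the outputs, then prefetch the next one.
  void Forward(std::vector<Blob>* top) {
    assert(top != nullptr);
    Stop();
    *top = next_batch_;
    LaunchPrefetch();
  }

 protected:
  // (2) Subclasses fill next_batch_ here. (4) Runs on the child thread.
  virtual void GetNextBatch() = 0;

  // (1) One blob per output, for example data and labels.
  std::vector<Blob> next_batch_;

 private:
  void LaunchPrefetch() {
    worker_ = std::thread([this] { GetNextBatch(); });
  }

  std::thread worker_;
};

// Hypothetical custom layer: emits a running counter as "data" and its
// parity as "label".
class CounterDataLayer : public BaseDataLayer {
 public:
  explicit CounterDataLayer(std::size_t batch_size)
      : BaseDataLayer(2), batch_size_(batch_size) {}
  ~CounterDataLayer() override { Stop(); }

 protected:
  void GetNextBatch() override {
    Blob& data = next_batch_[0];
    Blob& label = next_batch_[1];
    data.resize(batch_size_);
    label.resize(batch_size_);
    for (std::size_t i = 0; i < batch_size_; ++i) {
      data[i] = static_cast<float>(counter_);
      label[i] = static_cast<float>(counter_ % 2);
      ++counter_;
    }
  }

 private:
  std::size_t batch_size_;
  std::size_t counter_ = 0;
};
```

A caller would construct `CounterDataLayer layer(3)`, call `layer.Start()`, and then call `layer.Forward(&top)` once per iteration. Joining the worker before reading `next_batch_` is what makes the copy in `Forward()` safe without an extra lock.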

@shelhamer @sergeyk - should you desire, I can refactor the current data layer according to the plan above.

Separating the data transforms from the prefetching behavior is a good idea: for example, subtracting the mean does not cost much time and could well be done outside the data layer.

No offense to anyone, but having three additional classes DataBatch, DataFetcher, and DataIterator is too complicated when the same task could be done in a single abstract class... I'd personally like fellow graduate students to understand the code quickly rather than needing to dig through a lot of classes before they can write a simple algorithm, and in that regard I am willing to sacrifice modularity and scalability (if any) a little bit.
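As an illustration of a transform that lives outside the data layer, here is a small sketch of mean subtraction as a free function. The name `SubtractMean` and the flat, sample-major `std::vector<float>` layout are assumptions made for this example and do not reflect Caffe's Blob layout or API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Subtract a per-element mean from every sample in a flat batch.
// The batch holds num_samples * mean.size() values, one sample after another.
void SubtractMean(const std::vector<float>& mean, std::vector<float>* batch) {
  assert(batch != nullptr);
  assert(!mean.empty() && batch->size() % mean.size() == 0);
  for (std::size_t i = 0; i < batch->size(); ++i) {
    (*batch)[i] -= mean[i % mean.size()];
  }
}
```

Because the function only sees plain buffers, it can be applied to the output of any data layer without that layer knowing about it.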

kloudkl · 2014-06-11T01:12:49Z

I'm glad that there is a much simpler design. Once it's implemented, this can be closed.

sergeyk · 2014-06-16T06:40:24Z

@Yangqing your refactor would be much appreciated

kloudkl · 2014-08-27T11:33:54Z

We don't need this PR any more. Related work is done in #710, #954 and #963.

kloudkl added 9 commits May 12, 2014 14:33

Adapt the data interface from CXXNET authored by @tqchen & @antinucleon

bcf933b

Remove mshadow::Shape

8bd5e34

Add DataIteratorParameter in caffe.proto

e16abd5

Set up the concrete DataIterators

f658950

Add DATA_BUILD_DIR to the Makefile

60508dc

Implement ImageDataIterator and stablize the data interface API

0b6c3d3

Adapt the ThreadBuffer of CXXNET into DataFetcher

8f6e5e0

Replace cxxnet thread with boost::thread in the DataFetcher

74a97cc

Replace CXXNET Semaphore with boost::mutex and condition_variable

76c76f5

Why has class semaphore disappeared? http://www.boost.org/doc/libs/1_31_0/libs/thread/doc/faq.html#question10

shelhamer assigned sergeyk May 21, 2014

shelhamer added the enhancement label May 26, 2014

sguada mentioned this pull request Jun 17, 2014

how to make a prediction in C++ #499

Closed

This was referenced Jun 24, 2014

Feed the ImageDataLayer with OpenCV images directly from memory #251

Closed

The data layer is also plagued with the uninitialized prefetch rng core dump bug #553

Closed

sguada mentioned this pull request Jun 28, 2014

Allow images of different sizes as inputs #557

Closed

kloudkl closed this Aug 27, 2014
