
[headers] remove many installed files which should remain private, add more fixes for downstream projects #2952

Merged: 22 commits, Jun 5, 2019

Conversation

@cenit (Collaborator) commented Apr 17, 2019

As per the title.
While discussing how to publish the tool so it can live in peace with other libraries, I was informed that we export some header files that have the same names as those of other libraries. The best solution is to wrap them inside a darknet folder. Since the tool is commonly used without installing darknet system-wide, and since CMake can make this change invisible to the end user, it should not impact anyone. In the meantime I also implemented some minor fixes for downstream projects using darknet through CMake.

@cenit (Collaborator, Author) commented Apr 18, 2019

Following the discussion in #2956, I realized it is not necessary to install all header files as is done now. Since we also want to reduce API exposure through the CMake toolchain, we can install just darknet.h and yolo_v2_class.hpp, solving all our problems without even having to move headers to a separate folder.
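Installing only the two public headers takes just a few CMake lines; a minimal sketch (the paths and layout are illustrative, not the PR's exact code):

```cmake
# Install only the public API headers; everything else in src/ stays
# private simply by never being listed here.
install(FILES
        ${CMAKE_CURRENT_SOURCE_DIR}/include/darknet.h
        ${CMAKE_CURRENT_SOURCE_DIR}/include/yolo_v2_class.hpp
        DESTINATION include)
```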

@AlexeyAB (Owner)

Does this PR fix some cases with OSX, build.sh, cuDNN, or something else?

@cenit (Collaborator, Author) commented Apr 18, 2019

Yes, some very minor changes. But let me prepare another commit to undo the header path changes before we consider merging.

@cenit cenit changed the title [headers] move .h files to our own subfolder when installed to avoid clashes with other libraries [headers] remove many installed files which should remain private, add more fixes for downstream projects Apr 18, 2019
@cenit (Collaborator, Author) commented Apr 18, 2019

vcpkg CI integration on Linux is not really working. I have to debug what’s going on

@cenit (Collaborator, Author) commented Apr 18, 2019

Then it will be ready for merge consideration.

@AlexeyAB (Owner)

Ok, just say when I can merge it.

Also, is the issue with CUDA 10.1 (#2971) a known one?

@AlexeyAB (Owner)

@cenit Hi,
Can we somehow add support for multiple CUDA Compute Capabilities?
Or at least always_CC3.0 + selected_CC? #3014

@cenit (Collaborator, Author) commented Apr 23, 2019

@cenit Hi,
Can we somehow add support for multiple CUDA Compute Capabilities?
Or at least always_CC3.0 + selected_CC? #3014

Hi! Yes, this should be my next priority, before even working on the setup script.
Let me just push another commit here so that we can consider this PR complete; then it will be my next task.

@cenit (Collaborator, Author) commented Apr 23, 2019

OK, only one question remains unresolved for this PR (apart from the API expansion, which I strongly support but would prefer that you did). Please let me know your opinion so that I can apply it:
for downstream projects, do you prefer

#include <darknet.h>

or

#include <darknet/darknet.h>

?

The first looks shorter and easier, but if we want to expand the installed headers from the current two (darknet.h and yolo_v2_class.hpp) to others, it is better to start "namespacing" our files inside "our folder" now. That way we can also use generic names without risking collisions with files already installed on the system (think of darknet installed by vcpkg or a package manager; when darknet lives in its own folder cloned through git, there is no problem).
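A sketch of how the second option could be wired up in CMake, assuming a target named darknet and headers kept in include/ (illustrative, not the PR's exact code):

```cmake
# Headers land in <prefix>/include/darknet/, so downstream code writes
#   #include <darknet/darknet.h>
install(FILES include/darknet.h include/yolo_v2_class.hpp
        DESTINATION include/darknet)

# In-tree builds use the source tree; installed consumers use <prefix>/include.
target_include_directories(darknet PUBLIC
  $<BUILD_INTERFACE:${CMAKE_CURRENT_SOURCE_DIR}/include>
  $<INSTALL_INTERFACE:include>)
```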

@AlexeyAB (Owner)

We can use the second way, #include <darknet/darknet.h>, as is commonly done in OpenCV (#include <opencv2/opencv.hpp>) or the Boost library (#include <boost/asio.hpp>).
Will you do this in your PR?


it is better if we already start to "namespace" our files inside "our folder".

It is easy to do for the C++ yolo_v2_class.hpp (which I should also rename to yolo_api.hpp or darknet_api.hpp), since we only need to add two lines:

namespace dark {
...
}

But it is much more difficult for the C darknet.h, since we would have to rename all API functions to add a dark_ or dn_ prefix, and then rename these functions and their calls in all the C files in Darknet. The main thing is not to break anything, so it can be a long process.

It also affects the Python scripts that use the C API.


We should also think about the C# API https://github.com/AlexeyAB/darknet/blob/099b71d1de6b992ce8f9d7ff585c84efd0d4bf94/build/darknet/YoloWrapper.cs,
which is a C API:

extern "C" LIB_API int init(const char *configurationFilename, const char *weightsFilename, int gpu);
extern "C" LIB_API int detect_image(const char *filename, bbox_t_container &container);
extern "C" LIB_API int detect_mat(const uint8_t* data, const size_t data_length, bbox_t_container &container);
extern "C" LIB_API int dispose();
extern "C" LIB_API int get_device_count();
extern "C" LIB_API int get_device_name(int gpu, char* deviceName);
extern "C" LIB_API void send_json_custom(char const* send_buf, int port, int timeout);

which uses the C++ API:
//static Detector* detector = NULL;
static std::unique_ptr<Detector> detector;

int init(const char *configurationFilename, const char *weightsFilename, int gpu)
{
    detector.reset(new Detector(configurationFilename, weightsFilename, gpu));
    return 1;
}

int detect_image(const char *filename, bbox_t_container &container)
{
    std::vector<bbox_t> detection = detector->detect(filename);
    for (size_t i = 0; i < detection.size() && i < C_SHARP_MAX_OBJECTS; ++i)
        container.candidates[i] = detection[i];
    return detection.size();
}

int detect_mat(const uint8_t* data, const size_t data_length, bbox_t_container &container) {
#ifdef OPENCV
    std::vector<char> vdata(data, data + data_length);
    cv::Mat image = imdecode(cv::Mat(vdata), 1);
    std::vector<bbox_t> detection = detector->detect(image);
    for (size_t i = 0; i < detection.size() && i < C_SHARP_MAX_OBJECTS; ++i)
        container.candidates[i] = detection[i];
    return detection.size();
#else
    return -1;
#endif // OPENCV
}

int dispose() {
    //if (detector != NULL) delete detector;
    //detector = NULL;
    detector.reset();
    return 1;
}

int get_device_count() {
#ifdef GPU
    int count = 0;
    cudaGetDeviceCount(&count);
    return count;
#else
    return -1;
#endif // GPU
}

int get_device_name(int gpu, char* deviceName) {
#ifdef GPU
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, gpu);
    std::string result = prop.name;
    std::copy(result.begin(), result.end(), deviceName);
    return 1;
#else
    return -1;
#endif // GPU
}

#ifdef GPU
void check_cuda(cudaError_t status) {
    if (status != cudaSuccess) {
        const char *s = cudaGetErrorString(status);
        printf("CUDA Error Prev: %s\n", s);
    }
}
#endif

struct detector_gpu_t {
    network net;
    image images[NFRAMES];
    float *avg;
    float* predictions[NFRAMES];
    int demo_index;
    unsigned int *track_id;
};

@cenit (Collaborator, Author) commented Apr 23, 2019

What I meant by namespacing, in quotes, was just the darknet/ folder. I will do it in this PR, in the next commit.

But I also understand your point about real namespacing, which may be necessary only for the C++ header because, as you said, it is more difficult for C and also impacts Python. It deserves another PR for sure. When discussing the API surface expansion we can discuss the namespace rework and a revision of C/C++/C#.

@cenit (Collaborator, Author) commented May 4, 2019

I still have some improvements to apply for downstream users of the darknet library. Sorry for the delay; I had some problems.

@AlexeyAB (Owner) commented May 7, 2019

@cenit Hi,

Is this PR ready, so that I can merge it?

I created several projects: https://github.com/AlexeyAB/darknet/projects
And added the tasks you are working on: https://github.com/AlexeyAB/darknet/projects/3

@cenit (Collaborator, Author) commented May 7, 2019

Hi, no, sorry, still not finished. I am very sorry, I have been too busy these days. Tomorrow I will finish it!

edit: Wonderful projects! I just had time to read them, wow! Yes, that's exactly what I meant. This way we can see our way forward better!

@cenit (Collaborator, Author) commented May 13, 2019

Waiting for microsoft/vcpkg#6417 to be merged, since it fixes vcpkg on macOS. Some tests on my part too, then it should be ready to merge.

@AlexeyAB (Owner)

Nice work! Will wait.

@cenit (Collaborator, Author) commented May 14, 2019

microsoft/vcpkg#6417 is merged, and the PR should be almost ready. I'd like to do some more tests, but I don't know if I have time today.

edit: I will read the CI logs and check for regressions later when it's finished, since I changed some logic inside CMake.

@cenit (Collaborator, Author) commented May 16, 2019

I think it is ready for users to test; in case of problems I will help debug.
Two things are important: the best CUDA compute capability is detected automatically, and we can now easily target multiple CUDA compute capabilities in CMake. It was an easy win with this refactor!

@AlexeyAB (Owner)

Thanks a lot! Does it use two CCs by default: CC 3.0 + CC 7.0/7.5, selected based on the CUDA version?

@cenit (Collaborator, Author) commented May 16, 2019

For now it just chooses the best CC based on your GPU and your CUDA version. On CI, where detection fails because there's no GPU, it builds for ALL CCs 😄 It seems to work well!
Manually, from the CMake interface (command line or GUI), you can tell the script to do it automatically (best CC) or to build for specific CCs.
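A common way to expose such a choice as a drop-down in cmake-gui is a cache variable with a STRINGS property; a sketch (the option strings are illustrative, not the PR's exact list):

```cmake
# Shown as a drop-down in cmake-gui; any custom value can still be
# passed on the command line with -DCUDA_ARCHITECTURES=...
set(CUDA_ARCHITECTURES "Auto" CACHE STRING "CUDA architectures to build for")
set_property(CACHE CUDA_ARCHITECTURES PROPERTY STRINGS
             "Auto" "Common" "All" "Kepler" "Maxwell" "Pascal" "3.0 7.5")
```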

@AlexeyAB (Owner)

Can we add CUDA CC selection to the CMake-gui, or can we use two CCs by default: CC 3.0 + CC 7.0/7.5, selected based on the CUDA version?


I just can't find where I can set the CUDA CC in the CMake-gui:
[screenshot: darknet variable list in CMake-gui]


Can we do it the same way as it is done in OpenCV?

[screenshot: OpenCV's CUDA options in CMake-gui]

@cenit (Collaborator, Author) commented May 19, 2019

Sure, I will improve the user interface in the next commit :)

@cenit (Collaborator, Author) commented May 20, 2019

Now there's a CUDA_ARCHITECTURES variable with some pre-set values:

  • "Auto" detects the local machine's GPU compute arch automatically and builds for it
  • "Common" covers common architectures
  • "All" builds for all architectures known to the CUDA SDK found
  • "Names" is a list of architectures to enable by name (an example is given)
  • "Numbers" is a list of compute capabilities (version numbers) to enable (an example, 3.0 + 7.5, is given)

@AlexeyAB Please let me know if you like this version more

@AlexeyAB (Owner) commented Jun 4, 2019

@cenit Hi, Thanks for multiple CCs! )

I think we can enable CUDNN_HALF even for CC >= 6.0: https://github.com/AlexeyAB/darknet/pull/2952/files#diff-af3b638bc2a3e6c650974192a53c7291R58
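In CMake such a gate could look roughly like this (a sketch; the variable name CUDA_COMPUTE_CAPABILITY is hypothetical, not the PR's actual variable):

```cmake
# Enable the mixed-precision cuDNN path only when the selected compute
# capability supports it (hypothetical variable CUDA_COMPUTE_CAPABILITY).
if(CUDA_COMPUTE_CAPABILITY VERSION_GREATER_EQUAL "6.0")
  add_definitions(-DCUDNN_HALF)
endif()
```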


"Auto" detects local machine GPU compute arch automatically and builds for it

Does it detect the CC based on the GPU card or on the CUDA version?
What will the behavior be if there are several different GPUs?

Maybe it is better to use minimal CC 3.0 + Auto (CC 7.5) by default? It would solve the issue of multiple different GPUs.

I got for RTX 2070 and CUDA 10.0:

  • Auto: compute_75,sm_75

  • Common: compute_30,sm_30;compute_35,sm_35;compute_50,sm_50;compute_52,sm_52;compute_60,sm_60;compute_61,sm_61;compute_70,sm_70;compute_75,sm_75;compute_70,compute_70;compute_75,compute_75

  • All: compute_20,sm_20;compute_20,sm_21;compute_30,sm_30;compute_35,sm_35;compute_50,sm_50;compute_52,sm_52;compute_32,sm_32;compute_37,sm_37;compute_53,sm_53;compute_60,sm_60;compute_61,sm_61;compute_70,sm_70;compute_75,sm_75

  • Kepler Maxwell Kepler+Tegra Maxwell+Tegra Pascal: compute_30,sm_30;compute_35,sm_35;compute_50,sm_50;compute_52,sm_52;compute_32,sm_32;compute_53,sm_53;compute_60,sm_60;compute_61,sm_61

  • compute_30,sm_30;compute_75,sm_75: compute_30,sm_30;compute_75,sm_75

@cenit (Collaborator, Author) commented Jun 4, 2019

@AlexeyAB hi! OK for CC >= 6.0 to enable CUDNN_HALF.

About the CC, it is completely delegated to CMake. In the new framework we are exploiting, CMake has this wonderful capability, which gets better with every version. I'd leave Auto by default, not CC3.0+CC7.5, because that one is just one click away and I see it as a "more expert" setting than plain automatic selection... but of course changing the default is very easy, just let me know. IMHO it is also less future-proof: Auto will still work in 5 years, while 3.0+7.5 will be "old"... so less maintenance for you ;)

@AlexeyAB (Owner) commented Jun 4, 2019

Do you mean that if I have a Tesla V100 CC6.0 and CUDA 10.0, which supports CC7.5, then CMake will automatically use CC6.0 with Auto?

@cenit (Collaborator, Author) commented Jun 5, 2019

Yes. It should, at least...

AlexeyAB merged commit 2347913 into AlexeyAB:master on Jun 5, 2019
@AlexeyAB (Owner) commented Jun 5, 2019

Thanks for the PR, I merged it!
Can you show which line in CMakeLists does the CC auto-detection?

@cenit (Collaborator, Author) commented Jun 5, 2019

Hi!
Sure, it's this one:

cuda_select_nvcc_arch_flags(CUDA_ARCH_FLAGS ${CUDA_ARCHITECTURES})

You give it a parameter (CUDA_ARCHITECTURES), which is created as a drop-down list with some options but can always be overridden by the user; it returns a variable (CUDA_ARCH_FLAGS) containing the list of options to pass to the compilers to build all the requested CCs.
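For context, a minimal sketch of how such a flag list is typically consumed with the classic FindCUDA module (the append to CUDA_NVCC_FLAGS is an assumption, not necessarily this PR's exact code):

```cmake
find_package(CUDA REQUIRED)
# Translate the user's selection ("Auto", "Common", "All", arch names,
# or version numbers) into -gencode flags for nvcc...
cuda_select_nvcc_arch_flags(CUDA_ARCH_FLAGS ${CUDA_ARCHITECTURES})
# ...and forward them to the CUDA compiler.
list(APPEND CUDA_NVCC_FLAGS ${CUDA_ARCH_FLAGS})
message(STATUS "CUDA arch flags: ${CUDA_ARCH_FLAGS}")
```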

@cenit cenit deleted the dev/cenit/include branch June 5, 2019 15:22
TomHeaven pushed a commit to TomHeaven/darknet that referenced this pull request Aug 13, 2020
 [headers] remove many installed files which should remain private, add more fixes for downstream projects