convert the format of the caltech pedestrian dataset to the format that yolo uses

This repo is adapted from

dependencies

opencv
numpy
scipy

how to

Convert the .tar files with video to .png frames by running $ ./generate-images.py {path_to_derectory_with_tar_files} {path_to_output} [amount_of_threads]. This script takes all files from <path_to_directory_with_tar_files>, extracts each archive and converts extracted .seq files to <path_to output>, finally it removes folders with extracted .seq files. Parallel mode provides way to go through above described flow in regime like "one thread per archive". To enable parallelism need just to specify <amount_of_threads> as integer value. The most efficient way is to specify number of threads equal to number of the archives in directory (sure if you have enough cores on CPU).
Squared images work better, which is why you can convert the 640x480 frames to square frames by running $ python squarify-images.py {path_to_derectory_with_images} {output_directory_path} {size_of_square_side} [amount_of_threads]. It converts images to 640x640(by adding white bar) and then to specified size.
Convert the .vbb annotation files to .txt files by running $ python generate-annotation.py {path_to_annotation.zip} {train_samples_output_directory} {test_samples_output_directory} [squarified_image_side_length]. It will create test.txt and train.txt into each output directory respectively. Also it will create bunch of .txt files named as images connected with them. For images wchich don't contain any lables .txt files won't be created.
Adjust .data yolo file
Adjust .cfg yolo file: take e.g. yolo-voc.2.0.cfg and set height = {height_of_your_img}, width = {width_of_your_img}, classes = 2, and in the final layer filters = 21 (= (classes + 5) * 3))

folder structure

|- caltech
|-- annotations
|-- test06
|--- V000.seq
|--- ...
|-- ...
|-- train00
|-- ...
|- caltech-for-yolo (this repo, cd)
|-- generate-images.py
|-- generate-annotation.py
|-- images
|-- labels
|-- test.txt
|-- train.txt

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
README.md		README.md
generate-annotation.py		generate-annotation.py
generate-images.py		generate-images.py
squarify-images.py		squarify-images.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

convert the format of the caltech pedestrian dataset to the format that yolo uses

dependencies

how to

folder structure

About

Releases

Packages

Languages

nyckyta/caltech-pedestrian-dataset-to-yolo-format-converter

Folders and files

Latest commit

History

Repository files navigation

convert the format of the caltech pedestrian dataset to the format that yolo uses

dependencies

how to

folder structure

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages