-
-
Notifications
You must be signed in to change notification settings - Fork 226
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Empty and Missing labels after conversion #44
Comments
Thanks for writing the issue. Looking at the global_json2yolo code, there are some flags.
Converting the COCO segmentation format to YOLO segmentation format.
Converting the COCO keypoints format to YOLO keypoints format.
To convert the COCO segmentation format to YOLO segmentation format.
This is the folder structure when we run the script. Please let us know your opinion. |
Thanks for reaching out and providing the details. From the script and folder structure you've shared, it looks like the issue could be related to the conversion script and its configuration when converting COCO format to YOLO format. Ensure that the conversion flags (use_segments and use_keypoints) are set appropriately for your dataset, ensuring that the conversion is accurately representing your data. In addition, verify that the source path for the JSON files is correctly specified. If you continue to encounter issues, please feel free to provide more information or reach out for further assistance. |
I keep running into the following error,
The directory annotations contains all the json files that are mentioned on the coco dataset website. |
@Mary14-design it seems like the script encounters a if ann.get("iscrowd", 0): This way, if |
Hi @Mary14-design, thanks for writing the issue. In the captions dataset, "category_id" , "bbox", "segmentation", and "keypoints" do not exist. If "bbox", "segmentation", and "keypoints" do not exist, we can assume that this dataset might be for the classification task. But even in the classification task, we need the class id to convert to the YOLO format. We updated this script global_json2yolo.py , so that when converting the caption dataset, there are no errors, but each text file is empty. Please let us know your opinion. Caption dataset.
Detection, Segmentation, and Keypoints dataset.
|
Hi @ryouchinsa! Thanks for the update and for diving deep into the conversion of caption datasets to YOLO format. As you've observed, without "category_id", "bbox", "segmentation", and "keypoints", the dataset is not directly compatible with typical YOLO use cases, which generally focus on object detection tasks. Your approach of allowing the script to run without errors and producing empty text files for the caption dataset is sensible under the circumstances. It ensures that users are aware of the limitations when attempting to apply object detection frameworks to datasets intended for different tasks like captioning. For anyone looking to work with such datasets in the future, it's important to note that YOLO requires specific annotation formats to train on detection, segmentation, and keypoints tasks effectively. Converting non-compatible datasets might require creative preprocessing or a reconsideration of the dataset's suitability for the intended task. Thanks for sharing your insights, and feel free to reach out if you have more questions or ideas! 😊 |
The number of files after conversion is less than the number of files in json. Also some of the converted files have empty labels.
The text was updated successfully, but these errors were encountered: