You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jul 1, 2024. It is now read-only.
Hi, I am working on segmenting car bodies in images using the Meta SAM model. I am facing a significant difference in performance between the UI demo on the official website and the code provided on the GitHub repository. UI demo performed remarkably well with just 1-2 clicks, however, when I attempted to use the code, results are very different and bad. Despite of providing multiple points, the results were not up to the mark as compared to the demo.
Using SAM Model Version:- "vit_h"
Used predictor_example file:- notebooks/predictor_example.ipynb
Examples: Image 1:
Original Image:
UI Demo Segmentation: - Performed well with 4 foreground points and 3 background points.
My Code Segmentation: - Poor results with the same point placement.
Image 2:
Original Image:
UI Demo Segmentation: - Good results with 4 foreground points and 4 background points.
My Code Segmentation: - Poor results with the same point placement.
I would appreciate any insights into why this discrepancy is happening.
Could it be related to hidden hyperparameter settings, optimizers, or learning rates used in the UI demo that aren't included in the GitHub code?
If this is the case, would it be possible to provide some guidance.
The text was updated successfully, but these errors were encountered:
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Hi, I am working on segmenting car bodies in images using the Meta SAM model. I am facing a significant difference in performance between the UI demo on the official website and the code provided on the GitHub repository. UI demo performed remarkably well with just 1-2 clicks, however, when I attempted to use the code, results are very different and bad. Despite of providing multiple points, the results were not up to the mark as compared to the demo.
Using SAM Model Version:- "vit_h"
Used predictor_example file:- notebooks/predictor_example.ipynb
Examples:
![image3](https://private-user-images.githubusercontent.com/92079088/340618009-7c54a8a7-0f63-4277-99ef-f09eecec3680.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI5Mjk5ODUsIm5iZiI6MTcyMjkyOTY4NSwicGF0aCI6Ii85MjA3OTA4OC8zNDA2MTgwMDktN2M1NGE4YTctMGY2My00Mjc3LTk5ZWYtZjA5ZWVjZWMzNjgwLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDYlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODA2VDA3MzQ0NVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTc2NGJjMTY2OTVhMThlZmY3NTdhZDMyNzlhMWRkYTY4Yjk0YzhjYjU2MTJiMzAxMjQzY2Q2ZGU4YjNmNGIwYTAmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.N7qyLQMQ1q32NUbaihglmOMm37NDD8R50LiBIODrPTY)
Image 1:
Original Image:
UI Demo Segmentation: - Performed well with 4 foreground points and 3 background points.
![resized_sam_ui_3](https://private-user-images.githubusercontent.com/92079088/340617407-a8bb7a03-5fb0-45dd-af8a-751981efe147.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI5Mjk5ODUsIm5iZiI6MTcyMjkyOTY4NSwicGF0aCI6Ii85MjA3OTA4OC8zNDA2MTc0MDctYThiYjdhMDMtNWZiMC00NWRkLWFmOGEtNzUxOTgxZWZlMTQ3LmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDYlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODA2VDA3MzQ0NVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWY1NzMyNWQ0ZDRmNDYzODgwNzJiN2E4ZWU2ZTEwYzY5NWVmZGU3OWRkMWYzNjBmNTMxY2EyZjNlNWZlNzEzMjYmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.CN2EIjq68p9eK7_Rbh_00vSQGmUqHHm496U5jOv1N7I)
My Code Segmentation: - Poor results with the same point placement.
![code_output_3](https://private-user-images.githubusercontent.com/92079088/340616837-8577e8d3-1e2a-4425-868b-d0df99016387.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI5Mjk5ODUsIm5iZiI6MTcyMjkyOTY4NSwicGF0aCI6Ii85MjA3OTA4OC8zNDA2MTY4MzctODU3N2U4ZDMtMWUyYS00NDI1LTg2OGItZDBkZjk5MDE2Mzg3LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDYlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODA2VDA3MzQ0NVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTVjZmUwY2I4NjU3NTQyZjFlYzhlZjhiOGExNDI3MWU3MWFjNmRmMjQxYzY5YjVjYmI5ZGZiOGUzYmM1YTAzNjcmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.Lrbcg9biSHPk5GumgqR1z3mW6CDnJVTorD9ROiI-KTM)
Image 2:
![image2](https://private-user-images.githubusercontent.com/92079088/340616217-bb3d6fc7-364f-458a-8a9f-6016741a47e3.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI5Mjk5ODUsIm5iZiI6MTcyMjkyOTY4NSwicGF0aCI6Ii85MjA3OTA4OC8zNDA2MTYyMTctYmIzZDZmYzctMzY0Zi00NThhLThhOWYtNjAxNjc0MWE0N2UzLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDYlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODA2VDA3MzQ0NVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTZhMjM0Zjk0ZjdkYzliYzYzMTgwZDQ3MjRiMmMzMzEzYTczNjVmNjZiMWRmMjBiMGQyMTIxZWVmMDYxOGQyYzcmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.gO3u_Y3eo5caIe_JsB5tnTobm5f-9RuZ_GWgli3RiLs)
Original Image:
UI Demo Segmentation: - Good results with 4 foreground points and 4 background points.
![resized_sam_ui_2](https://private-user-images.githubusercontent.com/92079088/340617319-9eae1a7d-67f4-49d7-8cf2-f45a4e5c1cb6.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI5Mjk5ODUsIm5iZiI6MTcyMjkyOTY4NSwicGF0aCI6Ii85MjA3OTA4OC8zNDA2MTczMTktOWVhZTFhN2QtNjdmNC00OWQ3LThjZjItZjQ1YTRlNWMxY2I2LmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDYlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODA2VDA3MzQ0NVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTQ0OTliYmE0ZjUyMWUyMTI2YWMxNDJkYjc3MjU2NDQ5YTEyODNhYzc4NTRjZjJiOGIyYTM3N2I4MzYzZTVhZjImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.vQCkO6gBqGsRxWroxdTjP8RTbgDL2lJ6UxVNXOJ0LVE)
My Code Segmentation: - Poor results with the same point placement.
![code_output_2](https://private-user-images.githubusercontent.com/92079088/340616770-123bc20f-4374-494a-8dd6-4fb79ec3bac2.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjI5Mjk5ODUsIm5iZiI6MTcyMjkyOTY4NSwicGF0aCI6Ii85MjA3OTA4OC8zNDA2MTY3NzAtMTIzYmMyMGYtNDM3NC00OTRhLThkZDYtNGZiNzllYzNiYWMyLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA4MDYlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwODA2VDA3MzQ0NVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTJkYTMzMDY3NDM4MDIxMGU1ZmE2MDAzMTM1NjRhOTA3Y2NkYTljNjU3N2E0OWJjMzkxMmI0YjhiOWU4YmFkN2MmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.mYljfynl_v--MnbCU3HpMyV_J_Q9OAowHKn8kB6q314)
I would appreciate any insights into why this discrepancy is happening.
Could it be related to hidden hyperparameter settings, optimizers, or learning rates used in the UI demo that aren't included in the GitHub code?
If this is the case, would it be possible to provide some guidance.
The text was updated successfully, but these errors were encountered: