Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Distilled teacher model #30

Open
Doan-IT opened this issue Dec 7, 2023 · 2 comments
Open

Distilled teacher model #30

Doan-IT opened this issue Dec 7, 2023 · 2 comments

Comments

@Doan-IT
Copy link

Doan-IT commented Dec 7, 2023

Sorry for my ignorance, i have small question about teacher model.
The teacher model pre-distilled with image dataset makes it smarter when training with other datasets?

Can you share the background of this rule. I feed there is no similarity between mvrec dataset and image dataset.

Thank you.

@willyfh
Copy link

willyfh commented Dec 23, 2023

Sorry for my ignorance, i have small question about teacher model. The teacher model pre-distilled with image dataset makes it smarter when training with other datasets?

Can you share the background of this rule. I feed there is no similarity between mvrec dataset and image dataset.

Thank you.

From my understanding, while there might not be a direct similarity between the ImageNet and MVTec datasets, the aim is for the features learned by the teacher model to encompass relevant aspects of visual patterns on a vast amount of data. During the distillation process for the student network, the student network is trained to emulate the behavior of the pretrained teacher (pre-distilled on ImageNet dataset), particularly on normal data of MVTec dataset. However, during testing on anomalous data, a high discrepancy between the student and teacher outputs is expected. The student hasn't been explicitly taught to replicate the teacher's output on anomalous instances, as the teacher's features are derived from the broader context of ImageNet representations. This discrepancy is leveraged as an indicator of anomalies, highlighting instances where the student deviates from the learned normal patterns.

@planemanner
Copy link

Addtion to good answer from @willyfh, I recommend you to read following paper for your comprehension about the mechanism.

Uninformed Students: Student–Teacher Anomaly Detection with Discriminative Latent Embeddings, CVPR 2020

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants