BYOL loss function #1058

vishnu-dev · 2023-07-16T13:23:52Z

🐛 Bug

The loss function in BYOL doesn't seem to match the one defined in the paper.

To Reproduce

Run training with the BYOL model.

Code sample

lightning-bolts/src/pl_bolts/models/self_supervised/byol/byol_module.py

Lines 130 to 142 in 4f910f6

    
    def calculate_loss(self, v_online: Tensor, v_target: Tensor) -> Tensor:
        """Calculates similarity loss between the online network prediction of target network projection.

        Args:
            v_online (Tensor): Online network view
            v_target (Tensor): Target network view
        """
        _, z1 = self.online_network(v_online)
        h1 = self.predictor(z1)
        with torch.no_grad():
            _, z2 = self.target_network(v_target)
        return -2 * F.cosine_similarity(h1, z2).mean()

Expected behavior

The loss should be 2 - (2 * F.cosine_similarity(h1, z2).mean())?
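
For reference, here is a minimal dependency-free sketch comparing the two expressions. It assumes the paper's loss is the squared Euclidean distance between the L2-normalized prediction and the L2-normalized target projection. The helper names (`l2_normalize`, `cosine_similarity`, `normalized_mse`) and the random vectors standing in for `h1` and `z2` are illustrative only and are not part of pl_bolts.

```python
import math
import random


def l2_normalize(v):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def normalized_mse(a, b):
    """Squared Euclidean distance between the L2-normalized vectors."""
    a_hat, b_hat = l2_normalize(a), l2_normalize(b)
    return sum((x - y) ** 2 for x, y in zip(a_hat, b_hat))


# Hypothetical stand-ins for one prediction (h1) and one target projection (z2).
random.seed(0)
h1 = [random.gauss(0, 1) for _ in range(8)]
z2 = [random.gauss(0, 1) for _ in range(8)]

paper_loss = normalized_mse(h1, z2)                # assumed form of the paper's loss
current_loss = -2 * cosine_similarity(h1, z2)      # what calculate_loss returns today
proposed_loss = 2 - 2 * cosine_similarity(h1, z2)  # expression proposed in this issue

# The proposed expression equals the normalized squared distance.
assert math.isclose(paper_loss, proposed_loss, rel_tol=1e-9, abs_tol=1e-12)
# The current expression differs from it only by the constant 2.
assert math.isclose(proposed_loss - current_loss, 2.0)

print(paper_loss, current_loss, proposed_loss)
```

Because the two expressions differ only by the constant 2, their gradients with respect to `h1` are identical, so the change would affect the reported loss value (range 0 to 4 instead of -2 to 2) rather than the optimization itself.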


Borda · 2023-08-31T18:37:01Z

Thank you, @vishnu-dev. Would you mind sending a PR with the fix? 🐰

vishnu-dev added the "bug" (Something isn't working) and "help wanted" (Extra attention is needed) labels on Jul 16, 2023