performance with high-res images #85

Open
bertsky opened this issue Aug 16, 2022 · 3 comments
Labels
question Further information is requested

Comments

@bertsky
Contributor

bertsky commented Aug 16, 2022

Sometimes the input comes with DPI 600 or beyond. It seems to me this makes eynollah much slower. Higher resolution might be needed for newspapers, but there is always a point beyond which result quality does not increase anymore. I would assume that a single downscaling interpolation after import should not be too costly.

The documentation of allow_scaling says that it would also scale down images. But the implementation does not look like that's the case:

```python
if dpi < DPI_THRESHOLD:
    img_new, num_column_is_classified = self.calculate_width_height_by_columns(img, num_col, width_early, label_p_pred)
    image_res = self.predict_enhancement(img_new)
    is_image_enhanced = True
else:
    num_column_is_classified = True
    image_res = np.copy(img)
    is_image_enhanced = False
```

IIUC, only too-small images get upsampled. I'd expect a secondary DPI_THRESHOLD2 above which downsampling would begin.
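As a sketch of what such a secondary threshold could look like (all names here, i.e. DPI_THRESHOLD2 and downsample_if_needed, are hypothetical and not eynollah code; a real implementation would rather call cv2.resize with cv2.INTER_AREA):

```python
import numpy as np

DPI_THRESHOLD2 = 400  # hypothetical upper bound, not an eynollah constant

def downsample_if_needed(img, dpi, target_dpi=300):
    """Downscale a too-high-resolution image once, right after import.

    Sketch only: block averaging over integer factors stands in for a
    proper cv2.resize(..., interpolation=cv2.INTER_AREA) call.
    """
    if dpi <= DPI_THRESHOLD2:
        return img, dpi
    factor = round(dpi / target_dpi)          # e.g. 600 DPI -> factor 2
    h = img.shape[0] - img.shape[0] % factor  # crop to a multiple of factor
    w = img.shape[1] - img.shape[1] % factor
    img = img[:h, :w]
    # average each factor x factor block (area interpolation)
    blocks = img.reshape((h // factor, factor, w // factor, factor) + img.shape[2:])
    img = blocks.mean(axis=(1, 3)).astype(img.dtype)
    return img, dpi // factor
```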

@vahidrezanezhad
Member

> Sometimes the input comes with DPI 600 or beyond. It seems to me this makes eynollah much slower. Higher resolution might be needed for newspapers, but there is always a point beyond which result quality does not increase anymore. I would assume that a single downscaling interpolation after import should not be too costly.
>
> The documentation of allow_scaling says that it would also scale down images. But the implementation does not look like that's the case:
>
> ```python
> if dpi < DPI_THRESHOLD:
>     img_new, num_column_is_classified = self.calculate_width_height_by_columns(img, num_col, width_early, label_p_pred)
>     image_res = self.predict_enhancement(img_new)
>     is_image_enhanced = True
> else:
>     num_column_is_classified = True
>     image_res = np.copy(img)
>     is_image_enhanced = False
> ```
>
> IIUC, only too-small images get upsampled. I'd expect a secondary DPI_THRESHOLD2 above which downsampling would begin.

Two points about your comment. First, a DPI of 600 alone cannot make eynollah slower. The problem with high-resolution documents (without the allow_scaling option) is that they cannot be scaled down automatically. allow_scaling should be True, and if the columns are detected correctly, downscaling can happen.

Second, allow_scaling lets you scale down documents with a DPI greater than 300. But downscaling will only happen if it is needed. This means that if the scale of the document is much bigger than the "training scale", then downscaling will be applied.

@bertsky
Contributor Author

bertsky commented Aug 30, 2022

@vahidrezanezhad please help me understand:

> First, a DPI of 600 alone cannot make eynollah slower.

How is that? I can see lots of CPU-bound image processing. Most algorithms are O(n²). And even for the GPU-bound parts: they each need to downscale to the fixed input size of the respective model.
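For scale: pixel count grows quadratically with DPI, so doubling the DPI quadruples the work of even a linear pass over the pixels, and much more for the quadratic algorithms (page size below is A4, purely for illustration):

```python
def megapixels(width_in, height_in, dpi):
    """Pixel count of a scanned page, in megapixels."""
    return width_in * dpi * height_in * dpi / 1e6

# An A4 page (8.27 x 11.69 inches):
print(round(megapixels(8.27, 11.69, 300), 1))  # ~8.7 MP
print(round(megapixels(8.27, 11.69, 600), 1))  # ~34.8 MP, 4x the pixels
```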

> The problem with high-resolution documents (without the allow_scaling option) is that they cannot be scaled down automatically.

Why not? Downsampling (with a suitable interpolation algorithm) should be trivial – as opposed to upsampling, for which you built an elaborate model.
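A toy 1-D example (not eynollah code) of why the interpolation choice matters when shrinking: naive decimation aliases a fine alternating pattern into a constant, while area averaging preserves the mean intensity.

```python
import numpy as np

# high-resolution "image": alternating black/white pixels
signal = np.tile([0.0, 1.0], 100)   # 200 samples, mean 0.5

# naive 2:1 decimation (nearest neighbour): keeps every 2nd sample,
# so the alternating pattern aliases to all-black
nearest = signal[::2]

# area interpolation: average each pair before discarding,
# preserving the original mean intensity
area = signal.reshape(-1, 2).mean(axis=1)
```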

> allow_scaling lets you scale down documents with a DPI greater than 300. But downscaling will only happen if it is needed. This means that if the scale of the document is much bigger than the "training scale", then downscaling will be applied.

I am confused. Where does this actually happen?

@bertsky
Contributor Author

bertsky commented Feb 16, 2023

> I am confused. Where does this actually happen?

Here:

```python
if self.allow_scaling:
    img_org, img_res, is_image_enhanced = self.resize_image_with_column_classifier(is_image_enhanced, img_bin)
```

and:

```python
img_new, _ = self.calculate_width_height_by_columns(img, num_col, width_early, label_p_pred)

if label_p_pred[0][int(num_col - 1)] < 0.9 and img_w_new < width_early:
    img_new = np.copy(img)
    num_column_is_classified = False
else:
    img_new = resize_image(img, img_h_new, img_w_new)
    num_column_is_classified = True
```

(So, essentially, if the column detector is confident enough, there can be downsampling.)
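Stripped of context, that gate can be paraphrased as follows (a simplified sketch; choose_image and the nearest-neighbour stand-in for resize_image are illustrative, not eynollah's actual helpers):

```python
import numpy as np

def choose_image(img, new_shape, confidence, width_early, threshold=0.9):
    """Accept a proposed resize only if the column classifier is confident
    enough, or if the resize would not shrink below the original width."""
    img_h_new, img_w_new = new_shape
    if confidence < threshold and img_w_new < width_early:
        return np.copy(img), False          # keep the original size
    # accept: nearest-neighbour resize stands in for resize_image()
    ys = np.linspace(0, img.shape[0] - 1, img_h_new).round().astype(int)
    xs = np.linspace(0, img.shape[1] - 1, img_w_new).round().astype(int)
    return img[ys][:, xs], True
```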

@cneud added the question label Aug 17, 2023