Add support for high bit depth multichannel images #1888

wiredfool · 2016-05-05T16:09:20Z

Pillow (and PIL) is currently able to open 8 bit per channel multi-channel images (such as RGB) but is able to open higher bit depth images (e.g. I16, I32, or Float32 images) if they are single channel (e.g., grayscale).

Previous References

This has been requested many times: #1828, #1885, #1839, #1602, and farther back.

Requirements

We should be able to support common GIS formats as well as high bit depth RGB(A) images.
At least 4 channels, but potentially more (see Add tests for opening 2-5 layer uint16 greyscale TIFFs #1839)
Different pixel formats, including I16, I32, and Float.
There should be definitions for the array interface to exchange images with numpy/scipy
There should be enough support to read and write TIFFs and raw image data.
Support for resize, crop, and convert operations at the very least.

Background Reference Info

The rough sequence for image loading is:

Image file is opened
Each of the ImagePlugin _accept functions have a chance to look at the first few bytes to determine if they should attempt to open the file
The *ImagePlugin._open method is called giving the image plugin a chance to read more of the image and determine if it still wants to consider it a valid image of it's particular type. If it does, it passes back a tile definition which includes a decoder and an image size.
If there is a successful _open call, at some point later *ImagePlugin._load may be called on the image, which runs the decoder producing a set of bytes in a raw mode. This is where things like compression are handled, but the output of the decoder is not necessarily what we're storing in our internal structures.
The image is unpacked (Unpack.c) from the raw mode (e.g. I16;BS) into a storage (Storage.c) mode (I).
It's now possible to operate on the image (e.g. crop, pixel access, etc)

There are 3 (or 4) image data pointers, as defined in Imaging.h:

struct ImagingMemoryInstance {

    /* Format */
    char mode[IMAGING_MODE_LENGTH]; /* Band names ("1", "L", "P", "RGB", "RGBA", "CMYK", "YCbCr", "BGR;xy") */
    int type;       /* Data type (IMAGING_TYPE_*) */
    int depth;      /* Depth (ignored in this version) */
    int bands;      /* Number of bands (1, 2, 3, or 4) */
    int xsize;      /* Image dimension. */
    int ysize;

    /* Colour palette (for "P" images only) */
    ImagingPalette palette;

    /* Data pointers */
    UINT8 **image8; /* Set for 8-bit images (pixelsize=1). */
    INT32 **image32;    /* Set for 32-bit images (pixelsize=4). */

    /* Internals */
    char **image;   /* Actual raster data. */
    char *block;    /* Set if data is allocated in a single block. */

    int pixelsize;  /* Size of a pixel, in bytes (1, 2 or 4) */
    int linesize;   /* Size of a line, in bytes (xsize * pixelsize) */

    /* Virtual methods */
    void (*destroy)(Imaging im);
};

The only one that is guaranteed to be set is **image, which is an array of pointers to row data.

Changes Required

Definitions for all of the modes that we're planning, and potentially a [format];MB[#bands] style generic mode.

Core Imaging Structure

The imaging structure has the fields required to add the additional channels. (type, bands, pixelsize, linesize)
The **image pointer can be used for any width of pixel.
We may or may not want to set the **image32 pointer.
Currently type of IMAGING_TYPE_INT32 and IMAGING_TYPE_FLOAT32 imply 1 band. This will change.
Consider promoting int16 to IMAGING_TYPE_INT16

Storage

Updates to Storage.c, Unpack.c, Pack.c, Access.c, PyAccess.py, and Convert.c

Ways to Help

We need a better definition of the format requirements. What are the various types of images that are used in GIS, Medical, or other fields that we'd want to interpret? We need small, redistributable versions of images that we can test against.

[in progress]

The text was updated successfully, but these errors were encountered:

terramars · 2016-05-23T15:25:08Z

I'm having the same problem with 16 bit single-channel paletted TIFFs, created by GDAL. It would be "really" nice if Pillow could play nicely with GIS and scientific image formats, as GDAL is a pain in the ass and I'd rather not use it.

tiffinfo as follows:

TIFFReadDirectory: Warning, Unknown field with tag 33550 (0x830e) encountered.
TIFFReadDirectory: Warning, Unknown field with tag 33922 (0x8482) encountered.
TIFFReadDirectory: Warning, Unknown field with tag 34735 (0x87af) encountered.
TIFFReadDirectory: Warning, Unknown field with tag 34737 (0x87b1) encountered.
TIFFReadDirectory: Warning, Unknown field with tag 42113 (0xa481) encountered.
TIFF Directory at offset 0x34293c6 (54694854)
Image Width: 10774 Image Length: 12577
Bits/Sample: 16
Sample Format: unsigned integer
Compression Scheme: LZW
Photometric Interpretation: palette color (RGB from colormap)
Samples/Pixel: 1
Rows/Strip: 1
Planar Configuration: single image plane
Color Map: (present)
Tag 33550: 4.999617,4.999789,0.000000
Tag 33922: 0.000000,0.000000,0.000000,679006.067110,9955209.915048,0.000000
Tag 34735: 1,1,0,7,1024,0,1,1,1025,0,1,1,1026,34737,22,0,2049,34737,7,22,2054,0,1,9102,3072,0,1,32736,3076,0,1,9001
Tag 34737: WGS 84 / UTM zone 36S|WGS 84|
Tag 42113: 0
Predictor: horizontal differencing 2 (0x2)

bodokaiser · 2017-03-14T11:14:08Z

Any updates on this?

wiredfool · 2017-03-14T21:52:59Z

Unfortunately, no.

vfdev-5 · 2018-02-20T21:02:44Z

@wiredfool what do you think about to add the support of multichannel images as sequence of Image ? For example, 4 channels image with uint16 is represented (more less equivalently) by
['<PIL.Image.Image image mode=I;16 size=... >', '<PIL.Image.Image image mode=I;16 size=...>', ..., '<PIL.Image.Image image mode=I;16 size=...>']. I mean by that, maybe, to provide a class inheriting from Image and tuple and override all method to work on a tuple of images... Sure that it looks like a hack, however it could unlock more features (and create issues :) ) at least while working with Image.fromarray.

wiredfool · 2018-02-21T07:39:47Z

To do anything useful with it, we'd have to have support in the C layer, so it would have to be at the core imaging layer, and especially Unpack/Pack.

vfdev-5 · 2018-02-21T21:37:57Z

@wiredfool following your "Ways to help",

We need a better definition of the format requirements. What are the various types of images that are used in GIS, Medical, or other fields that we'd want to interpret?

For GIS, as there is a huge amount of different formats (for example, gdal format list), this can be left for GIS libraries as gdal, rasterio etc.
However, a support of Image.fromarray on input multi-channel (3,4,5,...) arrays of dtype np.uint16, np.float32 would be, imho, essential.

We need small, redistributable versions of images that we can test against.

For GIS imagery, this can be easily created manually with gdal, rasterio.

I would like to give a hand on this, so, feel free to ask me.

edowson · 2018-06-07T18:42:13Z

PIL cannot handle processing multi-channel images. They get truncated to 3-ch images if you perform any transformation using PIL. #3160

akinuri · 2018-06-08T13:20:28Z

Related: How to open a TIF (CMYK, 16-bit) image file?

bjtho08 · 2019-02-15T14:21:59Z

What is the status of this issue? It has been almost three years since the first proposal. I am unfortunately unable to provide any help since I have zero experience with coding in C, but I am among the people that is awaiting support for e.g. multi-channel floating-point images (with possibilities for negative pixel values). This especially useful in deep learning, where it is preferable to have all values normalized with zero mean. PIL has some really awesome ImageOps, which is one of the reasons for wanting this support.

hugovk · 2019-02-17T14:00:40Z

@bjtho08 No updates.

#2485 links to a multipage RGB TIFF containing float64 values.

omaghsoudi · 2019-07-05T01:42:56Z

Please fix the issue with multi-channel 16 bit images.
Thank you!

aclark4life · 2024-03-19T22:38:07Z

Maybe can make some progress on this in 2024, pending acceptance of AcademySoftwareFoundation/tac#631

aclark4life · 2024-04-15T22:44:27Z

in favor of dealing directly with np.arrays and cv2 functions for manipulating the data as images. It's not as convenient as what PIL offers but 8bit is a deal breaker.

@herronelou Can you (or anyone?) say any more about the convenience of PIL and how meaningful > 8 bit multichannel support in PIL would be? Would you switch back to PIL if this feature were added and would you expect an uptick in usage from VFX studios in general? I got interested in VFX recently so I'm especially curious about this issue now.

terramars · 2024-04-15T22:59:11Z

I can just say that for GIS, if you want to deal with tiffs that aren't extremely simple you're stuck going into gdal internals to do anything, even just read them into an array. I'm still sad 7 years later I had to waste time learning that tool and couldn't just do Image.open on them. Maybe someone else implemented it by now but I doubt.

…

On Mon, Apr 15, 2024, 3:44 PM Jeffrey A. Clark ***@***.***> wrote: in favor of dealing directly with np.arrays and cv2 functions for manipulating the data as images. It's not as convenient as what PIL offers but 8bit is a deal breaker. @herronelou <https://github.com/herronelou> Can you (or anyone?) say any more about the convenience of PIL and how meaningful > 8 bit multichannel support in PIL would be? Would you switch back to PIL if this feature were added and would you expect an uptick in usage from VFX studios in general? I got interested in VFX recently so I'm especially curious about this issue now. — Reply to this email directly, view it on GitHub <#1888 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABHA64K6XBSEB6M3GBVL2ZTY5RJ6JAVCNFSM4CDASTI2U5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TEMBVG44TGNJYGY4Q> . You are receiving this because you commented.Message ID: ***@***.***>

herronelou · 2024-04-15T23:15:35Z

@aclark4life For the most part, VFX studios tend to work with EXR file formats. Internally most of our softwares process in 32bit float, although saving the resulting images in 16bit float is usually enough, except for a small number of specific data passes that we tend to store in other channels.

I've not been doing much personally recently that could have used PIL, the main cases I've run into when I posted were external tools we brought into our pipeline that used PIL for their image reading, and we had to strip it away so we could run our 16bit float images through without the loss caused by going through 8bit, so yes, absolutely, if PIL supported those natively we wouldn't need to go out of our way to strip PIL away when somebody uses it, which would be great.

rbavery · 2024-04-17T17:42:28Z

PIL is used in many ML frameworks for reading images, like FastAI and detectron2 and countless ML projects. When someone tries to use these frameworks or projects as examples with their high bit depth multichannel images, often the first thing to cause grief is this issue. On multiple occasions I've had to rewrite image data loaders for ML because Pillow does not support multichannel float32 tifs. This imagery is really common in geospatial analysis, most satellite imagery comes in high bit depth.

aclark4life · 2024-04-23T13:11:44Z

@cgohlke Does any of your code here potentially help us by way of example to implement high bit depth multichannel in Pillow? https://github.com/cgohlke/tifffile/blob/master/tifffile/_imagecodecs.py

Thanks for any info

aclark4life · 2024-05-15T14:30:29Z

Via @wiredfool , thanks!

I think that there's a good argument for planar image storage, i.e. r/g/b in separate arrays. Any single band calculation would just work, and the more complicated modes (e.g., channels with different bit depth) would be trivial to add, as they would essentially just be part of a list of planes.It would complicate the shufflers, and especially those image formats that currently just splat into an array without using the packer/unpacker. It's also less useful for luminance style calculations, though it's possible. There's definitely a tension in image formats on the interleaved vs planar approach, and I suspect it comes down to "one is easier for basic images, and one is more general.
I think there's a super strong argument for being able to have our storage be directly compatible with the arrow memory layout. I'm unclear if we could have arbitrary structs there, if we'd just want a linear array of one datatype, or if we'd want to do a tensor layout, or what the mechanics are for a dataframe style interop. Arrow + the evolution of the array interface would give us 0 copy interaction with polars/pandas2 and anything else in the new data space.
I think that interleaved storage with anything more than 1|3|4 channel x [list of pixel storage modes] is going to be a pain.
GIS is going to be a pain. I'd still recommend using gdal backed (e.g. rasterio) readers/writers for that, as we've got 0 support for pyramids, spatial metadata, and tiled tiffs. It's a huge field, and we're not even at square 1 for it.

So looking at that, I think there's two definite possibilities for progress.

Planar Image Storage, in parallel with the current interleaved image storage. There's probably a couple of core bits here that would need to be in C, but most could probably be done at the Image.py layer.
Arrow as a core storage interface. This is going to be all c, with a very small shim for the dataframe interface.

aclark4life · 2024-05-17T00:21:26Z

Also possibly of interest: https://github.com/girder/large_image

wiredfool · 2024-05-17T13:36:40Z

FWIW, some references on Arrow.

https://arrow.apache.org/docs/format/Columnar.html The columnar format
https://arrow.apache.org/docs/format/Other.html A reference to the tensor arrangement.
https://arrow.apache.org/docs/python/interchange_protocol.html Dataframe interchange protocol.
https://github.com/apache/arrow-nanoarrow Nano-arrow, a very small implementation of the arrow layout
https://arrow.apache.org/docs/python/index.html PyArrow. I'm not sure we'd want to pull this in as a dependency, but it's a full set of bindings against the C++ Arrow interface.

aclark4life · 2024-05-29T20:32:53Z

Can anyone suggest some test data we can use to develop this feature? This event is happening tomorrow and would be nice to have a success target in mind e.g. "If we can read/write this type of data …" https://www.meetup.com/dcpython/events/301086016/

rbavery · 2024-05-30T20:22:48Z

I think that interleaved storage with anything more than 1|3|4 channel x [list of pixel storage modes] is going to be a pain.

In case it isn't too much pain to work with more than 4 bands, we host this example subset of Eurosat, here is an example image s3://wherobots-examples/data/eurosat_small/Highway/Highway_1.tif.

Each image is 13 bands, uint16, planar

>>> tiff_image = tifffile.TiffFile("Highway_1.tif")
>>> print(tiff_image.pages[0].tags['PlanarConfiguration'].value)
PLANARCONFIG.CONTIG

aclark4life · 2024-05-31T22:39:34Z

@wiredfool If we use Arrow that implies adding a dependency on pyarrow, ideally optionally via extras like pip install pillow[arrow], correct?

wiredfool · 2024-06-01T16:01:30Z

@aclark4life Maybe. There's definitely a C-only implementation (nanoarrow) that might be what we want, since all of our image allocations are in the C layer now. PyArrow might be easier for integration/interop at the high level, but my sense here is that it wouldn't necessarily be giving us a whole lot that we'd not already have with a C arrow implementation + our usual set of accessors.

aclark4life · 2024-06-20T10:44:11Z

Folks interested in this issue, please test #8224 and give feedback, thanks all

wiredfool added the Enhancement label May 5, 2016

wiredfool self-assigned this May 5, 2016

wiredfool mentioned this issue May 5, 2016

Unable to read 16bit tiff #1828

Closed

bodokaiser mentioned this issue Mar 14, 2017

support int16 grayscale images pytorch/vision#105

Closed

hugovk mentioned this issue Apr 12, 2017

Unable to open multipage multichannel TIFF #2485

Closed

wiredfool mentioned this issue May 29, 2017

Tiff: IOError: cannot identify 16-bit TIF file #2556

Closed

vigliensoni mentioned this issue Sep 11, 2017

Tiff: IOError: cannot identify 16-bit TIF file DDMAL/pil_rodan#1

Closed

hugovk mentioned this issue Jan 12, 2018

Error when loading image #2955

Closed

Dref360 mentioned this issue Jan 25, 2018

Minor change: Added tif and tiff to white_list_formats keras-team/keras#9187

Merged

Amir-Arsalan mentioned this issue Apr 4, 2018

Unable to properly read multi-channel 16-bit png files imageio/imageio#329

Closed

hugovk mentioned this issue Jun 7, 2018

Multi-channel images get truncated to 3 channels #3160

Closed

aclark4life added NumPy and removed NumPy labels Jun 30, 2018

radarhere mentioned this issue Jan 6, 2019

Cannot read TIFF file #3536

Closed

tomgoddard mentioned this issue Mar 7, 2019

Cannot open 16-bit lossless jpeg with Pillow pydicom/pydicom#813

Closed

adamjstewart mentioned this issue May 2, 2019

No support for multi-channel images pytorch/vision#882

Closed

aclark4life added this to Backlog in Pillow May 11, 2019

aclark4life moved this from Backlog to In progress in Pillow May 11, 2019

kiblee mentioned this issue Jul 5, 2019

Keras issue for loading uint16 (16 bits) images by ImageDataGenerator keras-team/keras#13023

Closed

aclark4life mentioned this issue Mar 21, 2024

Additional financial or other support #7610

Open

aclark4life changed the title ~~Tracking Issue for high bit depth multichannel images~~ Add support for high bit depth multichannel images Apr 15, 2024

This was referenced Apr 18, 2024

Follow-up discussion on New Project Proposal - Pillow AcademySoftwareFoundation/tac#631

Closed

cannot identify image file (PNG file from scanner) #7993

Closed

aclark4life mentioned this issue Apr 24, 2024

Add JPEG XL Open/Read support via libjxl #7848

Open

aclark4life added the Hasn't worked in 20 years label May 18, 2024

radarhere mentioned this issue May 31, 2024

Demonstrated change #8094

Closed

aclark4life mentioned this issue Jun 9, 2024

Update license identifier to MIT-CMU #7942

Open

aclark4life mentioned this issue Jun 20, 2024

Hack202406 #8154

Closed

This was referenced Jun 20, 2024

Add support for high bit depth multichannel images #8157

Closed

transpose() with multi-band format #8161

Merged

aclark4life mentioned this issue Jul 10, 2024

Add support for high bit depth multichannel images #8223

Closed

yoursunny linked a pull request Jul 10, 2024 that will close this issue

Add support for high bit depth multichannel images #8224

Open

aclark4life pinned this issue Jul 31, 2024

aclark4life mentioned this issue Aug 3, 2024

Add GPU/CUDA Support? #5787

Closed

wiredfool mentioned this issue Aug 25, 2024

Arrow Support #8329

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for high bit depth multichannel images #1888

Add support for high bit depth multichannel images #1888

wiredfool commented May 5, 2016 •

edited

Loading

terramars commented May 23, 2016

bodokaiser commented Mar 14, 2017

wiredfool commented Mar 14, 2017

vfdev-5 commented Feb 20, 2018

wiredfool commented Feb 21, 2018

vfdev-5 commented Feb 21, 2018

edowson commented Jun 7, 2018 •

edited

Loading

akinuri commented Jun 8, 2018

bjtho08 commented Feb 15, 2019

hugovk commented Feb 17, 2019

omaghsoudi commented Jul 5, 2019 •

edited by radarhere

Loading

aclark4life commented Mar 19, 2024

aclark4life commented Apr 15, 2024

terramars commented Apr 15, 2024 via email

herronelou commented Apr 15, 2024

rbavery commented Apr 17, 2024 •

edited

Loading

aclark4life commented Apr 23, 2024

aclark4life commented May 15, 2024

aclark4life commented May 17, 2024

wiredfool commented May 17, 2024

aclark4life commented May 29, 2024 •

edited

Loading

rbavery commented May 30, 2024 •

edited

Loading

aclark4life commented May 31, 2024

wiredfool commented Jun 1, 2024

aclark4life commented Jun 20, 2024 •

edited

Loading

Add support for high bit depth multichannel images #1888

Add support for high bit depth multichannel images #1888

Comments

wiredfool commented May 5, 2016 • edited Loading

Previous References

Requirements

Background Reference Info

Changes Required

Core Imaging Structure

Storage

Ways to Help

terramars commented May 23, 2016

bodokaiser commented Mar 14, 2017

wiredfool commented Mar 14, 2017

vfdev-5 commented Feb 20, 2018

wiredfool commented Feb 21, 2018

vfdev-5 commented Feb 21, 2018

edowson commented Jun 7, 2018 • edited Loading

akinuri commented Jun 8, 2018

bjtho08 commented Feb 15, 2019

hugovk commented Feb 17, 2019

omaghsoudi commented Jul 5, 2019 • edited by radarhere Loading

aclark4life commented Mar 19, 2024

aclark4life commented Apr 15, 2024

terramars commented Apr 15, 2024 via email

herronelou commented Apr 15, 2024

rbavery commented Apr 17, 2024 • edited Loading

aclark4life commented Apr 23, 2024

aclark4life commented May 15, 2024

aclark4life commented May 17, 2024

wiredfool commented May 17, 2024

aclark4life commented May 29, 2024 • edited Loading

rbavery commented May 30, 2024 • edited Loading

aclark4life commented May 31, 2024

wiredfool commented Jun 1, 2024

aclark4life commented Jun 20, 2024 • edited Loading

wiredfool commented May 5, 2016 •

edited

Loading

edowson commented Jun 7, 2018 •

edited

Loading

omaghsoudi commented Jul 5, 2019 •

edited by radarhere

Loading

rbavery commented Apr 17, 2024 •

edited

Loading

aclark4life commented May 29, 2024 •

edited

Loading

rbavery commented May 30, 2024 •

edited

Loading

aclark4life commented Jun 20, 2024 •

edited

Loading