This is probably because it can't actually understand images; it's relying on other services to handle them. It can do an image search to guess what something might be an image of, or use OCR to extract text, but it fails on tasks that involve the idiosyncrasies of each particular image.
I believe it uses something like an image-to-text model to give the LLM a general understanding of the image, maybe even in the form of embedding vectors. That makes it good at summarizing the content of an image but unable to do the operations you listed.
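A minimal sketch of that hypothesized pipeline (this is my assumption about the architecture, not Bard's actual implementation; the service functions here are hypothetical stand-ins). The point it illustrates: if the LLM only ever sees a lossy text summary, two images that differ at the pixel level can collapse to the same description, so tasks like spot-the-difference or counting become impossible in principle.

```python
def caption_service(image_pixels):
    # Stand-in for an image-to-text model: returns a coarse,
    # fixed-vocabulary description regardless of fine detail.
    return "a photo of a small grid pattern"

def ocr_service(image_pixels):
    # Stand-in for OCR: would extract any embedded text (none here).
    return ""

def ask_llm_about_image(image_pixels, question):
    # The "LLM" only receives text, never pixels, so pixel-level
    # questions are unanswerable no matter how good the LLM is.
    context = (caption_service(image_pixels) + " " + ocr_service(image_pixels)).strip()
    return f"Context: '{context}'. Question: {question}"

# Two images that differ in exactly one pixel...
img_a = [[0, 1], [1, 0]]
img_b = [[0, 1], [1, 1]]

# ...produce identical summaries, so the difference is lost upstream.
print(caption_service(img_a) == caption_service(img_b))  # True
```

This would also explain why OCR-style tasks work well: the text extraction step preserves exactly the information those tasks need, while everything else is thrown away.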
Some things I tried that it failed at:
* Simple "spot the difference" a young child could do
* Counting chess pieces on a board or coins in a Mario screenshot
* Evaluating a graph of y=x and one of y=x^2
* Explaining a meme image (said it can't do anything with images of people)
* Taking a screenshot & giving HTML for it (gave random HTML unrelated to the image)
* Taking an image and converting it to SVG (gave random SVG unrelated to the image)
* Describing a photo of a car (it was parked on a road, Bard said it was driving in front of a brick wall)
It was quite good at OCR though, even on images that are pretty tough for other models (e.g. serial numbers on industrial parts).