I was curious how much text-to-image AI has improved since the release of Dall-E in January 2021, which was followed by Dall-E 2 in April 2022.
Text: Zebra Unicorn in Space
Below is the one from Dall-E 2:
Clearly, Crayon picks up only the cues for ‘Zebra’ and ‘Space’ and misses the nuance of ‘Unicorn’. The face is also blurred, and the zebra is mostly in a running or standing position. With Dall-E 2, by contrast, we get a clear face, with the zebra (and a unicorn) in different positions.
This curiosity led me to explore whether the advances in OpenAI’s models have also reduced algorithmic bias, only to end up disappointed!
Text: Successful Professional
Clearly, all light-skinned people, and mostly males! Dall-E 2 was even worse: