Katherine Crawson is @ Eletheur & IMHO is indisputably most responsible for the advances in text=>image generation. Dall-E 2 is Dall-E and her insight to use diffusion, the intermediate proof of concept of diffusion + Dall-E is GLIDE.
They did invent the idea of applying it to image generation, leading to OpenAI citing her _tweets_ (how cool is that?) in a paper for GLIDE, which as other comments note, looks just like a proof of concept of DallE-2.
https://twitter.com/RiversHaveWings & https://github.com/crowsonkb