CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders
–Neural Information Processing Systems
CLIPDraw is an algorithm that synthesizes novel drawings from natural language input. It does not require any additional training; rather, a pre-trained CLIP language-image encoder is used as a metric for maximizing similarity between the given description and a generated drawing. Crucially, CLIPDraw operates over vector strokes rather than pixel images, which biases drawings towards simpler human-recognizable shapes.
Neural Information Processing Systems
Oct-10-2024, 07:35:59 GMT
- Technology: