Re-thinking inverse graphics with large language models

P Kulits, H Feng, W Liu, V Abrevaya… - arXiv preprint arXiv …, 2024 - arxiv.org
Inverse graphics--the task of inverting an image into physical variables that, when rendered,
enable reproduction of the observed scene--is a fundamental challenge in computer vision …

Can Large Language Models Understand Symbolic Graphics Programs?

Z Qiu, W Liu, H Feng, Z Liu, TZ Xiao, KM Collins… - arXiv preprint arXiv …, 2024 - arxiv.org
Against the backdrop of enthusiasm for large language models (LLMs), there is an urgent
need to scientifically assess their capabilities and shortcomings. This is nontrivial in part …

ShapeWalk: Compositional shape editing through language-guided chains

H Slim, M Elhoseiny - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Editing 3D shapes through natural language instructions is a challenging task that requires
the comprehension of both language semantics and fine-grained geometric details. To …

SceneMotifCoder: Example-driven visual program learning for generating 3D object arrangements

HII Tam, HID Pun, AT Wang, AX Chang… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite advances in text-to-3D generation methods, generation of multi-object
arrangements remains challenging. Current methods exhibit failures in generating …

Learning to Edit Visual Programs with Self-Supervision

RK Jones, R Zhang, A Ganeshan, D Ritchie - arXiv preprint arXiv …, 2024 - arxiv.org
We design a system that learns how to edit visual programs. Our edit network consumes a
complete input program and a visual target. From this input, we task our network with …

Learning to infer generative template programs for visual concepts

RK Jones, S Chaudhuri, D Ritchie - arXiv preprint arXiv:2403.15476, 2024 - arxiv.org
People grasp flexible visual concepts from a few examples. We explore a neurosymbolic
system that learns how to infer programs that capture visual concepts in a domain-general …

Leroy: Library Learning for Imperative Programming Languages

A Bellur, R Alghamdi, K Workneh… - arXiv preprint arXiv …, 2024 - arxiv.org
Library learning is the process of building a library of common functionalities from a given
set of programs. Typically, this process is applied in the context of aiding program synthesis …

DPA-Net: Structured 3D abstraction from sparse views via differentiable primitive assembly

F Yu, Y Qian, X Zhang, F Gil-Ureta, B Jackson… - … on Computer Vision, 2024 - Springer
We present a differentiable rendering framework to learn structured 3D abstractions in the
form of primitive assemblies from sparse RGB images capturing a 3D object. By leveraging …

Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy

A Ganeshan, T Groueix, P Guerrero, R Měch… - arXiv preprint arXiv …, 2024 - arxiv.org
Pattern images are everywhere in the digital and physical worlds, and tools to edit them are
valuable. But editing pattern images is tricky: desired edits are often programmatic: structure …

ShapeLib: designing a library of procedural 3D shape abstractions with Large Language Models

RK Jones, P Guerrero, NJ Mitra, D Ritchie - arXiv preprint arXiv …, 2025 - arxiv.org
Procedural representations are desirable, versatile, and popular shape encodings.
Authoring them, either manually or using data-driven procedures, remains challenging, as a …