sia.hackernoon.com

Inverting the Observation Model: How to Generate Code from Any Output

TL;DR →

Detailing the process of training a conditional neural network to invert this observation model, starting with a randomly sampled program and incrementally denoising it to match a target output.

Table of Links

Abstract and 1. Introduction

Appendix

A. Mutation Algorithm

B. Context-Free Grammars

C. Sketch Simulation

D. Complexity Filtering

E. Tree Path Algorithm

F. Implementation Details

3.2 Policy

3.2.1 Forward Process

3.2.2 Reverse Mutation Paths

Since we have access to the ground-truth mutations, we can generate targets to train a neural network by simply reversing the sampled trajectory through the forward process Markov-Chain, z0 → z1 → . . .. At first glance, this may seem a reasonable choice. However, training to simply invert the last mutation can potentially create a much noisier signal for the neural network.

Consider the case where, within a much larger syntax tree, a color was mutated as,

Authors:

(1) Shreyas Kapur, University of California, Berkeley ([email protected]);

(2) Erik Jenner, University of California, Berkeley ([email protected]);

(3) Stuart Russell, University of California, Berkeley ([email protected]).

This paper is available on arxiv under CC BY-SA 4.0 DEED license.

This story on HackerNoon has a decentralized backup on Sia.

Meta Data: 📄