2f3c6a4cd8af177f6456e7e51a916ff3-Supplemental.pdf
–Neural Information Processing Systems
"Name" is the name of the operation in our search space. "TFFunction" is the TensorFlow function that the name is mapped to when a DNA instruction is being converted to a line of TensorFlow code. "Argument Mapping" describes how the values in a DNA's argument set are mapped to the corresponding TensorFlow function arguments. This vocabulary is largely constructed from the lowest level TF operations needed to create Transformers (see Appendix A.5). We also add commonly used math primitives such as SIN and ABS. Here we provide additional implementation details. Relative Dimensions: We use relative dimensions [13] instead of absolute dimensions for each instruction's "dimension size" argument. This allows us to resize the models to fit within our parameter limits (32M to 38M parameters). The vocabulary for these relative dimensions is [1, 2, 4, 8, 12, 16, 24, 32, 48, 64].
Neural Information Processing Systems
Apr-25-2026, 08:28:51 GMT
- Country:
- North America > United States (0.28)
- Genre:
- Research Report
- New Finding (0.46)
- Experimental Study (0.46)
- Research Report
- Industry:
- Energy > Power Industry (0.46)
- Technology: