reproducibilityindex.ai

An Iterative Approach to Synthesize Data Transformation Programs

Authors: Bo Wu, Craig A. Knoblock

IJCAI 2015 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We evaluated the approach with a variety of transformation scenarios. The results show that the approach signiﬁcantly reduces the time used to generate the transformation programs, especially in complicated scenarios.
Researcher Affiliation	Academia	Bo Wu Computer Science Department University of Southern California Los Angeles, California bowu@isi.edu Craig A. Knoblock Information Science Institute University of Southern California Los Angeles, California knoblock@isi.edu
Pseudocode	Yes	Algorithm 1: Program Adaptation
Open Source Code	Yes	Data and code are available at http://bit.ly/1Gt Z4Gc. The code is also available as the data transformation tool of Karma (http://www.isi.edu/integration/karma).
Open Datasets	Yes	Data and code are available at http://bit.ly/1Gt Z4Gc.
Dataset Splits	No	The paper mentions providing examples iteratively until programs transform all records correctly but does not specify any training/validation/test splits.
Hardware Specification	Yes	We performed the experiments on a laptop with 8G RAM and 2.66GHz CPU.
Software Dependencies	No	The paper mentions using Karma, Gulwani's approach, Metagol DF, and Flashfill, but does not specify version numbers for any key software dependencies.
Experiment Setup	No	The paper describes the comparison methodology and how time was measured for program generation, but it does not provide specific hyperparameters or system-level training settings like learning rates, batch sizes, or optimizer configurations, as might be found in typical machine learning setups.