Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..
A Regular Matching Constraint for String Variables
Authors: Roberto Amadini, Peter J. Stuckey
IJCAI 2023 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Empirical evidences show the effectiveness of our approach, implemented within the constraint programming framework, and tested against state-ofthe-art string solvers. Empirical results underline the importance of implementing native support for the match constraint. We compared the performance of our approach against different state-of-the-art string solving technologies. We implemented our match propagator in G-STRINGS, a string solver based on CP solver GECODE. We compare its performance against the approach using decomposition (1), which we will call G-STRINGSdec. We also include in the evaluation the state-of-the-art SMT solvers CVC5 [Barbosa et al., 2022] and Z3 [de Moura and Bjรธrner, 2008]. Fig. 3 shows the runtimes in seconds for n = 2, 3, . . . , 26. The results are shown in Tab. 1 |
| Researcher Affiliation | Collaboration | Roberto Amadini1 , Peter J. Stuckey2,3 1University of Bologna, Bologna, Italy 2Monash University, Melbourne, Victoria, Australia 3OPTIMA ARC Industrial Transformation and Training Centre, Melbourne, Australia |
| Pseudocode | Yes | Algorithm 1 PROPMATCH function. Algorithm 2 CHECKBLOCK function. |
| Open Source Code | No | We will disclose the source code in case of acceptance of the paper. |
| Open Datasets | No | For the Latin Square problem: "We use 2n string variables: r1, . . . , rn for the rows and c1, . . . , cn for the columns". For Shortest Matching Superstring: "We generated 16 sets Sl,n of patterns for n {32, 64, 128, 256} and l {5, 10, 15, 20}. Each Sl,n contains n random patterns such that there exists a string of length l matched by all patterns (but there could be a shorter one)." While they describe how they generated their problem instances/data, there is no statement about public availability or access information for these generated datasets. |
| Dataset Splits | No | The paper describes how problem instances were generated (e.g., Latin square for n=2 to 26, Shortest matching superstring with n and l parameters) but does not specify formal training, validation, or test splits of any dataset. It implies evaluation on generated instances. |
| Hardware Specification | Yes | We run all the experiments on a Ubuntu 22.04 machine with 8 GB of RAM and 1.60 GHz Intel i5 CPU, with a timeout of 300s. |
| Software Dependencies | Yes | We used CVC5 1.0.2 and Z3 4.11.2 with Z3str3 string solver (we also tried Z3seq, which did not improve Z3 s performance). |
| Experiment Setup | Yes | We encoded the problem in Mini Zinc [Nethercote et al., 2007] for G-STRINGS and G-STRINGSdec, while for SMT solvers we used the SMT-LIB [Barrett et al., 2010] language. For SMT solvers, we replaced each match constraint with a corresponding str.indexof function... We run all the experiments on a Ubuntu 22.04 machine with 8 GB of RAM and 1.60 GHz Intel i5 CPU, with a timeout of 300s. For the shortest matching superstring problem: "l is an upper bound on the length of the shortest string matched by all patterns. We use this information to provide an initial bound for each problem." |