reproducibilityindex.ai

Fast Sampling-Based Sketches for Tensors

Authors: William Joseph Swartworth, David Woodruff

ICML 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	We evaluate the correctness of the ℓ0 samplers. Our ℓ0-sampler is theoretically guaranteed to output a uniformly random entry of the support, up to some constant factor. That is, the probability we output a fixed entry of the support of a tensor X is [ c1 \|supp(X)\|, c2 \|supp(X)\|] for some absolute constants c1 and c2. In order to keep our analysis simple, the constants c1 and c2 gotten from unwinding our proof are more extreme than necessary. We remedy this by empirically showing that our sampler is in fact much closer to uniform. All of our experiments suggest that our ℓ0 sampling procedure behaves very nearly perfectly on the tensors described above. All experiments are accurate to within a few percent of what one would expect for uniform sampling. See section C in the appendix for tables showing our experimental data.
Researcher Affiliation	Academia	William Swartworth 1 David Woodruff 1 1Carnegie Mellon University.
Pseudocode	No	The paper does not contain explicit pseudocode or algorithm blocks, only mathematical descriptions of constructions and proofs.
Open Source Code	Yes	The code for the experiments, along with additional implementations of our sketch are available at https:// github.com/wswartworth/tensor Sampling.
Open Datasets	No	The paper describes custom-generated data for experiments (e.g., "disjoint rectangle support", "rectangle B... with an additional ... random entries") rather than using a named publicly available dataset with a specific access link or formal citation.
Dataset Splits	No	The paper mentions running "1000 trials" on tensors of a certain shape but does not specify explicit training, validation, or test dataset splits.
Hardware Specification	No	The paper does not specify any particular hardware components such as GPU or CPU models used for running the experiments.
Software Dependencies	No	The paper does not provide specific software dependencies with version numbers (e.g., Python 3.8, PyTorch 1.9).
Experiment Setup	Yes	We choose our ℓ0 sampler to have 10 buckets at each sampling level. For an N N N tensor X our sampling rates begin at 1/N 3 and increase in powers of 5. ... All experiments are carried out on a tensor of shape 40 40 40 for 1000 trials.