Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].
On the Optimal Weighted $\ell_2$ Regularization in Overparameterized Linear Regression
Authors: Denny Wu, Ji Xu
NeurIPS 2020 | Venue PDF | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We illustrate the results of Theorem 1 in Figure 2 (noiseless case) and Figure 8 (noisy case) for both discrete and continuous design for dx and dβ with Ī£x = diag(dx), Σβ = diag(dβ) and Ī£w = I (see design details in Appendix D). In Figure 2, we plot the prediction risk of all three joint relations deļ¬ned above (see Appendix D for details). In Figure 4 we conļ¬rm our ļ¬ndings in Theorem 4 (for additional results on different distributions see Figure 10). Speciļ¬cally, we set Ī£w = I, Ī£x = diag(dx) and Σβ = Σα x. Theorem 10 is supported by Figure 6, where we plot the prediction risk of the generalized ridge regression estimator under different Ī£w and optimally tuned Ī»opt. We demonstrate the effectiveness of this heuristic in Figure 7. |
| Researcher Affiliation | Academia | Denny Wu University of Toronto and Vector Institute EMAIL Ji Xu Columbia University EMAIL |
| Pseudocode | No | The paper focuses on mathematical derivations and analysis, and does not include any pseudocode or algorithm blocks. |
| Open Source Code | No | The paper does not mention releasing any source code or provide any links to a code repository for the described methodology. |
| Open Datasets | No | The paper analyzes a linear model with a random design setting and general data covariance, and conducts simulations to verify theoretical results. It does not use or provide access information for a publicly available or open dataset in the traditional sense, as the 'data' for the experiments is generated based on specified parameters. |
| Dataset Splits | No | The paper focuses on theoretical analysis and simulations based on defined parameters (e.g., p/n = gamma, n=300, p=600). It does not specify traditional training, validation, or test dataset splits, as these concepts are typically applied to pre-existing datasets rather than parameters for data generation. |
| Hardware Specification | No | The paper does not mention any specific hardware (e.g., CPU, GPU models, or cloud resources) used for running its simulations or experiments. |
| Software Dependencies | No | The paper does not specify any software dependencies with version numbers (e.g., Python, PyTorch, specific libraries or solvers) used for its analysis or simulations. |
| Experiment Setup | Yes | Figure 2: Finite sample prediction risk... We set γ = 2 and (n, p) = (300, 600). Figure 4: We set Ī£w = I and Σβ = Σα x where dx has two point masses on 1 and 5 with probability 3/4 and 1/4 respectively. Left: optimal Ī»; solid lines represents the noiseless case Ļ = 0 and dashed lines represents SNR ξ = 5. Figure 6: Left: dx to have 4 point masses (1, 2, 3, 4) with equal probabilities and dβ with 2 point masses on 1 and 5 with probabilities 3/4 and 1/4, respectively; Right: dx has 2 point masses on 1 and 5 with probabilities 3/4 and 1/4, respectively, and Σβ = Ī£2 x; we set Ī£w = Σα β. Noiseless Ļ = 0. |