Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in Coakley et alK. L. Coakley, T. Snelleman, H. Hoos, and O. E. Gundersen, "The embrace of open science: An analysis of a decade of AI research and 56 800 conference papers," Under Review, 2026..

International Conference on Machine Learning (ICML) - 2014

Documentation Rate of Empirical Papers by Reproducibility Variable

Distribution of Empirical Papers by Number of Documented Variables

Website:

Venue Year Papers
Reproducibility Score Reproducibility Score based on Gundersen et al. (2025). See Methods for details.
Documentation Score Documentation Score is the average score over the seven reproducibility variables for empirical research papers. See Methods for details.
% Empirical Percentage of papers that are empirical research vs theoretical research.
% Industry Percentage of empirical research papers with at least one author from Industry.
Website
ICML 2014 310 0.3 3.0 93.55% 27.59%
Pseudocode
Open Source Code
Open Datasets
Dataset Splits
Hardware Specification
Software Dependencies
Experiment Setup
(Near) Dimension Independent Risk Bounds for Differentially Private Learning 3
A Bayesian Framework for Online Classifier Ensemble 4
A Bayesian Wilcoxon signed-rank test based on the Dirichlet process 4
A Clockwork RNN 3
A Compilation Target for Probabilistic Programming Languages 3
A Consistent Histogram Estimator for Exchangeable Graph Models 5
A Convergence Rate Analysis for LogitBoost, MART and Their Variant 3
A Deep Semi-NMF Model for Learning Hidden Representations 4
A Deep and Tractable Density Estimator 4
A Discriminative Latent Variable Model for Online Clustering 3
A Divide-and-Conquer Solver for Kernel Support Vector Machines 6
A Highly Scalable Parallel Algorithm for Isotropic Total Variation Models 3
A Kernel Independence Test for Random Processes 2
A PAC-Bayesian bound for Lifelong Learning 3
A Physics-Based Model Prior for Object-Oriented MDPs 1
A Single-Pass Algorithm for Efficiently Recovering Sparse Cluster Centers of High-dimensional Data 4
A Statistical Convergence Perspective of Algorithms for Rank Aggregation from Pairwise Data 2
A Statistical Perspective on Algorithmic Leveraging 1
A Unified Framework for Consistency of Regularized Loss Minimizers 0
A Unifying View of Representer Theorems 0
A new Q(lambda) with interim forward view and Monte Carlo equivalence 0
A reversible infinite HMM using normalised random measures 3
Accelerated Proximal Stochastic Dual Coordinate Ascent for Regularized Loss Minimization 1
Active Detection via Adaptive Submodularity 4
Active Learning of Parameterized Skills 1
Active Transfer Learning under Model Shift 3
Adaptive Monte Carlo via Bandit Allocation 3
Adaptivity and Optimism: An Improved Exponentiated Gradient Algorithm 1
Admixture of Poisson MRFs: A Topic Model with Word Dependencies 2
Affinity Weighted Embedding 4
Aggregating Ordinal Labels from Crowds by Minimax Conditional Entropy 4
Agnostic Bayesian Learning of Ensembles 3
Alternating Minimization for Mixed Linear Regression 2
An Adaptive Accelerated Proximal Gradient Method and its Homotopy Continuation for Sparse Optimization 2
An Analysis of State-Relevance Weights and Sampling Distributions on L1-Regularized Approximate Linear Programming Approximation Accuracy 1
An Asynchronous Parallel Stochastic Coordinate Descent Algorithm 4
An Efficient Approach for Assessing Hyperparameter Importance 3
An Information Geometry of Statistical Manifold Learning 2
Anomaly Ranking as Supervised Bipartite Ranking 4
Anti-differentiating approximation algorithms:A case study with min-cuts, spectral, and flow 4
Approximate Policy Iteration Schemes: A Comparison 2
Approximation Analysis of Stochastic Gradient Langevin Dynamics by using Fokker-Planck Equation and Ito Process 0
Asymptotically consistent estimation of the number of change points in highly dependent time series 2
Asynchronous Distributed ADMM for Consensus Optimization 5
Austerity in MCMC Land: Cutting the Metropolis-Hastings Budget 4
Automated inference of point of view from user interactions in collective intelligence venues 4
Bayesian Max-margin Multi-Task Learning with Data Augmentation 3
Bayesian Nonparametric Multilevel Clustering with Group-Level Contexts 3
Bayesian Optimization with Inequality Constraints 2
Beta Diffusion Trees 1
Bias in Natural Actor-Critic Algorithms 1
Boosting multi-step autoregressive forecasts 4
Boosting with Online Binary Learners for the Multiclass Bandit Problem 3
Buffer k-d Trees: Processing Massive Nearest Neighbor Queries on GPUs 6
Circulant Binary Embedding 4
Clustering in the Presence of Background Noise 1
Coding for Random Projections 2
Coherent Matrix Completion 2
Cold-start Active Learning with Robust Ordinal Matrix Factorization 3
Combinatorial Partial Monitoring Game with Linear Feedback and Its Applications 1
Communication-Efficient Distributed Optimization using an Approximate Newton-type Method 3
Compact Random Feature Maps 3
Composite Quantization for Approximate Nearest Neighbor Search 3
Compositional Morphology for Word Representations and Language Modelling 4
Computing Parametric Ranking Models via Rank-Breaking 3
Concentration in unbounded metric spaces and algorithmic stability 0
Concept Drift Detection Through Resampling 4
Condensed Filter Tree for Cost-Sensitive Multi-Label Classification 4
Consistency of Causal Inference under the Additive Noise Model 2
Convergence rates for persistence diagram estimation in Topological Data Analysis 2
Convex Total Least Squares 2
Coordinate-descent for learning orthogonal matrices through Givens rotations 4
Coupled Group Lasso for Web-Scale CTR Prediction in Display Advertising 4
Covering Number for Efficient Heuristic-based POMDP Planning 4
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition 5
Deep AutoRegressive Networks 3
Deep Boosting 4
Deep Generative Stochastic Networks Trainable by Backprop 2
Deep Supervised and Convolutional Generative Stochastic Network for Protein Secondary Structure Prediction 4
Demystifying Information-Theoretic Clustering 1
Densifying One Permutation Hashing via Rotation for Fast Near Neighbor Search 3
Deterministic Anytime Inference for Stochastic Continuous-Time Markov Processes 2
Deterministic Policy Gradient Algorithms 1
Diagnosis determination: decision trees optimizing simultaneously worst and expected testing cost 1
Dimension-free Concentration Bounds on Hankel Matrices for Spectral Learning 1
Discovering Latent Network Structure in Point Process Data 4
Discrete Chebyshev Classifiers 2
Discriminative Features via Generalized Eigenvectors 3
Distributed Representations of Sentences and Documents 3
Distributed Stochastic Gradient MCMC 3
Doubly Stochastic Variational Bayes for non-Conjugate Inference 3
Dual Query: Practical Private Query Release for High Dimensional Data 4
Dynamic Programming Boosting for Discriminative Macro-Action Discovery 3
Effective Bayesian Modeling of Groups of Related Count Time Series 3
Efficient Algorithms for Robust One-bit Compressive Sensing 2
Efficient Approximation of Cross-Validation for Kernel Methods using Bouligand Influence Function 4
Efficient Continuous-Time Markov Chain Estimation 2
Efficient Dimensionality Reduction for High-Dimensional Network Estimation 3
Efficient Gradient-Based Inference through Transformations between Bayes Nets and Neural Nets 2
Efficient Label Propagation 4
Efficient Learning of Mahalanobis Metrics for Ranking 4
Elementary Estimators for High-Dimensional Linear Regression 3
Elementary Estimators for Sparse Covariance Matrices and other Structured Moments 2
Ensemble Methods for Structured Prediction 4
Ensemble-Based Tracking: Aggregating Crowdsourced Structured Time Series Data 4
Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm 2
Estimating Latent-Variable Graphical Models using Moments and Likelihoods 2
Exchangeable Variable Models 4
Exponential Family Matrix Completion under Structural Constraints 1
Factorized Point Process Intensities: A Spatial Analysis of Professional Basketball 1
Fast Allocation of Gaussian Process Experts 5
Fast Computation of Wasserstein Barycenters 4
Fast Multi-stage Submodular Maximization 3
Fast Stochastic Alternating Direction Method of Multipliers 5
Fast large-scale optimization by unifying stochastic gradient and quasi-Newton methods 4
Filtering with Abstract Particles 5
Finding Dense Subgraphs via Low-Rank Bilinear Optimization 2
Finito: A faster, permutable incremental gradient method for big data problems 3
Forward-Backward Greedy Algorithms for General Convex Smooth Functions over A Cardinality Constraint 1
GEV-Canonical Regression for Accurate Binary Class Probability Estimation when One Class is Rare 4
Gaussian Approximation of Collective Graphical Models 1
Gaussian Process Classification and Active Learning with Multiple Annotators 5
Gaussian Process Optimization with Mutual Information 3
Gaussian Processes for Bayesian Estimation in Ordinary Differential Equations 3
GeNGA: A Generalization of Natural Gradient Ascent with Positive and Negative Convergence Results 0
Generalized Exponential Concentration Inequality for Renyi Divergence Estimation 1
Geodesic Distance Function Learning via Heat Flow on Vector Fields 3
Global graph kernels using geometric embeddings 5
Globally Convergent Parallel MAP LP Relaxation Solver using the Frank-Wolfe Algorithm 3
Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization 5
Graph-based Semi-supervised Learning: Realizing Pointwise Smoothness Probabilistically 3
Guess-Averse Loss Functions For Cost-Sensitive Multiclass Boosting 5
Hamiltonian Monte Carlo Without Detailed Balance 2
Hard-Margin Active Linear Regression 1
Heavy-tailed regression with a generalized median-of-means 1
Hierarchical Conditional Random Fields for Outlier Detection: An Application to Detecting Epileptogenic Cortical Malformations 1
Hierarchical Dirichlet Scaling Process 3
Hierarchical Quasi-Clustering Methods for Asymmetric Networks 1
High Order Regularization for Semi-Supervised Learning of Structured Output Problems 3
Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques 3
Inferning with High Girth Graphical Models 2
Influence Function Learning in Information Diffusion Networks 4
Input Warping for Bayesian Optimization of Non-Stationary Functions 2
Joint Inference of Multiple Label Types in Large Networks 2
K-means recovers ICA filters when independent components are sparse 3
Kernel Adaptive Metropolis-Hastings 4
Kernel Mean Estimation and Stein Effect 3
Large-Margin Metric Learning for Constrained Partitioning Problems 3
Large-margin Weakly Supervised Dimensionality Reduction 2
Large-scale Multi-label Learning with Missing Labels 3
Latent Bandits. 2
Latent Confusion Analysis by Normalized Gamma Construction 1
Latent Semantic Representation Learning for Scene Classification 4
Latent Variable Copula Inference for Bundle Pricing from Retail Transaction Data 3
Learnability of the Superset Label Learning Problem 0
Learning Character-level Representations for Part-of-Speech Tagging 4
Learning Complex Neural Network Policies with Trajectory Optimization 2
Learning Graphs with a Few Hubs 3
Learning Latent Variable Gaussian Graphical Models 1
Learning Mixtures of Linear Classifiers 2
Learning Modular Structures from Network Data and Node Variables 4
Learning Ordered Representations with Nested Dropout 2
Learning Polynomials with Neural Networks 1
Learning Sum-Product Networks with Direct and Indirect Variable Interactions 5
Learning Theory and Algorithms for revenue optimization in second price auctions with reserve 4
Learning by Stretching Deep Networks 3
Learning from Contagion (Without Timestamps) 3
Learning the Consistent Behavior of Common Users for Target Node Prediction across Social Networks 3
Learning the Irreducible Representations of Commutative Lie Groups 2
Learning the Parameters of Determinantal Point Process Kernels 2
Learning to Disentangle Factors of Variation with Manifold Interaction 3
Least Squares Revisited: Scalable Approaches for Multi-class Prediction 3
Linear Programming for Large-Scale Markov Decision Problems 2
Linear Time Solver for Primal SVM 6
Linear and Parallel Learning of Markov Random Fields 1
Local Ordinal Embedding 5
Local algorithms for interactive clustering 3
Low-density Parity Constraints for Hashing-Based Discrete Integration 3
Lower Bounds for the Gibbs Sampler over Mixtures of Gaussians 2
Making Fisher Discriminant Analysis Scalable 6
Making the Most of Bag of Words: Sentence Regularization with Alternating Direction Method of Multipliers 5
Marginal Structured SVM with Hidden Variables 3
Marginalized Denoising Auto-encoders for Nonlinear Representations 3
Margins, Kernels and Non-linear Smoothed Perceptrons 1
Max-Margin Infinite Hidden Markov Models 4
Maximum Margin Multiclass Nearest Neighbors 0
Maximum Mean Discrepancy for Class Ratio Estimation: Convergence Bounds and Kernel Selection 3
Memory (and Time) Efficient Sequential Monte Carlo 2
Memory Efficient Kernel Approximation 5
Memory and Computation Efficient PCA via Very Sparse Random Projections 1
Methods of Moments for Learning Stochastic Languages: Unified Presentation and Empirical Comparison 2
Min-Max Problems on Factor Graphs 2
Model-Based Relational RL When Object Existence is Partially Observable 2
Modeling Correlated Arrival Events with Latent Semi-Markov Processes 4
Multi-label Classification via Feature-aware Implicit Label Space Encoding 5
Multi-period Trading Prediction Markets with Connections to Machine Learning 2
Multimodal Neural Language Models 3
Multiple Testing under Dependence via Semiparametric Graphical Models 3
Multiresolution Matrix Factorization 2
Multivariate Maximal Correlation Analysis 2
Narrowing the Gap: Random Forests In Theory and In Practice 4
Near-Optimal Joint Object Matching via Convex Relaxation 3
Near-Optimally Teaching the Crowd to Classify 3
Nearest Neighbors Using Compact Sparse Codes 5
Neural Variational Inference and Learning in Belief Networks 3
Nonlinear Information-Theoretic Compressive Measurement Design 3
Nonmyopic ε-Bayes-Optimal Active Learning of Gaussian Processes 3
Nonnegative Sparse PCA with Provable Guarantees 3
Nonparametric Estimation of Multi-View Latent Variable Models 4
Nonparametric Estimation of Renyi Divergence and Friends 1
Nuclear Norm Minimization via Active Subspace Selection 5
On Measure Concentration of Random Maximum A-Posteriori Perturbations 2
On Modelling Non-linear Topical Dependencies 2
On Robustness and Regularization of Structural Support Vector Machines 5
On learning to localize objects with minimal supervision 3
On p-norm Path Following in Multiple Kernel Learning for Non-linear Feature Selection 5
On the convergence of no-regret learning in selfish routing 2
One Practical Algorithm for Both Stochastic and Adversarial Bandits 2
Online Bayesian Passive-Aggressive Learning 4
Online Clustering of Bandits 4
Online Learning in Markov Decision Processes with Changing Cost Sequences 0
Online Multi-Task Learning for Policy Gradient Methods 2
Online Stochastic Optimization under Correlated Bandit Feedback 1
Optimal Budget Allocation: Theoretical Guarantee and Efficient Algorithm 4
Optimal Mean Robust Principal Component Analysis 3
Optimal PAC Multiple Arm Identification with Applications to Crowdsourcing 3
Optimization Equivalence of Divergences Improves Neighbor Embedding 2
Outlier Path: A Homotopy Algorithm for Robust SVM 4
PAC-inspired Option Discovery in Lifelong Reinforcement Learning 3
Pitfalls in the use of Parallel Inference for the Dirichlet Process 1
Prediction with Limited Advice and Multiarmed Bandits with Paid Observations 1
Preference-Based Rank Elicitation using Statistical Models: The Case of Mallows 1
Preserving Modes and Messages via Diverse Particle Selection 3
Probabilistic Matrix Factorization with Non-random Missing Data 4
Probabilistic Partial Canonical Correlation Analysis 2
Programming by Feedback 3
Provable Bounds for Learning Some Deep Representations 3
Pursuit-Evasion Without Regret, with an Application to Trading 2
Putting MRFs on a Tensor Train 5
Quasi-Monte Carlo Feature Maps for Shift-Invariant Kernels 3
Randomized Nonlinear Component Analysis 5
Rank-One Matrix Pursuit for Matrix Completion 5
Rectangular Tiling Process 3
Recurrent Convolutional Neural Networks for Scene Labeling 5
Reducing Dueling Bandits to Cardinal Bandits 3
Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem 3
Riemannian Pursuit for Big Matrix Recovery 5
Robust Distance Metric Learning via Simultaneous L1-Norm Minimization and Maximization 4
Robust Inverse Covariance Estimation under Noisy Measurements 6
Robust Learning under Uncertain Test Distributions: Relating Covariate Shift to Model Misspecification 3
Robust Principal Component Analysis with Complex Noise 3
Robust RegBayes: Selectively Incorporating First-Order Logic Domain Knowledge into Bayesian Models 2
Robust and Efficient Kernel Hyperparameter Paths with Guarantees 3
Saddle Points and Accelerated Perceptron Algorithms 5
Safe Screening with Variational Inequalities and Its Application to Lasso 2
Sample Efficient Reinforcement Learning with Gaussian Processes 2
Sample-based approximate regularization 4
Scalable Bayesian Low-Rank Decomposition of Incomplete Multiway Tensors 3
Scalable Gaussian Process Structured Prediction for Grid Factor Graph Applications 3
Scalable Semidefinite Relaxation for Maximum A Posterior Estimation 3
Scalable and Robust Bayesian Inference via the Median Posterior 4
Scaling SVM and Least Absolute Deviations via Exact Data Reduction 2
Scaling Up Approximate Value Iteration with Options: Better Policies with Fewer Iterations 2
Scaling Up Robust MDPs using Function Approximation 2
Signal recovery from Pooling Representations 3
Skip Context Tree Switching 3
Sparse Reinforcement Learning via Convex Optimization 3
Sparse meta-Gaussian information bottleneck 4
Spectral Bandits for Smooth Graph Functions 3
Spectral Regularization for Max-Margin Sequence Tagging 3
Spherical Hamiltonian Monte Carlo for Constrained Target Distributions 4
Square Deal: Lower Bounds and Improved Relaxations for Tensor Recovery 1
Stable and Efficient Representation Learning with Nonnegativity Constraints 4
Standardized Mutual Information for Clustering Comparisons: One Step Further in Adjustment for Chance 3
Statistical analysis of stochastic gradient methods for generalized linear models 3
Statistical-Computational Phase Transitions in Planted Models: The High-Dimensional Setting 1
Stochastic Backpropagation and Approximate Inference in Deep Generative Models 3
Stochastic Dual Coordinate Ascent with Alternating Direction Method of Multipliers 4
Stochastic Gradient Hamiltonian Monte Carlo 4
Stochastic Inference for Scalable Probabilistic Modeling of Binary Matrices 5
Stochastic Neighbor Compression 6
Stochastic Variational Inference for Bayesian Time Series Models 2
Structured Generative Models of Natural Source Code 3
Structured Low-Rank Matrix Factorization: Optimality, Algorithm, and Applications to Image Processing 3
Structured Prediction of Network Response 2
Structured Recurrent Temporal Restricted Boltzmann Machines 3
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits 4
The Coherent Loss Function for Classification 4
The Falling Factorial Basis and Its Statistical Applications 2
The Inverse Regression Topic Model 3
The f-Adjusted Graph Laplacian: a Diagonal Modification with a Geometric Interpretation 1
Thompson Sampling for Complex Online Problems 2
Time-Regularized Interrupting Options (TRIO) 2
Topic Modeling using Topics from Many Domains, Lifelong Learning and Big Data 4
Towards End-To-End Speech Recognition with Recurrent Neural Networks 4
Towards Minimax Online Learning with Unknown Time Horizon 1
Towards an optimal stochastic alternating direction method of multipliers 3
Towards scaling up Markov chain Monte Carlo: an adaptive subsampling approach 3
Tracking Adversarial Targets 1
Transductive Learning with Multi-class Volume Approximation 3
True Online TD(lambda) 2
Two-Stage Metric Learning 3
Understanding Protein Dynamics with L1-Regularized Reversible Hidden Markov Models 2
Understanding the Limiting Factors of Topic Modeling via Posterior Contraction Analysis 3
Unimodal Bandits: Regret Lower Bounds and Optimal Algorithms 2
Universal Matrix Completion 1
Variational Inference for Sequential Distance Dependent Chinese Restaurant Process 3
Von Mises-Fisher Clustering Models 3
Wasserstein Propagation for Semi-Supervised Learning 0
Weighted Graph Clustering with Non-Uniform Uncertainties 1