reproducibilityindex.ai

Contextually Supervised Source Separation with Application to Energy Disaggregation

Authors: Matt Wytock, J. Kolter

AAAI 2014 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	On smaller datasets which include labels, we demonstrate that contextual supervision improves signiﬁcantly over a reasonable baseline and existing unsupervised methods for source separation. Finally, we analyze the case of 2 loss theoretically and show that recovery of the signal components depends only on cross-correlation between features for different signals, not on correlations between features for the same signal. In this section we evaluate contextual supervision for energy disaggregation on one synthetic dataset and two real datasets.
Researcher Affiliation	Academia	Matt Wytock and J. Zico Kolter Machine Learning Department Carnegie Mellon University Pittsburgh, Pennsylvania 15213
Pseudocode	No	The paper does not contain structured pseudocode or algorithm blocks.
Open Source Code	No	The paper does not provide concrete access to source code for the methodology described.
Open Datasets	Yes	On real data, we begin with a dataset from Pecan Street, Inc. (http://www.pecanstreet.org/) that is relatively small (less than 100 homes), but comes with labeled data allowing us to validate our unsupervised algorithm quantitatively. Finally, we apply the same methodology to disaggregate large-scale smart meter data from Paciﬁc Gas and Electric (PG&E) consisting of over 4000 homes and compare the results of our contextually supervised model to aggregate statistics from survey data.
Dataset Splits	No	The paper mentions segmenting synthetic data (
Hardware Specification	No	The paper does not provide specific hardware details used for running its experiments.
Software Dependencies	No	The paper does not provide specific ancillary software details with version numbers.
Experiment Setup	Yes	In all experiments, we tune the model using hyperparameters that weigh the terms in the optimization objective; in the case of energy disaggregation, the model including hyperparameters and β is shown in Table 3. We set these hyperparameters using a priori knowledge about the relative frequency of each signal over the entire dataset. ... The speciﬁcation of our energy disaggregation model is given in Table 3 we capture the non-linear dependence on temperature with radial-basis functions (RBFs), include a Base category which models energy used as a function of time of day, and featureless Appliance category representing large spikes of energy which do not correspond to any available context. For simplicity, we penalize each category s deviations from the model using 1 loss; but for heating and cooling we ﬁrst multiply by a smoothing matrix Sn (1 s on the diagonal and n super diagonals) capturing the thermal mass inherent in heating and cooling: we expect energy usage to correlate with temperature over a window of time, not immediately. We use gi(yi) and the difference operator to encode our intuition of how energy consumption in each category evolves over time.