SysRem¶

SysRem is an algorithm to remove systematic effects, defined as those which occur similarly (linearly) across a set of observations such as sets of light curves or spectra. The algorithm was described by Tamuz et al. 2005 (MNRAS 356, 1466) in the context of correcting systematic effects in samples of light curves. Repeated application allows to remove more complex effects but the danger of removing actual signal tends to increase with the number of iteration.

Note

The data (or residuals) are stored such that X[::,0] gives the first observation, i.e., the observation is stored in the first column. The data matrix is, therefore, stored here as the transpose of that adopted for the PCA etc..

Note

Compared to the presentation by Tamuz et al., the roles of a and c are exchanged.

Example: Application to mock data¶

from PyAstronomy import pyasl
import numpy as np
import matplotlib.pylab as plt

# No. of data sets (e.g., light curves)
nds = 50
# No. of data points per data set
nobs = 200

obs, sigs = [], []

# Generate mock data
x = np.linspace(0,1,nobs)-0.5
for i in range(nds):
    # Add noise
    y = np.random.normal(0,0.01+0.0001*i,nobs)
    # Add 3rd degree polynomial
    p = (i, 0.2*i, 0.3*i)
    y += np.polyval(p,x)
    # Add moving Gaussian signal
    y -= 0.02 * np.exp(-(x-(i/nds-0.5))**2/(2*0.02**2))
    # Add growing but stationary Gaussian signal
    y -= (i**2*0.05) * np.exp(-x**2/(2*0.06**2))
    obs.append(y)
    sigs.append(np.ones_like(y)*0.01+0.0001*i)

    plt.plot(x, y, '.-')
plt.title("Mock data")
plt.show()

sr = pyasl.SysRem(obs, sigs)
# First iteration
r, a, c = sr.iterate()
plt.subplot(2,1,1)
plt.title("Residuals after first (top) and second iteration (bottom)")
plt.imshow(r, origin='lower', aspect="auto")
# Second iteration
r, a, c = sr.iterate()
plt.subplot(2,1,2)
plt.imshow(r, origin='lower', aspect="auto")
plt.show()

Example: SYSREM, PCA, and eigenvectors¶

SYSREM, Principle Component Analysis, and eigenvectors (Download notebook)

API¶

Interface class¶

class PyAstronomy.pyasl.SysRem(obs, sigs, ms_obs=True, ms_feat=False, a0=None, ms_warn=True)¶

Implementation of the SysRem algorithm.

SysRem was described by Tamuz et al. 2005 (MNRAS 356, 1466) in the context of correcting systematic effects in samples of light curves, but has been applied in other areas such as planetary atmospheric spectroscopy.

Note

The data (or residuals) are stored such that X[::,0] gives the first observation, i.e., the observation is stored in the first column. The data matrix is, therefore, stored here as the transpose of that adopted for the PCA etc..

Note

Compared to the presentation by Tamuz et al., the roles of a and c are exchanged.

Parameters

obslist of 1d arrays or 2d array: List of observations (e.g., light curves) or 2d array such that obs[::,0] is the first observation.
sigslist of 1d arrays or 2d array: Uncertainties (same format as obs)
a01d array, optional: Starting values for ‘a’ parameter (same length as the observations). If not given, values linearly increasing from 0 to 1 are assumed.
ms_obsboolean, optional: Subtract mean from observations in initial data matrix (columns of data matrix). Default is True.
ms_featboolean, optional: Subtract mean from features in initial data matrix (e.g., spectral bins). Default is False.
ms_warnboolean, optional: If True (default), a warning will be printed if either ms_obs or ms_feat is True so that the data will be manipulated by subtracting mean(s). Set False to suppress warning.

Attributes

rijslist of 2d arrays: List of residual arrays. Each call of the iterate method adds an updated residual array to the list.
aclist of tuples of arrays: The a and c parameters used to obtain the updated residual array from the previous one.

Methods

`get_cumulative_model`()	Get the cumulative model (sum of all available models)
`get_model`([last, a, c])	Get model corresponding to a,c vector
`iterate`([atol, rtol, imax, a0])	A single SysRem iteration: Remove linear systematic effect

get_cumulative_model()¶: Get the cumulative model (sum of all available models)

get_model(last=True, a=None, c=None)¶

Get model corresponding to a,c vector

Parameters

lastboolean, optional: If True (default), the model for the last iteration will be returned (a,c must not be given then).
a, carrays, optional: If given, these will be used to calculate the model (last must be set False)

iterate(atol=0.001, rtol=0, imax=1001, a0=None)¶

A single SysRem iteration: Remove linear systematic effect

Parameters

atol, rtolfloat, optional: Absolute and relative tolerance for a-c iteration required to stop. The pertinent quantity tested is the difference between consecutive models divided by the uncertainties. By default, relative tolerance is ignored.
imaxint: Maximum of iterations. Throws exception if convergence is not reached earlier.
a0array, optional: Starting value for SYSREM model PCA-like component. If not specified, the value defined during instance initialization will be used.

Returns

rij2d array: Updated residual matrix with best-fit model subtracted.
aarray: Best-fit ‘a’ values
carray: Best-fit ‘c’ values

Helper functions¶

PyAstronomy.pyasl.sysrem_iter_c(rij, sigij, a, sigij2=None)¶

Estimate ‘c’

Parameters

rij2d array: Residuals (rij[::,i] = ith of nds observation with nobs data points)
sigij2d array: Uncertainties
a1d array: Current estimate of ‘a’ (length is nobs)
sigij22d array, optional: Square of uncertainties. If given, sigij will be ignores.

Returns

c1d array: Estimate of ‘c’

PyAstronomy.pyasl.sysrem_iter_a(rij, sigij, c, sigij2=None)¶

Estimate ‘a’

Parameters

rij2d array: Residuals (rij[::,i] = ith of nds observation with nobs data points)
sigij2d array: Uncertainties
c1d array: Current estimate of ‘a’ (length is nds)
sigij22d array, optional: Square of uncertainties. If given, sigij will be ignores.

Returns

a1d array: Estimate of ‘a’

PyAstronomy.pyasl.sysrem_data_prepare(obs, sigs, ms_obs=True, ms_feat=False)¶

Construct data sets for SysRem

Parameters

obslist of arrays: Observations
sigslist of arrays, optional: Uncertainties of the observations

Returns

rij2d array: The residuals (mean-subtracted observations) so that rij[::,i] gives the i-th observation
sm2d array: The uncertainties in the same format as rij
a01d array: A dummy guess for the ‘a’ vector (linear between 0 and 1)
ms_obsboolean, optional: Subtract mean from observations (columns of data matrix). Default is True.
ms_featboolean, optional: Subtract mean from features (e.g., spectral bins). Default is False.

PyAstronomy.pyasl.sysrem_update_rij(rij, a, c)¶

Subtract current model from residuals

In matrix notation Tamuz et al. write (Eq. 6) R’ = R - c a^T. The operation here implemented corresponds to (R’ = R - c^T)^T -> R’^T = R^T - a c^T (where ^T is the transpose, R is a matrix and a and c are vectors).

Returns

New rij2d array: Updated residuals

SysRem¶

Example: Application to mock data¶

Example: SYSREM, PCA, and eigenvectors¶

API¶

Interface class¶

Helper functions¶

Table of Contents

Previous topic

Next topic

This Page