You are on page 1of 9

GEOPHYSICS, VOL. 77, NO. 2 (MARCH-APRIL 2012); P. V21–V29, 11 FIGS.

10.1190/GEO2011-0260.1
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

Microseismic data denoising using a 3C group sparsity constrained


time-frequency transform

Ismael Vera Rodriguez1, David Bonar1, and Mauricio Sacchi1

ABSTRACT three components of a single receiver, the extension of the


sparse transform to 3C data is accomplished when the three
Noise contamination is an important problem in microseismic components are considered to share the same sparsity pattern
data processing, due to the low magnitude of the seismic events in the time-frequency plane. Application of the 3C sparse trans-
induced during fluid injection. In this study, a noncoherent noise form to synthetic and real microseismic data sets demonstrate
attenuation technique based on a constrained time-frequency the advantages of this technique when the denoised results
transform is presented. When applied to 1C data, the transform are compared against the original and low-pass filtered version
corresponds to a sparse representation of the microseismic sig- of the noisy data. Furthermore, a comparison of hodograms be-
nal in terms of a dictionary of complex Ricker wavelets. The use tween original, low-pass, and denoised traces shows that the de-
of complex wavelets possesses the advantage that signals with noising process preserves the phase and relative amplitude
arbitrary phase can be represented with enhanced sparsity. A information present in the input data. The benefits of the 3C
synthetic example illustrates the superior performance of the transform are highlighted particularly in cases where the wave
sparse constraint for denoising objectives when compared to arrivals are measured in the three components of a receiver but
the standard least-squares regularization. As the arrival time are only visible in two components due to the prevailing signal-
and frequency content of any wavefront are equivalent in the to-noise ratio.

INTRODUCTION processing methods to attenuate noise is frequency filtering. Fre-


quency filters are effective to attenuate frequencies outside user-
Microseismicity induced during hydraulic injections is character- defined cut-off values. However, signal and noise often share
ized for its small magnitude (Maxwell, 2005; Shemeta and frequency bands which means that part of the signal is also filtered
Anderson, 2010). This is an important reason why microseismic out with the noise, while the noise might not be fully attenuated. A
records often display a low signal-to-noise ratio (S/N). Accuracy more advanced denoising technique requires the use of matched fil-
and reliability of the location and other event attributes derived from ters (Eisner et al., 2008). A matched filter is designed by selecting a
microseismic traces is influenced by this strong noise content microseismic event with high S/N to be used as a “master” or re-
(Maxwell, 2009). Thus, noise attenuation is a desirable step in ference event. Through the crosscorrelation of the master event with
microseismic processing to improve the quality of subsequent other parts of the signal, events with lower S/N, similar source
processes. mechanism and nearby location are “matched” and found. Other
In general, noise can be considered as the part of the measured examples of advanced denoising techniques are found in global
signal that is not of interest and it is usually divided into coherent seismology. For instance, in the analysis of earthquake precursors,
and noncoherent. Two different approaches can be followed to im- Sobolev and Lyubushin (2006) apply a wavelet transform and
prove the S/N in microseismic experiments, either improving the thresholding criterion in the time-frequency plane to facilitate the
acquisition equipment and/or acquisition geometry, or through identification of microseismic events. Similarly, Baig et al.
the use of signal processing techniques. One of the most basic signal (2009) employ time-frequency analysis to denoise seismic noise

Manuscript received by the Editor 19 July 2011; revised manuscript received 1 November 2011; published online 16 February 2012.
1
University of Alberta, Department of Physics and Signal Analysis and Imaging Group (SAIG), Edmonton, Alberta, Canada. E-mail: verarodr@ualberta.ca;
bonar@ualberta.ca; msacchi@ualberta.ca.
© 2012 Society of Exploration Geophysicists. All rights reserved.

V21
V22 Vera Rodriguez et al.
2 3
crosscorrelations. In reflection seismology, Bonar and Sacchi a1
(2010) proposed a method for spectral decomposition to 6 7
6 a2 7
analyze seismic sections. The decomposition technique is based 6 7
6 7
upon the transform of the time domain data into a sparse time- s ¼ ½ W1 W2 W3 ::: WN 6 a 7
6 3 7 ¼ Ra: (3)
frequency map. When implemented for denoising purposes this 6 . 7
6 .. 7
method is equivalent to basis pursuit denoising (BPDN) (Chen 4 5
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

et al., 1998).
aN
In this study, we are interested in attenuating noncoherent or am-
bient noise. Following the exposition by Bonar and Sacchi (2010),
we seek a sparse representation of the microseismic signal in terms Equation 3 is an underdetermined system that accepts an infinite
of a dictionary of complex Ricker wavelets. By imposing sparsity number of solutions. In other words, there are multiple possible re-
and misfit constraints to the decomposition process, we find a de- presentations of s in terms of linear combinations of the columns
noised version of the original signal. An extension of the time- of R. A common approach to solve underdetermined systems is
frequency transform to 3C data is accomplished through the concept through the damped least-squares (DLS) method. Mathematically,
of group sparsity (Yuan and Lin, 2006; Eldar and Bolcskei, 2009). this entails solving the following problem
Group sparsity is a condition that arises naturally in problems such
as linear regression (Yuan and Lin, 2006), brain activity imaging
(Cotter et al., 2005), or the determination of the direction of arrival âl2 ¼ arg min½ks − Rak22 þ λl2 kak22 : (4)
a
in antennas (Hyder and Mahata, 2010). In the field of microseismi-
city analysis, group sparsity has been recently used in the joint de- The cost function inside the square brackets consists of two
termination of origin time, hypocenter location, and moment tensor terms. The first term is the misfit function, a measure of fidelity
of microseismic events (Vera Rodriguez et al., 2010a, 2010b). Con- of fit between the observations and the synthesized data, and the
sidering that the three components of a single receiver share the second term is the l2 regularization term. Minimization of this
same sparsity pattern in the time-frequency plane, a group sparsity cost function yields the DLS solution âl2 . In this case, the solu-
thresholding criteria (Fornasier and Rauhut, 2008) is implemented tion vector of coefficients can be easily found utilizing the meth-
with a fast iterative soft thresholding algorithm (FISTA) (Beck and od of conjugate gradients (CG) (Hestenes and Stiefel, 1952). The
Teboulle, 2009) to simultaneously recover denoised versions of the
trade-off parameter λl2 is used to balance the relative strengths of
three components. The resulting methodology has been tested with
the misfit and regularization terms. An alternative solution to our
synthetic and real microseismic data sets. In all cases, the improve-
problem entails using an l1 regularization term. In other words,
ment in S/N with respect to the input noisy traces is clearly visible in
we will impose a measure to promote sparsity on the coefficients
the reconstructions obtained through the 3C sparse time-frequency
needed to represent the seismogram. This can be accomplished by
transform.
solving

1C SPARSE TIME-FREQUENCY TRANSFORM âl1 ¼ arg min½ks − Rak22 þ λl1 kak1 : (5)
a
The decomposition of a discrete seismic trace s½n in terms of
dilations and translations of a reference complex wavelet w½n is Minimization of the cost function inside square brackets in equa-
given by (Bonar and Sacchi, 2010) tion 5 yields the sparse solution âl1 . The trade-off parameter λl1
balances the fidelity of fit versus the l1 regularization term.
X
s½n ¼ ak ½iwk ½n − i; (1) Sparse solutions from l1 regularization are estimated using the
i;k
fast iterative soft thresholding algorithm (FISTA) (Beck and
Teboulle, 2009) (Appendix A). The denoised version of the input
where the indexes i and k are related, respectively, to the translations data is obtained via
and dilations of w½n, and the complex coefficients ak ½i control the
amplitude and phase of the wavelets. More specifically, the index k ŝ ¼ Râlp ; (6)
reflects a parameter/property of the wavelet that controls its dilation
or frequency content. In a Ricker wavelet, for example, k represents
the central frequency of the wavelet. In matrix-vector notation, where p can be either one or two depending on which regular-
equation 1 can be written as the convolution of a set of complex ization approach we are using. In Figure 1, we provide a syn-
coefficient vectors fak g with a set of dilated versions of the refer- thetic example where we examine the two denoising schemes.
ence wavelet fwk g, In this case, a complex Ricker wavelet is used as the reference
wavelet to create the dictionary R (Figure 2), where the dilation
X
s¼ W k ak ; (2) index k is related to the central frequency that describes each
k wavelet in R. Complex wavelets can be easily computed using
the Hilbert Transform. Notice that, although the construction
where s is the vector representation of the trace s½n and Wk is the of the dictionary requires a fixed reference wavelet, in this case
convolution matrix of the wavelet with dilation index k. To general- a complex Ricker wavelet, the dictionary is not adaptive and can
ize this model, the convolution matrices of a set of N dilated ver- be used to represent a wide range of source signatures in the
sions of the reference wavelet can be incorporated to create a input data through translations and changes in phase and fre-
redundant dictionary R (Mallat, 2008), quency content of the reference wavelet. To produce a fair
Microseismic data denoising V23

assessment of the denoising results, we have determined λl1 and operator (Fornasier and Rauhut, 2008) is introduced into the inver-
λl2 using the χ 2 criterion for the goodness of fit (see e.g., Aster sion routine (see Appendix A). The denoised version of the 3C data
et al., 2005). Representations using DLS tend to be “spread” in is finally recovered via
the solution (Figure 1c), whereas, sparse representations are well
localized and produce cleaner signal estimations using a smaller
number of nonzero coefficients (Figure 1e). For the purpose of
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

denoising, it is clear that sparse representations produce improved Ŝ ¼ RÂl2;1 : (10)


results when compared to DLS representations.
Examples of application of the 1C sparse time-
frequency transform to synthetic and real mi-
croseismic data are also presented in Vera
Rodriguez et al. (2011).

EXTENSION OF THE ALGORITHM


TO THE 3C CASE VIA GROUP
SPARSITY
The extension of the sparse time-frequency
transform to 3C data is accomplished through
the concept of group sparsity (Yuan and Lin,
2006; Eldar and Bolcskei, 2009). Group sparsity
arises naturally in a wide variety of problems, for
example, in microseismic monitoring a method
based in group sparsity has been proposed for
the simultaneous recovery of origin time, hypo-
center location, and seismic moment tensor of
microseismic events (Vera Rodriguez et al.,
2010a, 2010b). In the present application, the
concept of group sparsity is incorporated by con-
sidering the three components of a recording sta-
tion as a multiple measurement vector (MMV)
(e.g., Cotter et al., 2005; Hyder and Mahata,
2010). In other words, it is assumed that the three
components of a single receiver are measuring
the same information (seismic arrivals), even Figure 1. (a) Synthetic trace composed of two arrivals with arbitrary phase and fre-
though the amplitudes of the measurements in quency content. (b) Synthetic trace contaminated with random noise. (c) Inversion re-
each individual component might be different sults using the damped least-squares method (equation 4). (d) Data reconstruction from
the time-frequency map in (c). (e) Inversion results using the l1 -norm regularization
due to the arrival polarizations. The MMV as- (equation 5). (f) Data reconstruction from the time-frequency map in (e).
sembled from the input data is

S ¼ ½x y z ; (7)

where x, y, and z are the recordings from the three components of


one receiver. Hence, the 3C version of equation 5 is

Âl2;1 ¼ arg min½kS − RAk22;2 þ λl2;1 kAk2;1 ; (8)


A

where the columns of Âl2;1 are the sparse representations in the time-
frequency plane of the input three components. The symbol jj · jjp;q
denotes the mixed lp;q -norm defined as (Kowalski and
Torrésani, 2009)

X X  q 1
p q
p
kXkp;q ¼ n k
jxn;k j ; (9)

where xn;k is the nth row, kth column element of the matrix X. To Figure 2. Reference complex Ricker wavelet. Real part (dashed
apply FISTA to MMVs a multidimensional shrinkage-thresholding line), imaginary part (dotted line), and envelope (solid line).
V24 Vera Rodriguez et al.
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

Figure 3. Comparison of denoising results between the 1C and 3C versions of the sparse time-frequency transform. (a) Synthetic 3C data. (b)
Synthetic data contaminated with random noise. (c) Denoised data using the 1C sparse time-frequency transform. (d) Denoised data using the
3C sparse time-frequency transform. H1 and H2 stand for horizontal components.

Both 1C and 3C versions of the sparse transform display compar-


able denoising results when applied to synthetic data (Figure 3). The
real advantage of the 3C approach is evident in cases where the
wave arrival, although present in the three components, is only
above the prevailing noise content in two of them. Using informa-
tion of a single trace at a time does not allow the 1C sparse trans-
form to recover the arrival where it is below the noise level
(Figure 3c). The 3C sparse transform, on the other hand, uses in-
formation from the components where the arrival is above the noise
to identify and reconstruct the arrival in the component where the
noise is dominant (Figure 3d).

APPLICATION TO DENOISE
MICROSEISMIC DATA
The 3C sparse transform is first tested with a synthetic microseis-
mic data set. The synthetic data consists of direct P and S arrivals
computed in a 1D velocity model (Figure 4). The amplitude varia-
tions due to the source radiation pattern are included in the model-
ing, corresponding to a pure double couple source mechanism.
Wave arrivals are computed in a vertical array of eight 3C receivers
located at a horizontal distance of ∼400 m from the source location.
The resulting synthetic data is dominated by the amplitudes of the S
arrivals (Figure 5, first row). The S/N is defined by

μs
S∕N ¼ ; (11)
σn

where σ n is the standard deviation of the noise and μs the expected


signal amplitude. For simplicity, the expected signal value is con-
Figure 4. Velocity model used to compute the synthetic microseis- sidered as the maximum absolute value of the data samples in the
mic data. Triangles denote receiver depth locations. The star de- synthetic traces. Hence, a S∕N ¼ 5 is set in the data (Figure 5, sec-
notes the source depth. ond row). Applying a low-pass filter to the noisy data provides some
Microseismic data denoising V25

visible improvement in the S/N, however, the signal of interest is Notice also that low-pass frequency filters introduce apparent de-
also negatively affected in its frequency content (Figure 5, third lays in the wave arrivals, which are not present in the sparse
row). The denoised version of the data using the 3C sparse trans- transform.
form on the other hand, preserves the frequency content of the wave Figures 7 through 11 present the results of applying the 3C sparse
arrivals while removing also the random noise (Figure 5, fourth transform to a real microseismic data set. The time-frequency maps
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

row). The advantage of denoising using the 3C sparse transform obtained from the sparse transform provide an alternative domain to
is particularly evident in the vertical component of this example, analyze the microseismic data (Figure 7). Furthermore, the time-
where the weak S arrivals in the first six receivers are clearly below frequency analysis of the microseismic traces can reveal variations
the noise level in the noisy data, but are partially recovered by the in the dominant frequency of the wave arrivals in different receivers.
transform (Figure 5, column c). Another important property of In the example shown in Figure 7, the arrivals in the deeper three
the transform is its ability to preserve the amplitude information receivers display a lower dominant frequency than in the rest of the
of the wave arrivals. Amplitude information is important for ana- array of receivers. If time picks were to be obtained from these time-
lyses of, for example, polarization angles and source mechanisms. frequency maps, different frequency slides can be analyzed to
Polarization angles are usually estimated through hodogram determine where the onset of the arrivals is more evident. In this
analysis (Figure 6). Random noise introduces uncertainty in the es- example, the maximum dominant frequency observed in all recei-
timation of polarization angles by causing the hodogram curves to vers for the arrivals around 60 ms is used to low-pass filter the raw
deviate from the actual wave arrival angles (Figure 6, column b). microseismic data (Figures 8b, 9b, 10b, and Figure 11, column b).
The effect of reducing the frequency content in the input data is This frequency filter is not optimum however for the receivers
to smooth the hodogram curves, which can allow a more robust where the dominant frequency is lower. Additionally, the compo-
estimation of polarization angles (Figure 6, column c). Similar ef- nent of the random noise that is below the cut-off frequency of
fects are observed when the traces are denoised using the 3C sparse the filter is preserved in the data. As a consequence, the low-pass
transform (Figure 6, column d), with the advantage that the ampli- filtered traces display higher noise content before the first arrivals
tude and frequency content of the arrivals of interest are preserved. when compared with the traces that were denoised with the sparse

Figure 5. Denoising results in a synthetic micro-


seismic data set. Columns (a), (b), and (c) corre-
spond to the horizontals H1 and H2, and vertical
components, respectively. From top to bottom, the
first row is the synthetic microseismic data. The
second row is the synthetic data contaminated with
random noise. The third row is the noisy data fil-
tered with a low-pass frequency filter. The fourth
row is the denoised data using the 3C sparse trans-
form.
V26 Vera Rodriguez et al.

transform. The presence of cleaner wave arrivals in the denoised between ∼0.9 and ∼1.3 in synthetic and real data examples ana-
traces is of advantage for processes such as automatic time-picking, lyzed in this study. The best λl2;1 value for a particular data set
selection of hodogram windows, and amplitude stacking. After the can be found by constructing a Pareto curve (e.g., Berg and
first arrival, the sparse transform retains the oscillations of the signal Friedlander, 2008) or by simple trial and error. The preservation
that represent potential information. For example, consider the of the amplitude and phase properties of the arrivals in the input
arrival around ∼110 ms in the H1 component of receiver 12
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

data is corroborated by hodogram analysis. Hodogram curves for


(Figure 8). While the arrival is noticeable in the denoised and receiver seven of the real data set (Figure 11) show smoother
low-pass filtered versions of the data, the denoised trace is visibly behavior in the denoised traces than in the original and low-pass
cleaner before the onset. Similarly, the wave arrivals between filtered versions of the data. The low-pass filtered version of the
100 ms and 150 ms of the H2 component (Figure 9) display cleaner data is noisier before the onset of the arrivals and displays a lower
onsets in the denoised results. The amount of noise rejected by the frequency content that is translated into a small delay in the selected
transform is related to the sparsity imposed into the time-frequency time window for hodogram analysis (Figure 11, first row). Cleaner
representation of the data, which is controlled by the trade-off para- onsets in the wave arrivals facilitates the selection of the hodogram
meter λl2;1 . By normalizing the maximum absolute amplitude of the windows improving the results of automatic algorithms that per-
input data to one, the estimated optimum value for λl2;1 oscillated form this task.

Figure 6. Comparison of hodogram analysis for receiver three of the synthetic microseismic data set of Figure 5. Column (a) synthetic data.
Column (b) synthetic data contaminated with random noise. Column (c) noisy data filtered with a low-pass frequency filter. Column (d)
denoised data using the 3C sparse transform. Vertical lines in the first row delimit the hodogram window. Hodogram curves in column
(a) are repeated with dotted lines in columns (b), (c), and (d) for comparison. H1 and H2 stand for horizontal components.
Microseismic data denoising V27
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

Figure 7. (a) Denoised vertical component of the real microseismic


data set. (b) Time-frequency map at central frequency 222 Hz for Figure 9. Denoising results in the horizontal H2 component of the
the data in (a). Time-frequency map at central frequency 297 Hz for real microseismic data. (a) Raw microseismic data. (b) Raw data
the data in (a). filtered with a low-pass frequency filter. (c) Denoised data using
the 3C sparse transform.

Figure 8. Denoising results in the horizontal H1 component of the Figure 10. Denoising results in the vertical component of the real
real microseismic data. (a) Raw microseismic data. (b) Raw data microseismic data. (a) Raw microseismic data. (b) Raw data filtered
filtered with a low-pass frequency filter. (c) Denoised data using with a low-pass frequency filter. (c) Denoised data using the 3C
the 3C sparse transform. sparse transform.
V28 Vera Rodriguez et al.

observed in synthetic and real data show the ef-


ficacy of the method to remove ambient noise
without significantly affecting the frequency
content and/or amplitude of the signal of interest.
Hodogram analyses of the denoised results
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

further verifies the preservation of amplitude


and phase information after the denoising pro-
cess. Additionally, denoised results displayed
cleaner onsets useful for manual or automatic se-
lection of the hodogram windows and higher fre-
quency content than low-pass filtered versions of
the original data. This extra information allowed
the hodogram curves of the denoised data to dis-
play a smooth behavior keeping the main trends
of particle movement. Through the use of the
denoising method in a preprocessing stage,
improved results can be expected in the estima-
tion of polarization angles, automatic time-
picking and amplitude stacking of microseismic
traces.

ACKNOWLEDGMENTS
This work was partially supported by the
sponsors of the Signal Analysis and Imaging
Group (SAIG) and Pinnacle/Halliburton. We also
acknowledge Fairborne Energy Ltd. for provid-
ing the real microseismic data used in this study.
APPENDIX A
FISTA FOR GROUP SPARSE
SOLUTIONS

Figure 11. Comparison of hodogram analysis with real microseismic data (receiver 7). To incorporate the mixed l2;1 -norm within the
Column (a) raw data. Column (b) data filtered with a low-pass frequency filter. Column FISTA algorithm for the purposes of promoting
(c) denoised data using the 3C sparse transform. Vertical lines in the first row delimit the group sparsity, a brief introduction to the iterative
hodogram window. Hodogram curves in column (a) are repeated with dotted lines in soft thresholding function (ISTA) is presented.
columns (b) and (c) for comparison.
The mathematical framework for this algorithm
was first presented in Daubechies et al. (2004).
CONCLUSIONS Minimizing the cost function defined in equa-
tion 5 through a gradient based approach, the ISTA algorithm iter-
Processing of microseismcity records is a challenge due to, ates toward the solution, from an initial estimate a0 , by:
among other problems, the low S/N displayed by the signals of in-
 
terest. Application of techniques and processing tools developed in 1 T
other knowledge areas can be of great advantage to deal with akþ1 ¼ T λl ∕2α ak − R ðs − Rak Þ : (A-1)
1 α
microseismicity records. The method presented in this study is a
time-frequency transform constrained to impose sparsity into the The constant α is chosen to be greater than or equal to the maximum
time-frequency representation of the signal. When compared eigenvalue of RT R to prevent the argument, i.e., gradient step, with-
against the equivalent l2 -norm transform, the sparsity constrained in the operator T τ from becoming a negative function. The operator
transform using the l1 -norm displays improved results for denoising T τ is known as the soft thresholding function and is defined by
purposes. This transform can be implemented component by
component (1C algorithm) or by simultaneously utilizing the three T τ ðgÞ ¼ ðjgj − τÞþ signðgÞ; (A-2)
components of a receiver (3C algorithm). In particular, the 3C ver-
sion of the sparse transform is advantageous in cases where the where g is a scalar value, τ is a value greater than zero, and
wave arrival exists in the three components of a receiver but the ðjgj − τÞþ is the maximum between ðjgj − τÞ and zero. To accelerate
amplitude of the arrival is below the noise level in one component. the convergence rate of ISTA, the FISTA algorithm, developed in
In such cases, the 3C algorithm uses information from the compo- Beck and Teboulle (2009), uses a very specific linear combination
nents where the arrival amplitudes are above the noise level to re- of the previous two points or iterations. Specifically, this combina-
cover the amplitude information in the other component. The results tion at the kth iteration is
Microseismic data denoising V29

 
tk−2 Berg, E., and M. Friedlander, 2009, Probing the pareto frontier for basis
pursuit solutions: SIAM Journal on Scientific Computing, 31, no. 2,
yk ¼ ak−1 þ ðak−1 − ak−2 Þ; (A-3)
tk 890–912, doi: 10.1137/080714488.
Bonar, D., and M. Sacchi, 2010, Complex spectral decomposition via inver-
sion strategies: 80th Annual International Meeting, SEG, Expanded
where the variable tk is defined by Abstracts, 1408–1412.
Chen, S., D. Donoho, and M. Saunders, 1998, Atomic decomposition
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi by basis pursuit: SIAM Journal on Scientific Computing, 20, no. 1,
1þ 1 þ 4t2k−1
Downloaded 02/09/15 to 129.96.252.188. Redistribution subject to SEG license or copyright; see Terms of Use at http://library.seg.org/

33–61, doi: 10.1137/S1064827596304010.


tk ¼ : (A-4)
2 Cotter, S., B. Rao, K. Engan, and K. Kreutz-Delgado, 2005, Sparse solutions
to linear inverse problems with multiple measurement vectors: IEEE
Transactions on Signal Processing, 53, no. 7, 2477–2488, doi:
For the first iteration of the FISTA algorithm y1 is chosen to be a0 10.1109/TSP.2005.849172.
and t1 is chosen to be one. The next iterate of the solution model, Daubechies, I., M. Defrise, and C. De Mol, 2004, An iterative thresholding
akþ1 , is then calculated using yk : algorithm for linear inverse problems with a sparsity constraint: Commu-
nications on Pure and Applied Mathematics, 57, no. 11, 1413–1457, doi:
  10.1002/(ISSN)1097-0312.
1 T Eisner, L., D. Abbott, W. Barker, J. Lakings, and M. Thornton, 2008, Noise
akþ1 ¼ T λl ∕2α y k − R ðs − Ry k Þ : (A-5) suppression for detection and location of microseismic events using a
1 α matched filter: 78th Annual International Meeting, SEG, Expanded
Abstracts, 1431–1435.
By changing the regularization term to the mixed l2;1 -norm as in Eldar, Y. C., and H. Bolcskei, 2009, Block-sparsity: Coherence
and efficient recovery: IEEE International Conference on Acoustics,
equation 8, the derivation of the ISTA algorithm, and by extension Speech, and Signal Processing, IEEE Computer Society,
the FISTA algorithm, is relatively unchanged (Fornasier and 2885–2888.
Fornasier, M., and H. Rauhut, 2008, Recovery algorithms for vector-valued
Rauhut, 2008). The only difference that arises within the ISTA data with joint sparsity constraints: SIAM Journal of Numerical Analysis,
or FISTA algorithm is the definition of the soft thresholding func- 46, no. 2, 577–613, doi: 10.1137/0606668909.
tion. Rather than the soft thresholding operator T τ , a new multidi- Hestenes, M. R., and E. Stiefel, 1952, Methods of conjugate gradients for
solving linear systems: Journal of Research of National Bureau Standards,
mensional soft thresholding operator Gj is required. This new 49, no. 6, 409–436.
multidimensional soft thresholding operator Gτ is defined by Hyder, M., and K. Mahata, 2010, Direction-of-arrival estimation using a
mixed l2;0 norm approximation: IEEE Transactions on Signal Processing,
g 58, no. 9, 4646–4655, doi: 10.1109/TSP.2010.2050477.
Gτ ðgÞ ¼ ðkgk2 − τÞþ (A-6) Kowalski, M., and B. Torrésani, 2009, Sparsity and persistence: Mixed
kgk2 norms provide simple signal models with dependent coefficients: Signal,
Image and Video Processing, 3, no. 3, 251–264, doi: 10.1007/s11760–
008–0076–1.
where g is a vector group. For the purposes of the FISTA algorithm Mallat, S., 2008, A wavelet tour of signal processing: The sparse way,
incorporating group sparsity, each predefined group is thresholded 3rd ed.: Academic Press.
separately using the Gτ soft thresholding operator. Therefore, the Maxwell, S., 2005, A brief guide to passive seismic monitoring: CSEG
National Convention, 177–178.
FISTA algorithm incorporating group sparsity becomes Maxwell, S., 2009, Confidence and accuracy of microseismic images: CSPG
CSEG CWLS Convention, 480–483.
X
N   Shemeta, J., and P. Anderson, 2010, It’s a matter of size: Magnitude and
1 T moment estimates for microseismic data: The Leading Edge, 29,
Akþ1 ¼ Gλl ∕2α Y k;i − R ðSi − RY k;i Þ (A-7)
i¼1
2;1 α 296–302, doi: 10.1190/1.3353726.
Sobolev, G., and A. Lyubushin, 2006, Microseismic impulses as earthquake
precursors: Izvestiya, Physics of the Solid Earth, 42, no. 9, 721–733, doi:
where the index i refers to each separate group of the total N groups, 10.1134/S1069351306090023.
Vera Rodriguez, I., D. Bonar, and M. Sacchi, 2011, Microseismic record
which in the present formulation represents the vectors containing de-noising using a sparse time-frequency transform: 81st Annual
the components x, y, and z for each defined time-frequency International Meeting, SEG, Expanded Abstracts, 1693–1698.
Vera Rodriguez, I., M. Sacchi, and Y. Gu, 2010b, Towards a near real-time
location. system for event hypocenter and source mechanism recovery via compres-
sive sensing: 80th Annual International Meeting, SEG, Expanded
REFERENCES Abstracts, 2140–2145.
Vera Rodriguez, I., M. Sacchi, and Y. Gu, 2010a, Continuous hypocenter
Aster, R., B. Borchers, and C. Thurber, 2005, Parameter estimation and and source mechanism inversion via a green’s function-based matching
inverse problems (International Geophysics): Academic Press. pursuit algorithm: The Leading Edge, 29, 334–337, doi: 10.1190/
Baig, A., M. Campillo, and F. Brenguier, 2009, Denoising seismic noise 1.3353731.
cross correlations: Journal of Geophysical Research, 114, B08310, doi: Yuan, M., and Y. Lin, 2006, Model selection and estimation in regression
10.1029/2008JB006085. with grouped variables: Journal of the Royal Statistical Society: Series B
Beck, A., and M. Teboulle, 2009, A fast iterative shrinkage-thresholding (Statistical Methodology), 68, no. 1, 49–67, doi: 10.1111/j.1467–9868
algorithm for linear inverse problems: SIAM Journal of Imaging Sciences, .2005.00532.x.
2, no. 1, 183–202, doi: 10.1137/080716542.

You might also like